Skip to content

Cannot simulate transcriptomic reads with intron retention #253

@SimonHegele

Description

@SimonHegele

Hello,

I was trying to simulate transcriptomic nanopore reads.

First I created a model for the mouse using the following command:
read_analysis.py transcriptome -i reads/long/SRR11517552.fastq -rg reference/genome.fna -rt reference/transcriptome.fna -annot reference/annotation.gff -o reads/long/SRR11517552_nanosim/model -t 128
Organism: mus musculus
Genome: GRCm39
Reads: SRR11517552 (from NCBI-SRA, single-cell transcriptomic, blastocyst)

I then tried to simulate reads using the model and an expression profile I created with Kallisto from Illumina short reads I found in the NCBI-SRA ( PRJNA375882) using the following command:
simulator.py transcriptome -rt reference/transcriptome2.fna -rg reference/genome.fna -e reads/short/abundance_nanosim.tsv -c reads/long/SRR11517552_nanosim/model -o reads/long/nanosim -t 128

NanoSim reads all input files but gets stuck at the simulation step. If I add the --no_model_ir parameter reads are simulated.
The issue may originate from the modelling step as the training files look suspicious.

model_IR_info.txt
model_IR_markov_model.txt

Best,
simon

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions