Hi all,
I am running DeepSomatic on an HPC cluster using a Slurm-based parallel workflow. The pipeline completes successfully for the majority of samples (n > 50), but for a small subset the job never finishes: there is no error message and no crash; the job simply remains running indefinitely until it is terminated manually. All samples were preprocessed with exactly the same pipeline and tools.
Command used:
singularity exec \
  -B "${TMPDIR_HOST}":"${TMPDIR_HOST}" \
  -B /usr/lib/locale:/usr/lib/locale \
  -B "${REF_DIR}":"${REF_DIR}" \
  -B "${TUMOR_DIR}":"${TUMOR_DIR}" \
  -B "${OUT_DIR}":"${OUT_DIR}" \
  "$IMAGE" run_deepsomatic \
    --model_type=FFPE_WGS_TUMOR_ONLY \
    --ref="${REF}" \
    --reads_tumor="${BAM}" \
    --output_vcf="${OUT_DIR}/${SAMPLE}.deepsomatic.vcf.gz" \
    --sample_name_tumor="${SAMPLE}" \
    --num_shards=8 \
    --logging_dir="${OUT_DIR}/logs/${SAMPLE}" \
    --use_default_pon_filtering=true \
    --intermediate_results_dir="${OUT_DIR}/logs/${SAMPLE}"
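In case it is useful, this is roughly how I have been trying to narrow down where a stuck sample stalls (a sketch only: `LOG_DIR` stands in for `${OUT_DIR}/logs/${SAMPLE}`, and the `*.log` pattern is my guess at how the per-shard log files are named):

```shell
# Sketch: rank log files in the logging/intermediate directory by
# modification time, oldest first, so a shard whose log stopped
# growing stands out at the top of the list.
stale_logs() {
  # %T@ = mtime as epoch seconds, %p = file path (GNU find)
  find "$1" -type f -name '*.log' -printf '%T@ %p\n' 2>/dev/null | sort -n
}

stale_logs "${LOG_DIR:-./logs}"
```

On the stalled jobs, every log stops growing at the same point, which is why I am unsure which stage to look at next.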
Screenshots of the logs are attached:
Any guidance on how to diagnose or trace where execution stalls would be greatly appreciated.
Thank you very much for your help.
Best regards,
Lipika