Milestone 2.2.0 #66

charles-plessy · 2025-05-22T01:17:32Z

This PR adds bam and cram to the list of supported export formats. A new subworkflow takes care of ensuring that the genome file is appropriately compressed and indexed for CRAM encoding and to support the user running BAM/CRAM indexing commands later. A sequence dictionary is computed, and will be useful in a future update of last/mafconvert (PR under review).

Other updates preparing the 2.2.0 release will follow, but I thought that it would be useful to review this bunch of commits separately. In particular I welcome critical comments on how I manage the optional run of the subworkflow in workflows/pairgenomealign.nf.

PR checklist

This is in preparation for CRAM support.

2.1.0 release

FASTA_BGZIP_INDEX_DICT_SAMTOOLS is a subworkflow that takes a FASTA file regardless of its compression, and returns it BGZIPped together with index files needed to sort the alignments and a sequence dictionary needed to ensure that alignments of different _queries_ to the same _target_ can be merged later.

Closes #43 Closes #31

Pushing this commit now to trigger a new CI run of the nf-core branch protection. This said, multiqc_assemblyscan_plot_data combines one file per _query_ genome, and removing the `tag` ensure that the list of file names does not clutter the screen when monitoring the pipeline run. The nf-core MultiQC module also does not have a `tag`. Closes #64

jfy133

General minor things:

Missing citations.md entry for SAMTOOLS
Possibly missing diagram update to add SAMTOOLS
Missing reference to SAMTOOLS on README

But otherwise code nice and clean as always, so will give you a premptive approval :)

assets/multiqc_config.yml

docs/output.md

modules/local/multiqc_assemblyscan_plot_data/main.nf

subworkflows/local/fasta_bgzip_index_dict_samtools/main.nf

subworkflows/local/fasta_bgzip_index_dict_samtools/tests/main.nf.test

workflows/pairgenomealign.nf

Co-authored-by: James A. Fellows Yates <[email protected]>

…into milestone_2.2.0

charles-plessy · 2025-05-23T02:28:38Z

Thanks a lot for the very useful comments. I have added credit to Samtools and opened an issue about the diagram (#68)

charles-plessy added 10 commits May 12, 2025 10:43

Install samtools/bgzip and samtools/faidx.

fc14a72

This is in preparation for CRAM support.

Merge pull request #57 from nf-core/dev

9b5fd10

2.1.0 release

Update samtools/bgzip to fix output file name.

149fa02

Install samtools/dict in preparation for CRAM support

afe540c

Add support for BAM/CRAM output.

82f08ae

Closes #43 Closes #31

Mark it 2.2.0dev and document current changes

e46254d

Merge branch 'master' into milestone_2.2.0

6f2c3e7

Merge branch 'master' into milestone_2.2.0

06e9c94

jfy133 approved these changes May 22, 2025

View reviewed changes

charles-plessy and others added 5 commits May 22, 2025 15:34

Clarify what is meant with 'always compressed'

56200ae

Remove extra empty line.

52142f8

Co-authored-by: James A. Fellows Yates <[email protected]>

Add a space.

a4fb9c2

Co-authored-by: James A. Fellows Yates <[email protected]>

Merge branch 'milestone_2.2.0' of github.com:nf-core/pairgenomealign …

5629c97

…into milestone_2.2.0

Credit Samtools

55bb07f

charles-plessy merged commit 0374473 into dev May 23, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Milestone 2.2.0 #66

Milestone 2.2.0 #66

Uh oh!

charles-plessy commented May 22, 2025

Uh oh!

jfy133 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

charles-plessy commented May 23, 2025

Uh oh!

Uh oh!

Uh oh!

Milestone 2.2.0 #66

Milestone 2.2.0 #66

Uh oh!

Conversation

charles-plessy commented May 22, 2025

PR checklist

Uh oh!

jfy133 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

charles-plessy commented May 23, 2025

Uh oh!

Uh oh!

Uh oh!