Rmats prep PR by akaviaLab · Pull Request #10128 · nf-core/modules

akaviaLab · 2026-02-23T09:55:12Z

PR checklist

This is a PR of rmats/prep.
RMATS processes BAM files from RNAseq and identifies splice junctions used, and differences between groups of samples.
RMATS is composed of 4 stages (that can be run together), but I'm planning to split into steps

Prep which processes BAM files (this PR)
Post, which process the output of prep
Dividing the post files into groups, based on the statistics
Stats - comparing two groups. Since stats can only compare two groups, I will set up stage 3 to read contrasts and set up the groups.

I will then set up a workflow that does all 4.

Eventually, the workflow will close #8699 . Please do not close the issue yet.

This creates the main.nf for rmats.py prep command, and adds tests for multiple parameters that can be given to rmats prep.

modules/nf-core/rmats/prep/optional_parameters

…s and starting a list of optional parameters

SPPearce · 2026-03-09T11:51:40Z

modules/nf-core/rmats/prep/main.nf

+
+    script:
+    def args = task.ext.args ?: ''
+    def prefix = task.ext.prefix ?: "${meta.id}"


If I understand the comment in line 33, This will ensure that files are named with an appropriate default prefix, but it can be overridden:

Suggested change

def prefix = task.ext.prefix ?: "${meta.id}"

def prefix = task.ext.prefix ?: "${meta.id}_prep"

SPPearce · 2026-03-09T11:52:33Z

modules/nf-core/rmats/prep/main.nf

+    // NOTES - post seems to need only the BAM *names*, not the actual files. Could we just get the first line of each file to get the names?
+    // for file in `ls multi_bam_rmats_prep_tmp/*.rmats`; do head -1 $file; done | tr '\n' ','
+    // NOTES - for stats, it should be possible to parse the formula using patsy, but if we include PAIRADISE we might have R - just do this in R, first pass
+    path reference_gtf


This should have a meta.

Do you mean it should be
tuple val(meta), path(reference_gtf)?

Also, should rmats_read_len have a meta? (one line below)

meta2, but yes.
I wouldn't put a meta on the value channel.

SPPearce · 2026-03-09T11:52:53Z

modules/nf-core/rmats/prep/tests/nextflow.config

+            params.novel_splice_site ? "--novelSS" : "",
+            (params.novel_splice_site && params.minimum_intron_length) ? "--mil ${params.minimum_intron_length}" : "",
+            (params.novel_splice_site && params.max_exon_length) ? "--mel ${params.max_exon_length}" : "",


Don't think we should be using params here?

But the documentation states that optional parmeters should be given via ext.args, and the example shows it with params
https://nf-co.re/docs/guidelines/components/modules#optional-command-arguments

How else can I put these optional params?

But this is a config for the nf-test, not for the module itself.

So params.novel_splice_site is not defined at all, and even if you put the module into a pipeline this config won't get used at all anyway, because it is just for nf-test.

There was a long discussion on Slack with @jfy133 about the usage of config files and parameters, right https://nfcore.slack.com/archives/C043FMKUNLB/p1768941551558009 and I thought I was doing what was discussed there.
I plan to have a modules.config for the rmats sub-workflow, which will have one file, but 4 config sections (one for each module). Currently, I added a module.config for this test, just to check that eveything behaves as it should for this task.

@jfy133 - could you please clarify this question for me?

SPPearce · 2026-03-09T11:54:59Z

modules/nf-core/rmats/prep/main.nf

+    // NOTES - post seems to need only the BAM *names*, not the actual files. Could we just get the first line of each file to get the names?
+    // for file in `ls multi_bam_rmats_prep_tmp/*.rmats`; do head -1 $file; done | tr '\n' ','


If you only need the bam names, then you could pass along the ${prefix}.prep.b1.txt file as an output of this module potentially.

Good suggestion, thank you.
I'm going to need to see how rmats post behaves to figure out the best way, but I'll keep it in mind.

mashehu marked this pull request as draft February 24, 2026 11:51

mashehu reviewed Feb 25, 2026

View reviewed changes

modules/nf-core/rmats/prep/optional_parameters Outdated Show resolved Hide resolved

akaviaLab and others added 13 commits March 8, 2026 17:19

Initial commit that passes pre-commit hooks, trying minimal parameter…

256a774

…s and starting a list of optional parameters

Initial version of rmats prep task correctly using args

660c9de

fixed some minor problems

781cffa

updated meta.yml for rmats prep task

017cb04

Added read outcome file to output

e970c26

Added test config with args, and removed some TODOs

7530889

Trying to use topics

98f07ac

fix version section of meta.yml

eae57c3

fix patterns

bea0518

meta.yml that works with topics

6b9964e

Deleted unnecessary file

4f8cd93

rmats prep test works as exepcted for rnasplice files

3e28311

Added tests for all parameters of rmats

a992913

akaviaLab force-pushed the rmats branch from 90fdee0 to a992913 Compare March 8, 2026 17:20

akaviaLab marked this pull request as ready for review March 8, 2026 17:20

Remoevd TODO lines from main.nf

7c7a02d

akaviaLab changed the title ~~Rmats work in progress PR~~ Rmats prep PR Mar 8, 2026

SPPearce reviewed Mar 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rmats prep PR#10128

Rmats prep PR#10128
akaviaLab wants to merge 14 commits intonf-core:masterfrom
akaviaLab:rmats

akaviaLab commented Feb 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

akaviaLab Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

akaviaLab Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

akaviaLab Mar 9, 2026

Uh oh!

SPPearce Mar 9, 2026

Uh oh!

akaviaLab Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	def prefix = task.ext.prefix ?: "${meta.id}"
	def prefix = task.ext.prefix ?: "${meta.id}_prep"

		// NOTES - post seems to need only the BAM names, not the actual files. Could we just get the first line of each file to get the names?
		// for file in `ls multi_bam_rmats_prep_tmp/*.rmats`; do head -1 $file; done \| tr '\n' ','

Conversation

akaviaLab commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR checklist

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

akaviaLab commented Feb 23, 2026 •

edited

Loading