-
Notifications
You must be signed in to change notification settings - Fork 88
DSL2: nf-test humanpopgen
#1154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 9 commits
Commits
Show all changes
15 commits
Select commit
Hold shift + click to select a range
ca6c329
Draft to add the nf-test
jbv2 28532db
update
jbv2 39754f4
wrong space
jbv2 9350da0
removing samtools from test, and ignoring geno, ind
jbv2 0bd747d
Merge branch 'dev' into nf-test-humanpopgen
TCLamnidis 5887e39
Merge pull request #1146 from jbv2/nf-test-humanpopgen
TCLamnidis db2f8cf
update snapshot
TCLamnidis a464f53
correct typo
TCLamnidis efb173a
harshil align
TCLamnidis 7e9f34b
Update tests/test_humanpopgen.nf.test
TCLamnidis 41aa559
Merge branch 'dev' into nf-test-humanpopgen
TCLamnidis 70e653d
add profile tag. update instructions
TCLamnidis ff1600a
rename test
TCLamnidis 0365726
update snapshot
TCLamnidis 40bf247
exclude command log from checksum
TCLamnidis File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,140 @@ | ||
| nextflow_pipeline { | ||
|
|
||
| name "Test pipeline: NFCORE_EAGER" | ||
| script "main.nf" | ||
| tag "pipeline" | ||
| tag "nfcore_eager" | ||
| tag "test_humanpopgen" | ||
|
|
||
| test("test_humanmpopgen_profile") { | ||
|
|
||
| when { | ||
| params { | ||
| outdir = "$outputDir" | ||
| } | ||
| } | ||
|
|
||
| then { | ||
|
|
||
| /////////////////// | ||
| // DOCUMENTATION // | ||
| /////////////////// | ||
|
|
||
| // The contents of each top level results directory should be tested with individually named snapshots. | ||
| // Within each snapshot, there should be two to three distinct variables, that contain the files to be tested. | ||
| // - stable_name_<dir> is for files with variable md5sums (i.e. content) so only names will be compared | ||
| // - stable_content_<dir> is for files with stable md5sums (i.e. content) so md5sums will be compared | ||
| // - bams_<dir> is for BAM files, where the headerMD5 is checked for stability (since the content can be unstable) | ||
| // If a directory is fully stable, you can drop `stable_name_*` | ||
| // If a directory contains no BAMs, you can drop `bams_*` | ||
|
|
||
| // Generate with: nf-test test --tag test_humanpopgen --profile docker,test_humanpopgen --update-snapshot | ||
| // Test with: nf-test test --tag test_humanpopgen --profile docker,test_humanpopgen | ||
| // NOTE: BAMs are always only stable in name, because: | ||
| // a) sharding breaks header since the shard that was first is named in the header (Fixed in https://github.com/nf-core/eager/pull/1112) | ||
| // b) the order of the reads in the BAMs is not stable (sorted, but reads that share a start position can be in any order) | ||
| // point b) also causes BAIs to be unstable. | ||
| // c) Merging of multiple BAMs with duplicate @RG / @PG tags can cause the header to be unstable (particularly in the case of shards/lanes) | ||
|
|
||
| ////////////////////// | ||
| // DEFINE VARIABLES // | ||
| ////////////////////// | ||
|
|
||
| // Define exclusion patterns for files with unstable contents | ||
| // NOTE: When a section needs more than a couple of small patterns, consider adding a variable to store the patterns here | ||
| // This is particularly important if the patterns excluded in the stable content section should be included in the stable name section | ||
| def unstable_patterns_auth = [ | ||
| '**/mapped_reads_gc-content_distribution.txt', | ||
| '**/mapped_reads_nucleotide_content.txt', | ||
| '**/genome_gc_content_per_window.png', | ||
| '**/*.{svg,pdf,html,png}', | ||
| '**/DamageProfiler.log', | ||
| '**/3p_freq_misincorporations.txt', | ||
| '**/DNA_comp_genome.txt', | ||
| '**/DNA_composition_sample.txt', | ||
| '**/genome_results.txt', | ||
| '**/Runtime_log.txt', | ||
| '**/3pGtoA_freq.txt', | ||
| '**/5pCtoT_freq.txt', | ||
| '**/dnacomp.txt', | ||
| '**/lgdistribution.txt', | ||
| '**/misincorporation.txt', | ||
| '**/*sexdeterrmine.tsv', | ||
| ] | ||
|
|
||
| // Check that no files are missing/added | ||
| // Command legend: Result directory to index , includeDir: include dirs? , ignore: exclude patterns , ignoreFile: exclude pattern list , include: include patterns | ||
| def stable_name_all = getAllFilesFromDir("$outputDir/" , includeDir: false , ignore: ['pipeline_info/*'] , ignoreFile: null , include: ['*', '**/*'] ) | ||
|
|
||
| // Authentication | ||
| def stable_content_authentication = getAllFilesFromDir("$outputDir/authentication" , includeDir: false , ignore: unstable_patterns_auth , ignoreFile: null , include: ['*', '**/*'] ) | ||
| def stable_name_authentication = getAllFilesFromDir("$outputDir/authentication" , includeDir: false , ignore: null , ignoreFile: null , include: unstable_patterns_auth) | ||
|
|
||
| // Damage manipulation | ||
| def stable_content_damage_manipulation = getAllFilesFromDir("$outputDir/damage_manipulation" , includeDir: false , ignore: ['**/*.{bam,bai}'] , ignoreFile: null , include: ['*', '**/*'] ) | ||
| def stable_name_damage_manipulation = getAllFilesFromDir("$outputDir/damage_manipulation" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // Deduplication | ||
| def stable_content_deduplication = getAllFilesFromDir("$outputDir/deduplication" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.flagstat'] ) | ||
| def stable_name_deduplication = getAllFilesFromDir("$outputDir/deduplication" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // Final_bams | ||
| def stable_content_final_bams = getAllFilesFromDir("$outputDir/final_bams" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.flagstat'] ) | ||
| def stable_name_final_bams = getAllFilesFromDir("$outputDir/final_bams" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // Genotyping | ||
| def stable_content_genotyping = getAllFilesFromDir("$outputDir/genotyping" , includeDir: false , ignore: ['**/*.{geno,ind,snp,tsv}'] , ignoreFile: null , include: ['**/*'] ) | ||
| def stable_name_genotyping = getAllFilesFromDir("$outputDir/genotyping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{geno,ind,snp,tsv}'] ) | ||
|
|
||
| // Mapping (incl. bam_input flasgstat) | ||
| def stable_content_mapping = getAllFilesFromDir("$outputDir/mapping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.flagstat'] ) | ||
| def stable_name_mapping = getAllFilesFromDir("$outputDir/mapping" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // Metagenomics | ||
| def stable_content_metagenomics = getAllFilesFromDir("$outputDir/metagenomics" , includeDir: false , ignore: ['**/*.biom', '**/*table.tsv'], ignoreFile: null , include: ['**/*'] ) | ||
| def stable_name_metagenomics = getAllFilesFromDir("$outputDir/metagenomics" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.biom', '**/*table.tsv'] ) | ||
|
|
||
| // Preprocessing | ||
| // NOTE: FastQC html appears stable, but I worry it might just include a day timestamp instead of a full timestamp. To keep the expression simpler I removed both from checksum testing. | ||
| def stable_content_preprocessing = getAllFilesFromDir("$outputDir/preprocessing" , includeDir: false , ignore: ['**/*.{zip,log,html}'] , ignoreFile: null , include: ['**/*'] ) | ||
| def stable_name_preprocessing = getAllFilesFromDir("$outputDir/preprocessing" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{zip,log,html}'] ) | ||
|
|
||
| // Read filtering | ||
| def stable_content_readfiltering = getAllFilesFromDir("$outputDir/read_filtering" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.flagstat'] ) | ||
| def stable_name_readfiltering = getAllFilesFromDir("$outputDir/read_filtering" , includeDir: false , ignore: null , ignoreFile: null , include: ['**/*.{bam,bai}'] ) | ||
|
|
||
| // MultiQC | ||
| def stable_name_multiqc = getAllFilesFromDir("$outputDir/multiqc" , includeDir: false , ignore: null , ignoreFile: null , include: ['*', '**/*'] ) | ||
|
|
||
| /////////////////////// | ||
| // DEFINE ASSERTIONS // | ||
| /////////////////////// | ||
|
|
||
| assertAll( | ||
| { assert workflow.success }, | ||
| // This checks that there are no missing or additional output files. | ||
| // Also a good starting point to look at all the files in the output folder than need to be checked in subsequent sections. | ||
| { assert snapshot( stable_name_all*.name ).match("all_files") }, | ||
|
|
||
| // Checking changes to contents of each section | ||
| // NOTE: Keep the order of the sections in the alphanumeric order of the output directories. | ||
| // Each section should first check stable_content, stable_name second (if applicable). | ||
| { assert snapshot( stable_content_authentication , stable_name_authentication*.name ).match("authentication") }, | ||
| { assert snapshot( stable_content_damage_manipulation , stable_name_damage_manipulation*.name ).match("damage_manipulation") }, | ||
| { assert snapshot( stable_content_deduplication , stable_name_deduplication*.name ).match("deduplication") }, | ||
| { assert snapshot( stable_content_final_bams , stable_name_final_bams*.name ).match("final_bams") }, | ||
| { assert snapshot( stable_content_genotyping , stable_name_genotyping*.name ).match("genotyping") }, | ||
| // NOTE: The snapshot section for mapping cannot be named 'mapping'. See https://github.com/askimed/nf-test/issues/279 | ||
| { assert snapshot( stable_content_mapping , stable_name_mapping*.name ).match("mapping_output") }, | ||
| { assert snapshot( stable_content_metagenomics , stable_name_metagenomics*.name ).match("metagenomics") }, | ||
| { assert snapshot( stable_content_preprocessing , stable_name_preprocessing*.name ).match("preprocessing") }, | ||
| { assert snapshot( stable_content_readfiltering , stable_name_readfiltering*.name ).match("read_filtering") }, | ||
| { assert snapshot( stable_name_multiqc*.name ).match("multiqc") }, | ||
|
|
||
| // Versions | ||
| { assert new File("$outputDir/pipeline_info/nf_core_eager_software_mqc_versions.yml").exists() }, | ||
|
|
||
| ) | ||
| } | ||
| } | ||
| } | ||
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.