Skip to content

Adjustments for #76 , rename certain options to be more explicit#81

Merged
apeltzer merged 16 commits intodevfrom
rename_bam_flags
Nov 19, 2018
Merged

Adjustments for #76 , rename certain options to be more explicit#81
apeltzer merged 16 commits intodevfrom
rename_bam_flags

Conversation

@apeltzer
Copy link
Copy Markdown
Member

@apeltzer apeltzer commented Nov 5, 2018

This renames a couple of things to different names, to make sure people understand what these options mean.

cf #76

Namely:

bam_keep_mapped_only ==> bam_analyse_mapped_only

bam_filter_reads ==> bam_retain_all_reads

  • This comment contains a description of changes (with reason)
  • If you've fixed a bug or added code that should be tested, add tests!
  • If necessary, also make a PR on the nf-core/eager branch on the nf-core/test-datasets repo
  • Ensure the test suite passes (nextflow run . -profile test,docker).
  • Make sure your code lints (nf-core lint .).
  • Documentation in docs is updated
  • CHANGELOG.md is updated

Learn more about contributing: https://github.com/nf-core/eager/tree/master/.github/CONTRIBUTING.md

Comment thread docs/usage.md Outdated
### `--bam_analyse_mapped_only`

This can be used to only keep mapped reads for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing.
This can be used to only keep mapped reads in the BAM file for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing.
Copy link
Copy Markdown
Member

@jfy133 jfy133 Nov 5, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe change following line to: 'By default turned off, where all reads are kept in the bam file.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done!

@jfy133
Copy link
Copy Markdown
Member

jfy133 commented Nov 5, 2018

Generally looks good, made a minor suggestion for description.

only think that isn't clear to me now is what if you want to discard unmapped reads entirely?

Unless I misunderstand - I currently see only two cases: 1) where mapped and unmapped reads are retained but in separate BAMs and 2) where mapped and unmapped reads are in the same BAM.

I am thinking for cases where people have storage space issues and want to minimise their data to stuff of interest. (This would be another issue, unless I have misunderstood)

@apeltzer
Copy link
Copy Markdown
Member Author

apeltzer commented Nov 6, 2018

I added an option that can remove the unrequired unmapped reads on user request. By default, its turned off, but of course can be turned on by users if they want to :-)

Also added documentation on that @jfy133 let me know what you think .-)

Comment thread docs/usage.md Outdated
### `--bam_analyse_mapped_only`

This can be used to only keep mapped reads for downstream analysis. By default turned off, all reads are kept in the BAM file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing.
This can be used to only keep mapped reads in the BAM file for downstream analysis. By default turned off, where all reads are kept in the bam file. Unmapped reads are stored both in BAM and FastQ format e.g. for different downstream processing.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Possibly rephrase again for clarity (my bad):
Use only mapped reads in the BAM file for downstream analysis. Unmapped reads are stored in a separate BAM and FASTQ format e.g. for different downstream processing. By default turned off, where default behaviour is to keep both mapped and unmapped reads in the output BAM file.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if it makes sense to instead have a independent flag to save unmapped reads in FASTQ rather than BAM format. In this case we will be make redundancy.

Comment thread docs/usage.md Outdated

This discards all unmapped and extracted reads entirely. By default, this is turned off.

### `--bam_keep_all`
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the difference of this flag verses --bam_retain_all_reads?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

--bam_discard_unmapped_entirely

Removes the unmapped reads file only, the BAM file contains only mapped reads and unmapped reads are entirely discarded (no fastq/bam at all present).

I guess I'll have to give this another proper thought. Right now bam_retain_all_reads just is the thing to turn on BAM filtering in general.

@apeltzer apeltzer merged commit d91226f into dev Nov 19, 2018
@apeltzer apeltzer deleted the rename_bam_flags branch November 19, 2018 14:45
@jfy133 jfy133 mentioned this pull request Nov 20, 2018
@apeltzer apeltzer mentioned this pull request Dec 9, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants