Enable gzipped FastA input as reference genome by apeltzer · Pull Request #111 · nf-core/eager

apeltzer · 2018-12-15T21:45:23Z

Adds support for gzipped FastA reference genome input.
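To illustrate what accepting a gzipped reference involves, here is a minimal Python sketch, not the pipeline's actual Nextflow code. It assumes the indexing tools need a plain-text FASTA, so a `.gz` input is decompressed once into a working directory before indexing. The function name `ensure_uncompressed_fasta` and the `workdir` parameter are hypothetical.

```python
import gzip
import shutil
from pathlib import Path


def ensure_uncompressed_fasta(fasta: Path, workdir: Path) -> Path:
    """Return a path to a plain-text FASTA, gunzipping into workdir if needed.

    Hypothetical helper for illustration only; not taken from nf-core/eager.
    """
    if fasta.suffix != ".gz":
        return fasta  # already uncompressed, use as-is
    workdir.mkdir(parents=True, exist_ok=True)
    target = workdir / fasta.stem  # "genome.fa.gz" -> "genome.fa"
    with gzip.open(fasta, "rb") as src, open(target, "wb") as dst:
        # Stream in chunks so a large genome is never held fully in memory.
        shutil.copyfileobj(src, dst)
    return target
```

An uncompressed input is returned unchanged, so downstream steps can call the helper unconditionally.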

PR checklist

This comment contains a description of changes (with reason)
If you've fixed a bug or added code that should be tested, add tests!
Ensure the test suite passes (nextflow run . -profile test,docker).
Make sure your code lints (nf-core lint .).
Documentation in docs is updated
CHANGELOG.md is updated

apeltzer · 2018-12-15T21:45:44Z

This should add the support, following up on #91.

apeltzer · 2018-12-15T22:57:33Z

@jfy133 please review this, then you may request changes and/or merge it :-)

apeltzer · 2018-12-15T22:57:53Z

I would like to stick to the review/merge pattern from now on to keep things protected here :-)

jfy133

The only potential issue I see: if we assume that someone wants compressed FASTAs in the first place, then when the --saveReference flag is used we would want to re-compress the saved FASTA once the reference files are no longer needed in the pipeline.

This would then save disk space when that particular file is not being used - which I guess was my motivation for that feature request.
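As a rough sketch of the re-compression idea (a Python illustration of the suggestion, not something this PR is known to implement), a saved plain-text FASTA could be gzipped and the uncompressed copy removed once nothing else reads it. The function name `recompress_saved_fasta` is hypothetical.

```python
import gzip
import shutil
from pathlib import Path


def recompress_saved_fasta(fasta: Path) -> Path:
    """Gzip a saved plain-text FASTA and delete the uncompressed copy.

    Hypothetical helper for illustration only; assumes no running
    process still needs the uncompressed file.
    """
    compressed = fasta.with_name(fasta.name + ".gz")  # "genome.fa" -> "genome.fa.gz"
    with open(fasta, "rb") as src, gzip.open(compressed, "wb") as dst:
        shutil.copyfileobj(src, dst)  # stream the file in chunks
    fasta.unlink()  # only the .gz copy remains on disk
    return compressed
```

Only the FASTA itself is compressed here; index files would stay uncompressed so they remain directly usable.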

apeltzer · 2018-12-17T12:10:12Z

Hm, I don't get what you mean by this:

Input FastA.gz (solved by this already)
Index creation
Usage in the pipeline
... ?

Zipping the index doesn't make much sense, as we'd have to uncompress it every time we use the index again before running something in a pipeline (which is too much overhead ...).

Or do you mean we should save the indexed reference genome as a compressed zip archive as well?

jfy133 · 2018-12-17T12:30:40Z

> Hm, I don't get what you mean by this:
>
> Input FastA.gz (solved by this already)
> Index creation
> Usage in the pipeline
> ... ?
>
> Zipping the index doesn't make much sense, as we'd have to uncompress it every time we use the index again before running something in a pipeline (which is too much overhead ...).
>
> Or do you mean we should save the indexed reference genome as a compressed zip archive as well?

The latter. But this is still a rare case I imagine.

Simply accepting a gzipped reference the first time it is used is a sufficient purpose for this functionality as implemented here (e.g. genomes downloaded from NCBI are gzipped).

Maybe keep this commit as it is for the moment. If someone else requests a recompressed indexed FASTA we can consider that later.

apeltzer added 4 commits December 15, 2018 22:14

Add Changelog

e4a6b9c

Make gzipped input great again!

67e10ad

Add proper testcase for zipped FastA input

a8257c1

Document me :-)

47ad6f4

Handle unzipping more logically

0d2241d

apeltzer requested a review from jfy133 December 15, 2018 22:13

jfy133 requested changes Dec 17, 2018

Merge branch 'dev' into zip_fasta

16d4412

jfy133 approved these changes Dec 17, 2018

apeltzer merged commit 2a7e70e into nf-core:dev Dec 17, 2018

apeltzer deleted the zip_fasta branch December 17, 2018 12:44

apeltzer merged 6 commits into nf-core:dev from apeltzer:zip_fasta
