This repository was archived by the owner on Jan 27, 2020. It is now read-only.

Commit d4b3a1d

update docs [skip ci]
1 parent 6ce9d62 commit d4b3a1d

4 files changed

Lines changed: 29 additions & 15 deletions

CHANGELOG.md

Lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.
 - [#671](https://github.com/SciLifeLab/Sarek/pull/671) - New `publishDirMode` param and docs
 - [#673](https://github.com/SciLifeLab/Sarek/pull/673), [#675](https://github.com/SciLifeLab/Sarek/pull/675), [#676](https://github.com/SciLifeLab/Sarek/pull/676) - Profiles for BinAC and CFC clusters in Tübingen
 - [#679](https://github.com/SciLifeLab/Sarek/pull/679) - Add container for `CreateIntervalBeds`
-- [#692](https://github.com/SciLifeLab/Sarek/pull/692) - Add AWS iGenomes possibilities (currently under `iGRCh37` and `iGRCh38`)
+- [#692](https://github.com/SciLifeLab/Sarek/pull/692), [#697](https://github.com/SciLifeLab/Sarek/pull/697) - Add AWS iGenomes possibilities (within `conf/igenomes.conf`)
 - [#694](https://github.com/SciLifeLab/Sarek/pull/694) - Add monochrome and grey logos for light or dark background

 ### `Changed`

conf/genomes.config

Lines changed: 4 additions & 2 deletions
@@ -4,10 +4,12 @@
 * -------------------------------------------------
 * Path to reference files
 * -------------------------------------------------
-* Imported under all Nextflow profiles in
+* Imported under Nextflow profiles in
 * nextflow.config
 * -------------------------------------------------
-* Modify to add specific versions of genomes
+* Defines reference genomes, using paths
+* Can be used by any config that customises the base
+* path using $params.genome_base / --genome_base
 * -------------------------------------------------
 */

conf/igenomes.config

Lines changed: 5 additions & 3 deletions
@@ -2,12 +2,14 @@
 * -------------------------------------------------
 * Nextflow config file for Sarek
 * -------------------------------------------------
-* Path to reference files
+* Path to iGenomes reference files
 * -------------------------------------------------
-* Imported under all Nextflow profiles in
+* Imported under Nextflow profiles in
 * nextflow.config
 * -------------------------------------------------
-* Modify to add specific versions of genomes
+* Defines reference genomes, using iGenome paths
+* Can be used by any config that customises the base
+* path using $params.genome_base / --genome_base
 * -------------------------------------------------
 */

docs/REFERENCES.md

Lines changed: 19 additions & 9 deletions
@@ -1,11 +1,13 @@
 # Genomes and reference files

-Sarek currently uses GRCh38 by default. The settings are in `genomes.config`, they can be tailored to your needs.
+Sarek currently uses GRCh38 by default.
+The settings are in `genomes.config`, they can be tailored to your needs.
 The [`buildReferences.nf`](#buildreferencesnf) script is used to build the indexes for the reference test.

 ## GRCh37

-Use `--genome GRCh37` to map against GRCh37. Before doing so and if you are not on UPPMAX, you need to adjust the settings in `genomes.config` to your needs.
+Use `--genome GRCh37` to map against GRCh37.
+Before doing so and if you are not on UPPMAX, you need to adjust the settings in `genomes.config` to your needs.

 ### GATK bundle

@@ -20,21 +22,27 @@ The following files need to be downloaded:

 ### Other files for GRCh37

-From our repo, get the [`intervals` list file](https://raw.githubusercontent.com/SciLifeLab/Sarek/master/repeats/wgs_calling_regions.grch37.list). More information about this file in the [intervals documentation](INTERVALS.md)
+From our repo, get the [`intervals` list file](https://raw.githubusercontent.com/SciLifeLab/Sarek/master/repeats/wgs_calling_regions.grch37.list).
+More information about this file in the [intervals documentation](INTERVALS.md)

 Description of how to generate the Loci file used in the ASCAT process is described [here](https://github.com/SciLifeLab/Sarek/blob/master/docs/ASCAT.md).

 You can create your own cosmic reference for any human reference as specified below in the Cosmic section.

 ## GRCh38

-Use `--genome GRCh38` to map against GRCh38. Before doing so and if you are not on UPPMAX, you need to adjust the settings in `genomes.config` to your needs.
+Use `--genome GRCh38` to map against GRCh38.
+Before doing so and if you are not on UPPMAX, you need to adjust the settings in `genomes.config` to your needs.

-To get the needed files, download the GATK bundle for GRCh38 from [ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/](ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/). You can also download the required files from the Google Cloud mirror link [here](https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0).
+To get the needed files, download the GATK bundle for GRCh38 from [ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/](ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg38/).
+You can also download the required files from the Google Cloud mirror link [here](https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0).

 The MD5SUM of `Homo_sapiens_assembly38.fasta` included in that file is 7ff134953dcca8c8997453bbb80b6b5e.

-If you download the data from the FTP servers `beta/` directory, which seems to be an older version of the bundle, only `Homo_sapiens_assembly38.known_indels.vcf` is needed. Also, you can omit `dbsnp_138_` and `dbsnp_144` files as we use `dbsnp_146`. The old ones also use the wrong chromosome naming convention. The Google Cloud mirror has all data in the `v0` directory, but requires you to remove the `resources_broad_hg38_v0_` prefixes from all files.
+If you download the data from the FTP servers `beta/` directory, which seems to be an older version of the bundle, only `Homo_sapiens_assembly38.known_indels.vcf` is needed.
+Also, you can omit `dbsnp_138_` and `dbsnp_144` files as we use `dbsnp_146`.
+The old ones also use the wrong chromosome naming convention.
+The Google Cloud mirror has all data in the `v0` directory, but requires you to remove the `resources_broad_hg38_v0_` prefixes from all files.

 The following files need to be downloaded:

@@ -68,7 +76,8 @@ You can create your own cosmic reference for any human reference as specified be
 To annotate with COSMIC variants during MuTect1/2 Variant Calling you need to create a compatible VCF file.
 Download the coding and non-coding VCF files from [COSMIC](http://cancer.sanger.ac.uk/cosmic/download) and
 process them with the [Create\_Cosmic.sh](https://github.com/SciLifeLab/Sarek/tree/master/scripts/Create_Cosmic.sh)
-script for either GRCh37 or GRCh38. The script requires a fasta index `.fai`, of the reference file you are using.
+script for either GRCh37 or GRCh38.
+The script requires a fasta index `.fai`, of the reference file you are using.

 Example:

@@ -87,11 +96,12 @@ igvtools index <cosmicvxx.vcf>

 ## smallGRCh37

-Use `--genome smallGRCh37` to map against a small reference genome based on GRCh37. `smallGRCh37` is the default genome for the testing profile (`-profile testing`).
+Use `--genome smallGRCh37` to map against a small reference genome based on GRCh37.
+`smallGRCh37` is the default genome for the testing profile (`-profile testing`).

 ## AWS iGenomes
 Sarek is using [AWS iGenomes](https://ewels.github.io/AWS-iGenomes/), which facilitate storing and sharing references.
-Both `GRCh37` and `GRCh38` are available with `--genome iGRCh37` or `--genome iGRCh38` respectively, it contains all data previously detailed.
+Both `GRCh37` and `GRCh38` are available with `--genome GRCh37` or `--genome GRCh38` respectively with any profile using the `conf/igenomes.config` file (eg.: `awsbatch`), or you can specify it with `-c conf/igenomes.config`, it contains all data previously detailed.

 ## buildReferences.nf
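
Note on the GRCh38 section of `docs/REFERENCES.md` touched above: the text asks readers to check the MD5SUM of `Homo_sapiens_assembly38.fasta` and to remove the `resources_broad_hg38_v0_` prefix from files fetched from the Google Cloud mirror. The following is a minimal Python sketch of those two steps, not part of Sarek or of this commit. The helper names `md5_of` and `strip_mirror_prefix` are hypothetical; only the checksum and the prefix string come from the documentation.

```python
import hashlib
from pathlib import Path

# Values quoted from docs/REFERENCES.md; everything else here is illustrative.
EXPECTED_MD5 = "7ff134953dcca8c8997453bbb80b6b5e"
MIRROR_PREFIX = "resources_broad_hg38_v0_"


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def strip_mirror_prefix(directory, prefix=MIRROR_PREFIX):
    """Rename files in `directory` whose names start with `prefix`.

    Returns the new file names. Refuses to overwrite an existing file.
    """
    renamed = []
    # sorted() materialises the listing before any file is renamed.
    for path in sorted(Path(directory).iterdir()):
        if path.is_file() and path.name.startswith(prefix):
            target = path.with_name(path.name[len(prefix):])
            if target.exists():
                raise FileExistsError(f"refusing to overwrite {target}")
            path.rename(target)
            renamed.append(target.name)
    return renamed
```

Assuming the bundle was downloaded into a directory of your choosing, one would call `strip_mirror_prefix(that_directory)` first and then compare `md5_of(...)` for the renamed `Homo_sapiens_assembly38.fasta` against `EXPECTED_MD5`.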
