Skip to content

Commit e4c9aa5

Browse files
committed
Ensure compliance against chapter guidelines
1 parent 4206591 commit e4c9aa5

1 file changed

Lines changed: 10 additions & 5 deletions

File tree

introduction-to-ngs-sequencing.qmd

Lines changed: 10 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,12 @@ bibliography: assets/references/introduction-to-ngs-sequencing.bib
88
Next generation sequencing (NGS) revolutionised biology by providing rapid and cheap access to huge amounts of DNA sequence data.
99
One unexpected benefit of the technology used in Illumina NGS sequencers was that it was also ideal for sequencing ultra-short ancient DNA.
1010

11-
In this chapter, we will go through a brief overview of how DNA is structured, how DNA sequencing works, and how most NGS sequenced DNA sequences are digitally represented.
11+
In this chapter, we will go through:
12+
13+
- A brief overview of how DNA is structured
14+
- How DNA sequencing works
15+
- How most NGS sequenced DNA sequences are digitally represented
16+
1217
Finally we will cover some important considerations of NGS sequencing for ancient metagenomic datasets.
1318

1419
## Basic Concepts
@@ -66,7 +71,7 @@ This concept is important because it is the basis of how DNA sequencing works. W
6671
To understand why specifically _NGS_ sequencing revolutionised the field of palaeogenomics, we need to briefly compare the differences between how we get modern and ancient DNA.
6772

6873
To get DNA from 'modern' samples (i.e., living organisms), first a biological tissue or sample is acquired.
69-
You then typically break down (lyse) the cell membrane and/or walls, to release the molecular contents of the cell [@Danaeifar2022-sk].
74+
We then typically break down (lyse) the cell membrane and/or walls, to release the molecular contents of the cell [@Danaeifar2022-sk].
7075
Extraction protocols then use a variety of enzymes or other mechanisms to degrade the other biomolecules in the cell (e.g., proteins, lipids, RNA) so that they do not 'interfere' with the extraction of the DNA itself.
7176
Finally, the DNA is separated and isolated out from the rest of the now-broken cell contents (purification).
7277

@@ -239,10 +244,10 @@ The process as depicted in @fig-intro-ngs-fig-sequencingbysynthesis can be broke
239244

240245
On Illumina sequencers, the number of repetitions (known as cycles) typically happens either 50, 75, or 125 times, depending on the machine and the type of sequencing chemistry kit.
241246

242-
You can see a small fraction of such a flow cell in @fig-intro-ngs-fig-sbsimagecapture.
247+
We can see a small fraction of such a flow cell in @fig-intro-ngs-fig-sbsimagecapture.
243248
Each coloured dot corresponds to a cluster of DNA molecules.
244249
At each cycle (each photo), a new nucleotide is added to the strand, and a laser is fired to excite the fluorophores.
245-
You can see two different clusters emit different lights, as they are different DNA molecules and thus have different nucleotides at that particular 'cycle' (or position in the sequence) of the 'replication' process.
250+
We can see two different clusters emit different lights, as they are different DNA molecules and thus have different nucleotides at that particular 'cycle' (or position in the sequence) of the 'replication' process.
246251
By converting the emitted light to the known corresponding A, C, G, T, at each photo, we can reconstruct the sequence of the DNA molecule.
247252

248253
![Diagram of sequencing by synthesis. Each frame represents a camera photo taken at the same location on the flow cell. Each coloured dot represents a different cluster of many copies of the same immoblised DNA molecule. By recording the colour change at each location across each frame, the DNA molecule's nucleotide sequence can be reconstructed. Source: EBI Training, [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) via [EBI Training](https://www.ebi.ac.uk/training/online/courses/functional-genomics-ii-common-technologies-and-data-analysis-methods/next-generation-sequencing/second-generation-sequencing/illumina-sequencing/)](assets/images/chapters/intro-to-ngs/fig-intro-ngs-fig-sbsimagecapture.png){#fig-intro-ngs-fig-sbsimagecapture height=300px}
@@ -387,7 +392,7 @@ The rest of a FASTQ file is simply just a repeated set of these four lines.
387392
Each line corresponds to an independent DNA cluster - and thus DNA molecule - that was sequenced.
388393
In the case of Illumina pair-end sequencing, we will normally have two FASTQ files for each sample - and we can match the forward and reverse reading of each strand by the metadata line and a `/1` or `/2` at the end of the ID[^1].
389394

390-
[^1]: You can occasionally see a format called 'interleaved' FASTQ files, where the forward and reverse reads are placed right after one another in the same file, but this is not common practice any more.
395+
[^1]: We can occasionally encounter a format called 'interleaved' FASTQ files, where the forward and reverse reads are placed right after one another in the same file, but this is not common practice any more.
391396

392397
## Sequencing and considerations for ancient metagenomics
393398

0 commit comments

Comments
 (0)