Ensure compliance against chapter guidelines

jfy133 · jfy133 · commit e4c9aa56e0be · 2025-11-25T16:44:48.000+01:00
diff --git a/introduction-to-ngs-sequencing.qmd b/introduction-to-ngs-sequencing.qmd
@@ -8,7 +8,12 @@ bibliography: assets/references/introduction-to-ngs-sequencing.bib
 Next generation sequencing (NGS) revolutionised biology by providing rapid and cheap access to huge amounts of DNA sequence data. 
 One unexpected benefit of the technology used in Illumina NGS sequencers was that it was also ideal for sequencing ultra-short ancient DNA.
 
-In this chapter, we will go through a brief overview of how DNA is structured, how DNA sequencing works, and how most NGS sequenced DNA sequences are digitally represented.
+In this chapter, we will go through:
+
+- A brief overview of how DNA is structured
+- How DNA sequencing works
+- How most NGS sequenced DNA sequences are digitally represented
+
 Finally we will cover some important considerations of NGS sequencing for ancient metagenomic datasets.
 
 ## Basic Concepts
@@ -66,7 +71,7 @@ This concept is important because it is the basis of how DNA sequencing works. W
 To understand why specifically _NGS_ sequencing revolutionised the field of palaeogenomics, we need to briefly compare the differences between how we get modern and ancient DNA.
 
 To get DNA from 'modern' samples (i.e., living organisms), first a biological tissue or sample is acquired.
-You then typically break down (lyse) the cell membrane and/or walls, to release the molecular contents of the cell [@Danaeifar2022-sk].
+We then typically break down (lyse) the cell membrane and/or walls, to release the molecular contents of the cell [@Danaeifar2022-sk].
 Extraction protocols then use a variety of enzymes or other mechanisms to degrade the other biomolecules in the cell (e.g., proteins, lipids, RNA) so that they do not 'interfere' with the extraction of the DNA itself.
 Finally, the DNA is separated and isolated out from the rest of the now-broken cell contents (purification).
 
@@ -239,10 +244,10 @@ The process as depicted in @fig-intro-ngs-fig-sequencingbysynthesis can be broke
 
 On Illumina sequencers, the number of repetitions (known as cycles) typically happens either 50, 75, or 125 times, depending on the machine and the type of sequencing chemistry kit.
 
-You can see a small fraction of such a flow cell in @fig-intro-ngs-fig-sbsimagecapture.
+We can see a small fraction of such a flow cell in @fig-intro-ngs-fig-sbsimagecapture.
 Each coloured dot corresponds to a cluster of DNA molecules.
 At each cycle (each photo), a new nucleotide is added to the strand, and a laser is fired to excite the fluorophores.
-You can see two different clusters emit different lights, as they are different DNA molecules and thus have different nucleotides at that particular 'cycle' (or position in the sequence) of the 'replication' process.
+We can see two different clusters emit different lights, as they are different DNA molecules and thus have different nucleotides at that particular 'cycle' (or position in the sequence) of the 'replication' process.
 By converting the emitted light to the known corresponding A, C, G, T, at each photo, we can reconstruct the sequence of the DNA molecule.
 
 ![Diagram of sequencing by synthesis. Each frame represents a camera photo taken at the same location on the flow cell. Each coloured dot represents a different cluster of many copies of the same immoblised DNA molecule. By recording the colour change at each location across each frame, the DNA molecule's nucleotide sequence can be reconstructed. Source: EBI Training, [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) via [EBI Training](https://www.ebi.ac.uk/training/online/courses/functional-genomics-ii-common-technologies-and-data-analysis-methods/next-generation-sequencing/second-generation-sequencing/illumina-sequencing/)](assets/images/chapters/intro-to-ngs/fig-intro-ngs-fig-sbsimagecapture.png){#fig-intro-ngs-fig-sbsimagecapture height=300px}
@@ -387,7 +392,7 @@ The rest of a FASTQ file is simply just a repeated set of these four lines.
 Each line corresponds to an independent DNA cluster - and thus DNA molecule - that was sequenced.
 In the case of Illumina pair-end sequencing, we will normally have two FASTQ files for each sample - and we can match the forward and reverse reading of each strand by the metadata line and a `/1` or `/2` at the end of the ID[^1].
 
-[^1]: You can occasionally see a format called 'interleaved' FASTQ files, where the forward and reverse reads are placed right after one another in the same file, but this is not common practice any more. 
+[^1]: We can occasionally encounter a format called 'interleaved' FASTQ files, where the forward and reverse reads are placed right after one another in the same file, but this is not common practice any more. 
 
 ## Sequencing and considerations for ancient metagenomics