You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: assets/references/authentication.bib
+11-11Lines changed: 11 additions & 11 deletions
Original file line number
Diff line number
Diff line change
@@ -32,17 +32,17 @@ @unpublished{Pochon2022-hj
32
32
language = {en}
33
33
}
34
34
35
-
@article {Zampirolo2023.12.01.569562,
36
-
author = {Zampirolo, Giulia and Holman, Luke E. and Sawafuji, Rikai and Pt{\'a}kov{\'a}, Michaela and Kova{\v c}ikov{\'a}, Lenka and {\v S}{\'\i}da, Petr and Pokorn{\'y}, Petr and Pedersen, Mikkel Winther and Walls, Matthew},
37
-
title = {Early Pastoralism in Central European Forests: Insights from Ancient Environmental Genomics},
38
-
elocation-id = {2023.12.01.569562},
39
-
year = {2023},
40
-
doi = {10.1101/2023.12.01.569562},
41
-
publisher = {Cold Spring Harbor Laboratory},
42
-
abstract = {Central European forests have been shaped by complex human interactions throughout the Holocene, with significant changes following the introduction of domesticated animals in the Neolithic (\~{}7.5 {\textendash} 6.0 kyr BP). However, understanding early pastoral practices and their impact on forests is limited by methods for detecting animal movement across past landscapes. Here we examine ancient sedimentary DNA (sedaDNA) preserved at the Velk{\'y} Mamu{\v t}{\'a}k rock shelter, in northern Bohemia (Czech Republic), which has been a forested enclave since the early Holocene. We find that domesticated animals, their associated microbiomes, and plants potentially gathered for fodder, have clear representation by the Late Neolithic, around 6.0 kyr BP, and persist throughout the Bronze Age into recent times. We identify a change in dominant grazing species from sheep to pigs in the Bronze Age (\~{}4.1 {\textendash} 3.0 kyr BP) and interpret the impact this had in the mid-Holocene retrogressions that still define the structure of Central European forests today. This study highlights the ability of ancient metagenomics to bridge archaeological and paleoecological methods and provide an enhanced perspective on the roots of the Anthropocene.Competing Interest StatementThe authors have declared no competing interest.},
title={Tracing early pastoralism in Central Europe using sedimentary ancient DNA},
37
+
author={Zampirolo, Giulia and Holman, Luke E and Sawafuji, Rikai and Pt{\'a}kov{\'a}, Michaela and Kova{\v{c}}ikov{\'a}, Lenka and {\v{S}}{\'\i}da, Petr and Pokorn{\`y}, Petr and Pedersen, Mikkel Winther and Walls, Matthew},
@@ -475,9 +475,9 @@ For custom reference genomes not covered by NCBI, their accession IDs and the co
475
475
The `ngsLCA` program considers a chosen similarity interval between each read and its reference in the generated bam/sam file.
476
476
The similarity can be set as an edit distance `[-editdist[min/max]]`, i.e., number of mismatches between the read to reference genome, or as a similarity distance `[-simscore[low/high]]`, i.e., percentage of mismatches between the read to reference genome.
477
477
478
-
The main files produced by this command have the extensions `.bdamage.gz` and `lca.gz`.
478
+
The main files produced by this command have the extensions `.bdamage.gz`, `lca.gz` and 'stat.gz'.
479
479
The first consists of a nucleotide misincorporation matrix (also called **mismatch matrix**) which represents the nucleotide substitution counts across the reads (@tbl-authentication-examplecodetable2).
480
-
The lca file reports the sequence analysed and its taxonomic path, as well as other statistics (gc content, fragment length).
480
+
The lca file reports the sequences analysed and their taxonomic paths, while the stat file includes other statistics (gc content, fragment length).
481
481
482
482
We report an example of the bdamage.gz file output printed using the command`metaDMG-cpp print`:
483
483
@@ -564,16 +564,16 @@ While Pitch-6 and Cave-22 samples, which are 6 and 22 thousand years old and thu
564
564
565
565
### Ancient metagenomic dataset
566
566
567
-
In this section, we will use 6 metagenomic libraries downsampled with eukaryotes reads from the study by [@Zampirolo2023.12.01.569562] (@fig-authentication-fig6).
567
+
In this section, we will use 6 metagenomic libraries downsampled with eukaryotes reads from the study by [@zampirolo2024tracing] (@fig-authentication-fig6).
568
568
The libraries originate from sediment samples of the Velký Mamut'ák rock shelter located in Northern Bohemia (Czech Republic) and covering the period between the Late Neolithic (~6100-5300 cal. BP) to more recent times (800 cal BP).
569
569
570
-
![Screenshot of preprint of the source dataset by [@Zampirolo2023.12.01.569562]](assets/images/chapters/authentication/BioxRiv_paper.png){#fig-authentication-fig6}
570
+
![Screenshot of the study of the source dataset by [@zampirolo2024tracing]](assets/images/chapters/authentication/paper.png){#fig-authentication-fig6}
571
571
572
572
### Ancient metagenomics with metaDMG-cpp: the workflow
573
573
574
574
This section will cover the metaDMG analysis which involve taxonomic classification of the reads starting from sorted SAM files, the damage estimation and compilation of the final metaDMG output.
575
575
576
-
To begin, we can find raw SAM files used as input to `metaDMG` we will use for the exercise are stored in `metadmg`.
576
+
To begin, we can find raw SAM files used as input to `metaDMG` we will use for the exercise are stored in the `metadmg` folder.
577
577
578
578
We also need the taxonomy files, which are in the folder `metadmg/small_taxonomy/`, these include `names.dmp`, `nodes.dmp` and `small_accession2taxid.txt.gz`.
579
579
@@ -582,16 +582,10 @@ We also need the taxonomy files, which are in the folder `metadmg/small_taxonomy
582
582
The best documentation is currently found in the –help function.
583
583
:::
584
584
585
-
We need to activate a dedicated environment for `metaDMG` as it is still under development. We candeactivate the current one with
585
+
metaDMG is installed in the conda environment 'authentication`. If not activated yet, we run
586
586
587
587
```bash
588
-
conda deactivate
589
-
```
590
-
591
-
And we will work with metaDMG by activating the environment with the following command.
592
-
593
-
```bash
594
-
conda activate metaDMG
588
+
conda activate authentication
595
589
```
596
590
597
591
:::{.callout-warning}
@@ -701,16 +695,9 @@ Open the TSV file `concatenated_metaDMGfinal.tsv` in a spreadsheet manner and in
701
695
702
696
We will now investigate the TSV table produced by metaDMG to authenticate damage patterns, visualise the relationship between the damage and the significance, and the degree of damage through depth and time.
703
697
704
-
R packages forthis exercise are locatedinour original conda environment `authentication`.
698
+
R packages for this exercise are located in the same conda environment `authentication`.
705
699
706
-
While still in the `authentication/metadmg/` folder, We deactivate the current conda environment and we re-activate the environment `authentication`.
707
-
708
-
```bash
709
-
conda deactivate
710
-
conda activate authentication
711
-
```
712
-
713
-
We load R by running `R`in your terminal
700
+
While still in the `authentication/metadmg/` folder, we load R by running `R` in your terminal
714
701
715
702
```bash
716
703
R
@@ -729,6 +716,96 @@ library(purrr)
729
716
library(ggpubr)
730
717
```
731
718
719
+
### Amplitude of damage vs Significance
720
+
721
+
We provide an R script to investigate the main statistics.
722
+
723
+
Here we visualise the amplitude of damage (A) and its significance (Zfit), for the full dataset but filtering it to a minimum of 100 reads and at the genus level (@fig-authentication-fig7).
724
+
725
+
```{r eval=F}
726
+
#We load our metaDMG output data (TSV file) and the metadata with information on the age of each sample.
{#fig-authentication-fig7}
772
+
773
+
### Amplitude of damage and mean fragment length through time
774
+
775
+
Here we visualise the amplitude of damage (A) and the mean length of the fragments (mean_rlen) by date (BP) for the filtered dataset with a minimum of 100 reads and at the genus level (@fig-authentication-fig8).
{#fig-authentication-fig8}
808
+
732
809
#### Deamination patterns
733
810
734
811
We run the damage plot to visualise the deamination patterns along forward and reverse strands, and we save the results per each taxon detected in the samples.
We load our metaDMG output data (TSV file) and the metadata with information on the age of each sample. We generate the damage plots as seen in @fig-authentication-fagusovisdmg using the function`get-damage`.
927
+
We generate the damage plots as seen in @fig-authentication-fagusovisdmg using the function `get-damage`.
{#fig-authentication-fagusovisdmg}
916
969
917
-
### Amplitude of damage vs Significance
918
-
919
-
We provide an R script to investigate the main statistics.
920
-
921
-
Here we visualise the amplitude of damage (A) and its significance (Zfit), for the full dataset but filtering it to a minimum of 100 reads and at the genus level (@fig-authentication-fig8).
922
-
923
-
```{r eval=F}
924
-
#Subset dataset animal and plants at the genus level
{#fig-authentication-fig8}
945
-
946
-
### Amplitude of damage and mean fragment length through time
947
-
948
-
Here we visualise the amplitude of damage (A) and the mean length of the fragments (mean_rlen) by date (BP) for the filtered dataset with a minimum of 100 reads and at the genus level (@fig-authentication-fig9).
{#fig-authentication-fig9}
981
-
982
970
::: {.callout-tip}
983
971
Once finished examining the plots you can quit R
984
972
```bash
@@ -1038,7 +1026,8 @@ In addition, we:
1038
1026
1039
1027
## Acknowledgments
1040
1028
1041
-
We thank Mikkel Winther Pedersen and Antonio Fernandez Guerra for their contribution to the development of the `metaDMG` section.
1029
+
We thank Mikkel Winther Pedersen and Antonio Fernandez Guerra for their contribution to the development of the `metaDMG` section.
1030
+
G.Z. would like to thank the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 856488, project SEACHANGE).
0 commit comments