Skip to content
Mark Wilkinson edited this page Mar 25, 2020 · 30 revisions

Fair Data

As an initial remark about data sharing: SARS-CoV-2 genomes are sequenced by a variety of different institutions, who submit their results to GISAID.org. From there, these data are only accessible after making a user account and then clicking through the UI to get the record you want. Simply fetching all genomes (it's only a few hundred, and they're 30k bases each so it's not a huge set) is currently not possible at all, let alone via an API.

Communication

Coming.

Participants

  • Mark Wilkinson
  • Stian Soiland-Reyes
  • Philippe Rocca-Serra
  • Susanna-Assunta Sansone
  • Lynn Schriml

Ideas

Repackage SARS-CoV-2 sequences

FAIRify (add metadata, identifiers, etc) reproducible research

For instance describe/package as an RO-Crate:

Clone this wiki locally