Skip to content

Create gold standard dataset #7

@ksuderman

Description

@ksuderman

Use spreadsheet provided by @tnabtaf to download copies of papers.

  • Is the paper in PMC? Check CSV index provided by @ksuderman
  • Does metadata (doi.org, crossref.org) contain a download link?
  • Can we scrape a URL for a PDF?

Things to determine:

  • How many formats are we dealing with? Does every publisher use a different DTD/schema?

Metadata

Metadata

Labels

taskSomething that needs doing.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions