Skip to content

Allow Multiple Data Sources in Custom Metadata Block #287

@Saixel

Description

@Saixel

Background

In the platform, each dataset can have multiple data sources contributing to its final content. Currently, the custom metadata block "Metadata about Data Sources" allows the entry of only a single primary data source. For example, Jen Stowell’s NDVI dataset has five different sources used to derive the final uploaded dataset, but only the primary data source (Landsat satellite) can be entered into the custom metadata block.

Here is the metadata block: https://github.com/IQSS/dataverse.harvard.edu/blob/master/metadatablocks/customCAFEDataSources.tsv

Here is a dataset that uses the block: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ART54Z

Recent example of making fields multiple:

Issue

The current limitation of the custom metadata block to accept only one primary data source means that it cannot capture all relevant information about the various data sources contributing to a dataset. This reduces the accuracy and completeness of the metadata, which can negatively impact the usage and understanding of the dataset by users.

Feature Request

The ability to add multiple data sources in the custom metadata block "Metadata about Data Sources," with clear distinctions between the different sources (e.g., separated with line breaks, etc.).

Justification

Having the ability to add multiple data sources will increase the accuracy and completeness of the metadata, allowing users to better understand the different contributions to the dataset. This is especially important for complex datasets that derive their data from multiple sources.

Implementation Considerations

  • Evaluate how multiple data source entries can be captured and stored in the metadata block.
  • Ensure that the user interface allows for easy and clear entry of multiple data sources.
  • Implement backend changes to effectively store and retrieve these multiple entries.

Additional Context

Jen Stowell’s NDVI dataset is an example where this functionality is important. Currently, only the Landsat satellite can be entered as the primary source, while there are four other sources that should also be captured in the metadata.

Metadata

Metadata

Assignees

Labels

FY25 Sprint 15FY25 Sprint 15 (2025-01-15 - 2025-01-29)FY25 Sprint 16FY25 Sprint 16 (2025-01-29 - 2025-02-12)FY25 Sprint 17FY25 Sprint 17 (2025-02-12 - 2025-02-26)FY25 Sprint 18FY25 Sprint 18 (2025-02-26 - 2025-03-12)FY25 Sprint 19FY25 Sprint 19 (2025-03-12 - 2025-03-26)FY25 Sprint 20FY25 Sprint 20 (2025-03-26 - 2025-04-09)FY25 Sprint 21FY25 Sprint 21 (2025-04-09 - 2025-04-23)FY25 Sprint 22FY25 Sprint 22 (2025-04-23 - 2025-05-07)NIH CAFEIssues associated with the NIH CAFE projectSize: 10A percentage of a sprint.Type: Feature

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions