Remove fake subjects from Works #2107

@tfmorris

Description

Three of the top five "subjects" are not subjects at all:

  • Accessible book - 2.5 million
  • Protected DAISY - 1.2 million
  • In library - 0.5 million

There are also some lower-frequency noise terms, such as Lending library and Internet Archive Wishlist, but the three above account for the bulk of the noise.

Expectation

The subject list should contain things which are actually subjects of the work.

Proposal & Constraints

Remove the three subjects above. If they are needed to provide functionality, move them to a hidden portion of the Solr index where they don't pollute the UI.
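
As a rough illustration of the proposal, here is a minimal Python sketch that separates the three noise terms from a work's real subjects. It is not taken from the Open Library codebase: the names `FAKE_SUBJECTS` and `split_subjects` and the shape of the `work` dict are assumptions made for the example. The "fake" list is what could be routed to a hidden field instead of being shown as subjects.

```python
# Noise terms named in this issue. The constant name is hypothetical.
FAKE_SUBJECTS = {"accessible book", "protected daisy", "in library"}


def split_subjects(subjects):
    """Split subjects into (real, fake) lists, preserving input order.

    Matching is case-insensitive and ignores surrounding whitespace.
    """
    real, fake = [], []
    for subject in subjects:
        if subject.strip().lower() in FAKE_SUBJECTS:
            fake.append(subject)
        else:
            real.append(subject)
    return real, fake


# Hypothetical work record, used only for illustration.
work = {"subjects": ["Accessible book", "Whaling", "Protected DAISY", "Sea stories"]}
real, fake = split_subjects(work["subjects"])
print(real)  # ['Whaling', 'Sea stories']
print(fake)  # ['Accessible book', 'Protected DAISY']
```

Keeping the removed terms in a separate list, rather than discarding them, leaves room for any feature that still depends on them to read them from somewhere other than the visible subject list.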

Metadata

Labels

  • Affects: Data - Issues that affect book/author metadata or user/account data. [managed]
  • Lead: @hornc - Issues overseen by Charles (Staff: Data Engineering Lead) [managed]
  • Priority: 3 - Issues that we can consider at our leisure. [managed]
  • State: Work In Progress - This issue is being actively worked on. [managed]
  • Type: Refactor/Clean-up - Issues related to reorganization/clean-up of data or code (e.g. for maintainability). [managed]
