-
-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Remove fake subjects from Works #2107
Copy link
Copy link
Closed
Labels
Affects: DataIssues that affect book/author metadata or user/account data. [managed]Issues that affect book/author metadata or user/account data. [managed]Lead: @horncIssues overseen by Charles (Staff: Data Engineering Lead) [managed]Issues overseen by Charles (Staff: Data Engineering Lead) [managed]Priority: 3Issues that we can consider at our leisure. [managed]Issues that we can consider at our leisure. [managed]State: Work In ProgressThis issue is being actively worked on. [managed]This issue is being actively worked on. [managed]Type: Refactor/Clean-upIssues related to reorganization/clean-up of data or code (e.g. for maintainability). [managed]Issues related to reorganization/clean-up of data or code (e.g. for maintainability). [managed]
Metadata
Metadata
Assignees
Labels
Affects: DataIssues that affect book/author metadata or user/account data. [managed]Issues that affect book/author metadata or user/account data. [managed]Lead: @horncIssues overseen by Charles (Staff: Data Engineering Lead) [managed]Issues overseen by Charles (Staff: Data Engineering Lead) [managed]Priority: 3Issues that we can consider at our leisure. [managed]Issues that we can consider at our leisure. [managed]State: Work In ProgressThis issue is being actively worked on. [managed]This issue is being actively worked on. [managed]Type: Refactor/Clean-upIssues related to reorganization/clean-up of data or code (e.g. for maintainability). [managed]Issues related to reorganization/clean-up of data or code (e.g. for maintainability). [managed]
Description
Three of the top five "subjects" are not subjects at all:
Accessible book- 2.5 millionProtected DAISY- 1.2 millionIn library- 0.5 millionThe are also some lower frequency noise terms like
Lending libraryandInternet Archive Wishlistbut the three above represent the bulk of the noise.Expectation
The subject list should contain things which are actually subjects of the work.
Proposal & Constraints
Remove the three subjects above. If they're needed to provide functionality move them to a hidden portion of the Solr index where they don't pollute the UI.