Text Mining & Analysis
This page is dedicated to the 'Text Mining & Analysis' division of the COVID-19 Biohackathon 2020.
The division aims to carry out rich analyses, build explanatory visualizations and dashboards, and curate and maintain datasets for future scientific research projects.
Ethical and social science considerations related to dealing with and combating the COVID-19 pandemic
Paul Mooney's detailed schema, which breaks down the 'tasks' of the COVID-19 Open Research Dataset Challenge (CORD-19): An AI challenge with AI2, CZI, MSR, Georgetown, NIH & The White House into their smallest significant details, will guide you further.
Twitter data analysis using the dataset at https://zenodo.org/record/3735274:
- Identification of symptoms reported by Twitter users - Quantify how many users are claiming symptoms.
- Identification of people who may have recovered - We only know the number of people who recover in hospitals; what about those outside of them? Are people talking about this on Twitter?
- Sentiment analysis towards particular regulations such as social distancing measures - How are these measures perceived over time in the Twitter space?
- Characterize the information/misinformation around potential COVID-19 treatments using Twitter data
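As one illustration of the first task above, the following minimal Python sketch counts how many distinct users mention a symptom term. It assumes the tweets are already available as records with `user_id` and `text` fields. Those field names, the `SYMPTOM_TERMS` list, and the `users_reporting_symptoms` helper are hypothetical and are not taken from the dataset or from any participant's code.

```python
import re
from collections import defaultdict

# Hypothetical symptom lexicon; a real study would use a curated vocabulary.
SYMPTOM_TERMS = ["fever", "cough", "shortness of breath", "loss of smell"]

# Whole-word, case-insensitive match on any of the terms.
SYMPTOM_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(term) for term in SYMPTOM_TERMS) + r")\b",
    re.IGNORECASE,
)


def users_reporting_symptoms(tweets):
    """Map each symptom term to the set of user IDs whose tweets mention it."""
    users_by_symptom = defaultdict(set)
    for tweet in tweets:
        for match in SYMPTOM_PATTERN.finditer(tweet["text"]):
            users_by_symptom[match.group(1).lower()].add(tweet["user_id"])
    return dict(users_by_symptom)


# Made-up example records (assumed shape: user_id + text).
sample_tweets = [
    {"user_id": "u1", "text": "Day 3 of fever and a dry cough."},
    {"user_id": "u2", "text": "Cough is finally gone!"},
    {"user_id": "u1", "text": "Still have a fever today."},
    {"user_id": "u3", "text": "Stay home, everyone."},
]

# Count distinct users per symptom, not tweets, so repeat posters count once.
counts = {
    symptom: len(users)
    for symptom, users in users_reporting_symptoms(sample_tweets).items()
}
print(counts)
```

Keyword matching of this kind only finds mentions. It cannot tell a self-report from a news headline, a negation, or a recovery message such as the second sample tweet, so a real analysis would need an additional classification step.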
Since the LitCovid (by NCBI) and CORD-19 (by the Allen Institute for AI) datasets were released, many groups have been producing and releasing annotations to these datasets. We have set up an environment to collect and integrate those annotation datasets at PubAnnotation, a public repository of literature annotation, and are organizing collaborative annotation of the COVID-19 literature datasets. Production and collection of various annotation datasets is ongoing, and we aim to release a meaningful amount of rich annotations by the end of the hackathon. Contribution of annotation datasets is completely open, and all contributed annotation datasets are immediately integrated and made accessible in various ways, including search, visualization, and fine-grained access.
- LitCovid : Home | PubAnnotation
- CORD-19 : Home | PubAnnotation
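For contributors preparing an annotation dataset, the sketch below shows roughly what a document with span annotations looks like in the JSON layout PubAnnotation uses (a `text` field plus a `denotations` list of `id`, `span`, and `obj` entries). The dictionary-based tagger, the `annotate` helper, the example sentence, and the labels are all hypothetical; please check the PubAnnotation format documentation before submitting real data.

```python
import json
import re


def annotate(text, lexicon):
    """Tag every lexicon term found in `text` and return a PubAnnotation-style dict."""
    hits = []
    for term, label in lexicon.items():
        for match in re.finditer(re.escape(term), text, flags=re.IGNORECASE):
            hits.append((match.start(), match.end(), label))
    hits.sort()  # order annotations by their position in the text

    denotations = [
        {"id": f"T{i}", "span": {"begin": begin, "end": end}, "obj": label}
        for i, (begin, end, label) in enumerate(hits, start=1)
    ]
    return {"text": text, "denotations": denotations}


# Made-up sentence and labels, for illustration only.
example_text = "SARS-CoV-2 infection can cause fever and cough."
example_lexicon = {"SARS-CoV-2": "Virus", "fever": "Symptom", "cough": "Symptom"}

doc = annotate(example_text, example_lexicon)
print(json.dumps(doc, indent=2))
```

Spans are character offsets into `text`, with `end` exclusive, so `text[begin:end]` recovers the annotated string.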
Join the Slack workspace and head over to the 'text-mining-and-analysis' channel. It will be fun.
# Participants
- Ali Haider Bangash (coordinator)
- Juan M. Banda - Twitter data analysis
- Thanasis Vergoulis
- Ramya Tekumalla
- Jin-Dong Kim - Collaborative Covid-19 literature annotation @ PubAnnotation