COVID-19 NLP Resources | NLP COVID-19 Workshop
Many resources relevant to NLP for COVID-19 are under development. We list several of them here.
Literature Collections
- The National Library of Medicine (US NIH) LitCovid collection: https://www.ncbi.nlm.nih.gov/research/coronavirus/
- The Elsevier COVID-19 collection: https://www.elsevier.com/connect/coronavirus-information-center
CORD-19 Literature Collection and Kaggle Task
- Kaggle CORD-19 literature dataset: https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
- TREC-COVID information retrieval challenge: https://ir.nist.gov/covidSubmit/
- OHSU information retrieval topics: https://dmice.ohsu.edu/hersh/COVIDSearch.html
- SketchEngine analysis of CORD-19 dataset: https://www.sketchengine.eu/covid19/
- PubAnnotation annotations over CORD-19: http://pubannotation.org/collections/CORD-19
COVID-19 Twitter Collections
- Panacea Lab: http://www.panacealab.org/covid19/
- Public Coronavirus Twitter Dataset (Emily Chen, Kristina Lerman, Emilio Ferrara): https://github.com/echen102/COVID-19-TweetIDs
- English Twitter Data set (Cassandra Jacobs): https://github.com/BayesForDays/coronada
- IEEE Twitter dataset: https://ieee-dataport.org/open-access/corona-virus-covid-19-tweets-dataset
- A collection of COVID-19 tweets in Italian: http://twita.di.unito.it/dataset/40wita