Authorization agreement required

COVID-19 Open Research Dataset (CORD-19)

A Free, Open Resource for the Global Research Community

In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19), a free resource for the global research community. It contains over 29,000 scholarly articles, including over 13,000 with full text, about COVID-19 and the coronavirus family of viruses.

This dataset is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease. The corpus will be updated weekly as new research appears in peer-reviewed publications and in archival services such as bioRxiv and medRxiv.

  • Commercial use subset (includes PMC content) -- 9,000 papers, 186 MB
  • Non-commercial use subset (includes PMC content) -- 1,973 papers, 36 MB
  • PMC custom license subset -- 1,426 papers, 19 MB
  • bioRxiv/medRxiv subset (pre-prints that are not peer reviewed) -- 803 papers, 13 MB
  • Metadata file -- 47 MB
  • Readme
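
As a quick sanity check on the figures above, the subset counts can be tallied to see how they relate to the "over 13,000 with full text" claim. This is only a minimal sketch: the numbers are copied by hand from the list, the sizes are assumed to be megabytes, the subsets are assumed not to overlap, and the names `SUBSETS` and `summarize` are hypothetical helpers, not part of the dataset or any official tooling.

```python
# Figures copied by hand from the subset list: (name, papers, size in MB).
# Assumption: sizes are megabytes and no paper appears in two subsets.
SUBSETS = [
    ("Commercial use subset", 9000, 186),
    ("Non-commercial use subset", 1973, 36),
    ("PMC custom license subset", 1426, 19),
    ("bioRxiv/medRxiv subset", 803, 13),
]


def summarize(subsets):
    """Return (total papers, total size in MB) across all subsets."""
    total_papers = sum(papers for _, papers, _ in subsets)
    total_mb = sum(size for _, _, size in subsets)
    return total_papers, total_mb


total_papers, total_mb = summarize(SUBSETS)
print(f"{total_papers} papers across subsets, about {total_mb} MB")
```

Under those assumptions the four subsets add up to 13,202 papers, which is consistent with the full-text count quoted in the description. The metadata file is not included in the size total.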

Data and Resources

Additional Info

Field         Value
Last Updated  October 11, 2020, 07:15 (CST)
Created       March 19, 2020, 10:19 (CST)


  • Taipei City Government Public Works Department: Projects Undertaken on Behalf of Other Agencies

    Payment instrument Free
    Update frequency Irregular
  • Overseas Community Affairs Council: Statistics on the Number of Audiovisual Productions

    Payment instrument Free
    Update frequency Irregular
  • Jiaxian Weir: Previous Day's Water Diversion Volume

    Payment instrument Free
    Update frequency Irregular
  • Top Five Genes among Enterovirus Isolates, 2001-2019

    Payment instrument Free
    Update frequency Irregular
  • National Taiwan University Medical Imaging: Coronary Artery FFR

    Payment instrument Free
    Update frequency Irregular
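    1. Image content: An invasive physiological measurement that shows the pressure state of the vessel, from which blood flow can be estimated. It can also be used to analyze how a stenotic lesion affects hemodynamics and to estimate the degree of oxygen deprivation, which informs the decision on whether to treat with cardiac catheterization. 2. Data features, use, or value: When encountering morphologically (CAG +/- IVUS or...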