COVID-19 Data

Why does data matter? Why is caution necessary when using data?

Many researchers around the world have dedicated themselves to collecting COVID-19 data. In the beginning, little was known about the basic reproduction number (R0), the case fatality rate, or even the exact number of deaths. For instance, Japan did not want to count the Diamond Princess cases, France did not want to count nursing home infections, and China repeatedly changed its criteria for counting infections and deaths. The temptation of agnosticism can be stronger than the painful effort of scientific inquiry.

However, ARIC members believe that data contains more information than we imagine. We decided to collect data from different sources such as the World Health Organization (WHO), Johns Hopkins University, the European CDC, and Oxford University's policy tracker. We merge these COVID-19 data with socio-economic variables to help researchers analyze the relationship between COVID-19 and politics, the economy, medical resources, or even corruption.

Data is updated every day

SNU ARIC : South Korea COVID.xlsx

Korea Raw Data


Source : Korea Centers for Disease Control & Prevention

1) History

ARIC constructs the dataset from KCDC press releases. ARIC also provides additional datasets that reflect statistical corrections made by KCDC.

2) Variables 

(1) Sheet 1 : Cases in Korea

  • CONFIRM : Cumulative confirmed cases 
  • RELEASE : Cumulative number of people released from quarantine 
  • QUARANT : Number of people quarantined
  • DEATH : Cumulative number of people deceased 
  • TOTAL_TEST : Total tests as the sum of tests in progress, positive and negative tests 
  • UNDER_TEST : Number of tests in progress
  • NEGATIVE : Cumulative number of people with negative test results 
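
The definition of TOTAL_TEST above can be checked row by row, and the cumulative columns can be turned into daily increases. The following is a minimal sketch, not ARIC's own procedure. It assumes that CONFIRM stands in for the number of positive tests, and the sample rows are made-up numbers rather than values from the dataset.

```python
from typing import Dict, List


def total_test_is_consistent(row: Dict[str, int]) -> bool:
    """Check TOTAL_TEST = tests in progress + positive + negative.

    Assumption: CONFIRM is used as the count of positive tests.
    """
    expected = row["UNDER_TEST"] + row["CONFIRM"] + row["NEGATIVE"]
    return row["TOTAL_TEST"] == expected


def daily_increments(cumulative: List[int]) -> List[int]:
    """Turn a cumulative series into day-over-day increases."""
    return [later - earlier for earlier, later in zip(cumulative, cumulative[1:])]


# Made-up rows for illustration only; these are not values from the dataset.
sample_rows = [
    {"CONFIRM": 100, "UNDER_TEST": 50, "NEGATIVE": 850, "TOTAL_TEST": 1000},
    {"CONFIRM": 130, "UNDER_TEST": 40, "NEGATIVE": 1030, "TOTAL_TEST": 1200},
]

all_consistent = all(total_test_is_consistent(row) for row in sample_rows)
new_cases = daily_increments([row["CONFIRM"] for row in sample_rows])
print(all_consistent, new_cases)
```

Rows that fail the check may simply reflect a later statistical correction, so a mismatch is a prompt to look at the corrected sheets rather than proof of an error.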

(2) Sheet 2 : Number of COVID-19 vaccinations by Province 

(3) Sheet 3 : Cases in Korea by Province

(4) Sheet 4 : Cases by gender & age group

(5) Sheet 5 : Cases in Korea_corrected

(6) Sheet 6 : Cases in Korea by Province_corrected

(7) Sheet 7 : Cases by gender & age group_corrected

(8) Sheet 8 : Cases in Seoul by district 

(9) Sheet 9 : Stats correction by KCDC 

3) Notable changes

Starting March 2, the Korean government has released official statistics as of 12 PM each day.

Korean data includes nursing home cases as well as foreigners detected at the airport.

The Seoul Metropolitan Government releases official statistics as of 10 AM each day.

Usable Korea Data

(updated : 2022. 05. 17)

Source: Korea Centers for Disease Control & Prevention

1) History

ARIC converts the raw data above into usable data by adding useful variables.

2) Data : COVID_KOREA

a. Metadata

SNU ARIC : Metadata of Usable Korea Data

b. Preview

covid_korea_n.csv

c. Download

Integrated data for COVID-19

(updated : 2022. 05. 18)

1) Data : COVID_WORLD

On May 23, 2021, ARIC updated the socioeconomic variables that were dated earlier than 2019. For example, GDP per capita for 2017 was replaced with GDP per capita for 2019. For more detail, refer to the metadata.

a. Metadata

SNU ARIC : Metadata of Integrated data

2) Description : Source

(1) Novel Coronavirus (COVID-19) Cases Data 



(2) COVID-19 Cases worldwide (up to 14 December 2020)

  • Variable : JVAR1-JVAR2
  • Source : EU Open Data Portal

(3) Total COVID-19 Tests Performed by Country 

  • Variable : SVAR1-SVAR29
  • Source : Our World in Data, HDX


(4) Stringency of government response


(5) WHO COVID 19 Global Data


(6) Socioeconomic variables

  • Variable : BVAR1-BVAR16, BKVAR1-BKVAR35
  • Source : World Bank (BVAR1-BVAR16), INSCR (BKVAR1-BKVAR3), Varieties of Democracy (BKVAR4-BKVAR8), Transparency International (BKVAR9), Fraser Institute (BKVAR10-BKVAR15), World Justice Project (BKVAR16-BKVAR24), Human Development Index (BKVAR25-BKVAR26), World Governance Indicators (BKVAR27-BKVAR32), Freedom House Index (BKVAR33-BKVAR34), Penn World Table (BKVAR35)
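
Because COVID_WORLD combines case data with socioeconomic variables from several sources, the underlying operation is a join on a country key. The following is a minimal sketch of such a left join using only the Python standard library. The key column `ISO3`, the `CONFIRM` column in the world table, and all sample values are hypothetical; the actual column names are listed in the metadata.

```python
import csv
import io
from typing import Dict, List

# Hypothetical inline samples; real files would be read from disk instead.
CASES_CSV = """ISO3,CONFIRM
KOR,100
FRA,200
JPN,150
"""

SOCIO_CSV = """ISO3,BVAR1
KOR,1.5
FRA,2.5
"""


def left_join(left_text: str, right_text: str, key: str) -> List[Dict[str, str]]:
    """Attach columns from the right table to every row of the left table.

    Left rows without a match are kept and simply lack the right-hand columns.
    """
    right_by_key = {
        row[key]: row for row in csv.DictReader(io.StringIO(right_text))
    }
    merged = []
    for row in csv.DictReader(io.StringIO(left_text)):
        extra = right_by_key.get(row[key], {})
        merged.append({**extra, **row})  # left values win on name clashes
    return merged


merged_rows = left_join(CASES_CSV, SOCIO_CSV, key="ISO3")
for row in merged_rows:
    print(row)
```

A left join keeps every country that has case data even when a socioeconomic source does not cover it, which makes missing coverage visible instead of silently dropping rows.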

3) Data : COVID_GOVMEA

a. Metadata

SNU ARIC : Metadata of ACAPS

(1) ACAPS COVID-19: Government Measures Dataset 

  • Variable : LVAR1-LVAR5
  • Source : ACAPS, HDX

4) Data : COVID_CITY

a. Metadata

COVID-19 Cities

(1) COVID-19: Global Cities Dataset 

  • Variable : PVAR1-PVAR3
  • Source : Please refer to the metadata. 

5) Data : COVID_Vaccine in Asia Countries

a. Metadata

COVID-19 Vaccination

(1) COVID-19: Vaccination in Asia Countries Dataset 

  • Variable : AVAR1-AVAR35
  • Source : Please refer to the metadata.