COVID-19 Data
Why does data matter? Why is caution necessary when using data?
Many researchers around the world have dedicated themselves to collecting COVID-19 data. In the beginning, little was known about the basic reproduction number (R0), the case fatality rate, or even the exact number of deaths. For instance, Japan did not want to count the Diamond Princess cases, France did not want to count nursing home infections, and China repeatedly changed its criteria for counting infections and deaths. Under such uncertainty, the temptation of agnosticism can be stronger than the painful effort of scientific inquiry.
However, ARIC members believe that data contains more information than we imagine. We decided to collect data from different sources such as the WHO, Johns Hopkins University, the European CDC, Oxford University's policy tracker, etc. We merge these COVID-19 related data with socio-economic variables to help researchers analyze the relationship between COVID-19 and politics, the economy, medical resources, or even corruption.
Data is updated every day
Korea Raw Data
Source : Korea Centers for Disease Control & Prevention
1) History
ARIC constructs the dataset based on the press releases of KCDC. ARIC also provides additional datasets that reflect statistical corrections by KCDC.
2) Variables
(1) Sheet 1 : Cases in Korea
- CONFIRM : Cumulative confirmed cases
- RELEASE : Cumulative number of people released from quarantine
- QUARANT : Number of people quarantined
- DEATH : Cumulative number of people deceased
- TOTAL_TEST : Total number of tests, i.e. the sum of tests in progress, positive results, and negative results
- UNDER_TEST : Number of tests in progress
- NEGATIVE : Cumulative number of people with negative test results
(2) Sheet 2 : Number of COVID-19 vaccinations by Province
(3) Sheet 3 : Cases in Korea by Province
(4) Sheet 4 : Cases by gender & age group
(5) Sheet 5 : Cases in Korea_corrected
(6) Sheet 6 : Cases in Korea by Province_corrected
(7) Sheet 7 : Cases by gender & age group_corrected
(8) Sheet 8 : Cases in Seoul by district
(9) Sheet 9 : Stats correction by KCDC
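The definition of TOTAL_TEST in Sheet 1 suggests a simple consistency check that users may want to run before analysis. The sketch below is only an illustration: it assumes that "positive tests" corresponds to CONFIRM, that each row has been loaded as a dictionary of integers, and that a DATE field exists (a hypothetical name). The numbers are made up and are not KCDC figures.

```python
def total_test_mismatches(rows):
    """Return rows where TOTAL_TEST != UNDER_TEST + CONFIRM + NEGATIVE.

    Assumption: positive results are represented by CONFIRM.
    """
    bad = []
    for row in rows:
        expected = row["UNDER_TEST"] + row["CONFIRM"] + row["NEGATIVE"]
        if row["TOTAL_TEST"] != expected:
            # Keep the original row and attach the value the identity implies.
            bad.append({**row, "EXPECTED_TOTAL_TEST": expected})
    return bad


# Made-up rows for illustration only; DATE is a hypothetical column name.
sample = [
    {"DATE": "day 1", "CONFIRM": 10, "UNDER_TEST": 5, "NEGATIVE": 85, "TOTAL_TEST": 100},
    {"DATE": "day 2", "CONFIRM": 12, "UNDER_TEST": 8, "NEGATIVE": 90, "TOTAL_TEST": 111},
]

for row in total_test_mismatches(sample):
    print(row["DATE"], row["TOTAL_TEST"], row["EXPECTED_TOTAL_TEST"])
```

Rows flagged this way are not necessarily errors; they may coincide with the statistical corrections listed in Sheet 9.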
3) Notable changes
Since March 2, the Korean government has released official statistics as of 12 PM each day.
Korean data includes nursing home cases as well as foreigners detected at the airport.
The Seoul Metropolitan Government releases official statistics as of 10 AM each day.
Usable Korea Data
(updated : 2022. 05. 17)
Source : Korea Centers for Disease Control & Prevention
1) History
ARIC converts the raw data above into usable data by adding useful variables.
2) Data : COVID_KOREA
Integrated data for COVID-19
(updated : 2022. 05. 18)
1) Data : COVID_WORLD
On May 23, 2021, ARIC updated the socioeconomic variables that referred to years before 2019. For example, GDP per capita for 2017 was updated to GDP per capita for 2019. For more detail, refer to the metadata.
2) Description : Source
(1) Novel Coronavirus (COVID-19) Cases Data
ARIC aggregates the time series data (confirmed, deaths, and recovered) published by the Johns Hopkins Bloomberg School of Public Health.
- Variable : MVAR1-MVAR12
- Source : Johns Hopkins Bloomberg School of Public Health, HDX
(2) COVID-19 Cases worldwide (up to 14 December 2020)
Data on the geographic distribution of COVID-19 cases worldwide
- Variable : JVAR1-JVAR2
- Source : EU Open Data Portal
(3) Total COVID-19 Tests Performed by Country
"Our World in Data" provides not only data on confirmed cases, deaths, and testing but also other variables of potential interest.
- Variable : SVAR1-SVAR29
- Source : Our World in Data, HDX
(4) Stringency of government response
The OxCGRT collects information on 17 indicators of government responses, scores the stringency of the measures, and aggregates the scores into a Stringency Index.
- Variable : OVAR1-OVAR39
- Source : Hale, Thomas, Sam Webster, Anna Petherick, Toby Phillips, and Beatriz Kira (2020). Oxford COVID-19 Government Response Tracker, Blavatnik School of Government. Data use policy: Creative Commons Attribution CC BY standard.
(5) WHO COVID 19 Global Data
- Variable : WVAR1-WVAR4
- Source : WHO Coronavirus Disease (COVID-19) Dashboard
(6) Socioeconomic variables
ARIC collects socio-economic variables from a variety of sources.
- Variable : BVAR1-BVAR16, BKVAR1-BKVAR35
- Source : World Bank (BVAR1-BVAR16)
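Combining the COVID-19 series with socio-economic variables amounts to a join on a country key. The following is a minimal sketch of that idea, not ARIC's actual procedure: the column names ISO3, DATE, CONFIRMED, and GDP_PC are hypothetical stand-ins (the real file uses codes such as MVAR and BVAR, which the metadata explains), and the values are made up.

```python
def merge_by_country(covid_rows, socio_rows, key="ISO3"):
    """Left-join socio-economic values onto each COVID-19 row by country key.

    Countries without a socio-economic record keep only their COVID-19 columns.
    """
    socio_by_key = {row[key]: row for row in socio_rows}
    merged = []
    for row in covid_rows:
        extra = socio_by_key.get(row[key], {})
        # COVID-19 columns take precedence if a column name appears in both.
        merged.append({**extra, **row})
    return merged


# Made-up values for illustration only.
covid = [
    {"ISO3": "KOR", "DATE": "2021-03-31", "CONFIRMED": 100},
    {"ISO3": "XXX", "DATE": "2021-03-31", "CONFIRMED": 7},
]
socio = [
    {"ISO3": "KOR", "GDP_PC": 30000},
]

merged = merge_by_country(covid, socio)
for row in merged:
    print(row["ISO3"], row["CONFIRMED"], row.get("GDP_PC"))
```

A left join is used here so that no COVID-19 observation is dropped when a country lacks socio-economic data; whether the published file follows the same choice should be checked against the metadata.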
3) Data : COVID_GOVMEA
4) Data : COVID_CITY
(1) COVID-19: Global Cities Dataset
It provides daily confirmed cases and deaths from COVID-19 in major cities around the world until 31 March 2021.
(List of cities: Seoul, New York, London, Paris, Beijing, Tokyo, Jakarta, Metro Manila (NCR), Delhi)
- Variable : PVAR1-PVAR3
- Source : Please refer to the metadata.
5) Data : COVID_Vaccine in Asia Countries
(1) COVID-19: Vaccination in Asia Countries Dataset
It provides vaccine data and supply agreements for Asian countries up to 26 April 2021.
- Variable : AVAR1-AVAR35
- Source : Please refer to the metadata.