Commit c42bb0d6 authored by Chandrasekhar Ramakrishnan

feat: example for working with covidtracking data

parent 830a5e25
%% Cell type:code id: tags:
``` python
import pandas as pd
import altair as alt
```
%% Cell type:markdown id: tags:
# Look at the metadata
%% Cell type:code id: tags:
``` python
metadata_df = pd.read_json('../../data/covidtracking/states-metadata.json')
```
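%% Cell type:markdown id: tags:
A minimal sketch of what `pd.read_json` does with this kind of file, assuming the metadata is stored as a JSON list of records (the file layout is not shown here, so this is an assumption). The inline `sample_json` text and the `state_names` lookup are hypothetical stand-ins; the column names and the two rows are taken from the `head()` output, with the truncated URL columns left out.
%% Cell type:code id: tags:
```python
from io import StringIO

import pandas as pd

# Hypothetical inline stand-in for states-metadata.json.
# Assumption: the real file is a JSON list of records like these.
sample_json = StringIO("""[
  {"state": "AK", "twitter": "@Alaska_DHSS", "pui": "All data", "pum": false, "name": "Alaska"},
  {"state": "AR", "twitter": "@adhpio", "pui": "All data", "pum": true, "name": "Arkansas"}
]""")

# Each record becomes a row; each key becomes a column.
sample_metadata_df = pd.read_json(sample_json)

# Hypothetical helper: map state codes to full names for later labelling.
state_names = sample_metadata_df.set_index("state")["name"]
```
%% Cell type:markdown id: tags:
Indexing by `state` gives a quick lookup such as `state_names["AK"]`, which can be handy when joining the metadata to the daily data.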
%% Cell type:code id: tags:
``` python
metadata_df.head()
```
%% Output
state dataSite \
0 AK http://dhss.alaska.gov/dph/Epi/id/Pages/COVID-...
1 AL http://www.alabamapublichealth.gov/infectiousd...
2 AR https://www.healthy.arkansas.gov/programs-serv...
3 AZ https://www.azdhs.gov/preparedness/epidemiolog...
4 CA https://www.cdph.ca.gov/Programs/CID/DCDC/Page...
covid19Site twitter \
0 http://dhss.alaska.gov/dph/Epi/id/Pages/COVID-... @Alaska_DHSS
1 http://www.alabamapublichealth.gov/infectiousd... @alpublichealth
2 https://www.healthy.arkansas.gov/programs-serv... @adhpio
3 https://www.azdhs.gov/preparedness/epidemiolog... @azdhs
4 https://www.cdph.ca.gov/Programs/CID/DCDC/Page... @CAPublicHealth
pui pum notes \
0 All data False Unclear if their reported number means "person...
1 No data False Last negative count from 3/16.
2 All data True Pending = "PUIs"
3 All data False Negative = “Ruled Out”. Our total is slightly ...
4 Only positives False Only positives reported regularly. Add deaths ...
name
0 Alaska
1 Alabama
2 Arkansas
3 Arizona
4 California
%% Cell type:markdown id: tags:
# Look at the data
%% Cell type:code id: tags:
``` python
data_df = pd.read_json('../../data/covidtracking/states-daily.json')
data_df['date'] = pd.to_datetime(data_df['date'], format="%Y%m%d")
```
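%% Cell type:markdown id: tags:
The explicit `format="%Y%m%d"` matters if the raw `date` values are integers such as `20200318` (an assumption; the raw JSON is not shown). The sketch below uses a hypothetical two-row frame, `raw_df`, to contrast parsing with the format string against the default handling of a bare integer, which pandas reads as nanoseconds since the epoch.
%% Cell type:code id: tags:
```python
import pandas as pd

# Hypothetical sample; assumes dates arrive as YYYYMMDD integers.
raw_df = pd.DataFrame({"date": [20200318, 20200317], "state": ["AK", "AK"]})

# Without a format, a bare integer is treated as nanoseconds since 1970-01-01.
naive = pd.to_datetime(20200318)

# Casting to str first makes the YYYYMMDD parse explicit.
raw_df["date"] = pd.to_datetime(raw_df["date"].astype(str), format="%Y%m%d")
```
%% Cell type:markdown id: tags:
Here `naive` lands in 1970, while the `date` column holds the intended March 2020 timestamps.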
%% Cell type:code id: tags:
``` python
data_df.head()
```
%% Output
date state positive negative pending death total \
0 2020-03-18 AK 6.0 406.0 NaN NaN 412.0
1 2020-03-18 AL 46.0 28.0 NaN 0.0 74.0
2 2020-03-18 AR 33.0 236.0 50.0 NaN 319.0
3 2020-03-18 AZ 28.0 148.0 102.0 0.0 278.0
4 2020-03-18 CA 611.0 7981.0 NaN 13.0 8592.0
dateChecked
0 2020-03-18T20:00:00Z
1 2020-03-18T20:00:00Z
2 2020-03-18T20:00:00Z
3 2020-03-18T20:00:00Z
4 2020-03-18T20:00:00Z
%% Cell type:markdown id: tags:
## How many data points per state?
%% Cell type:code id: tags:
``` python
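# count() tallies non-null values per column, so columns can differ within a state
summary_df = data_df.groupby('state').count()
# Sort by number of dated rows (descending), then alphabetically by state
summary_df.reset_index().sort_values(['date', 'state'], ascending=[False, True])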
```
%% Output
state date positive negative pending death total dateChecked
4 AZ 15 15 15 15 6 15 15
5 CA 15 15 15 0 7 15 15
10 FL 15 15 15 14 8 15 15
11 GA 15 15 1 0 6 15 15
16 IL 15 15 15 12 2 15 15
21 MA 15 15 6 0 0 14 15
30 NC 15 15 7 2 3 15 15
33 NH 15 15 15 15 0 15 15
37 NY 15 15 13 4 4 15 15
40 OR 15 15 15 15 1 15 15
44 SC 15 15 15 3 3 15 15
47 TX 15 15 2 0 2 15 15
52 WA 15 15 13 2 8 15 15
53 WI 15 15 15 5 0 15 15
6 CO 14 14 14 4 5 14 14
8 DC 14 14 14 12 0 14 14
22 MD 14 14 14 2 0 14 14
24 MI 14 14 14 12 0 14 14
32 NE 14 14 14 11 0 14 14
34 NJ 14 14 9 12 8 14 14
36 NV 14 14 14 1 3 14 14
38 OH 14 14 14 12 0 14 14
46 TN 14 14 8 0 0 14 14
49 VA 14 14 14 5 4 13 14
0 AK 13 13 13 7 0 13 13
2 AR 13 13 13 13 0 13 13
9 DE 13 13 13 11 0 13 13
14 IA 13 13 13 8 0 13 13
17 IN 13 13 9 0 5 13 13
18 KS 13 13 13 5 5 13 13
19 KY 13 13 13 1 3 13 13
25 MN 13 13 13 0 0 13 13
35 NM 13 13 13 4 0 13 13
41 PA 13 13 8 5 0 12 13
51 VT 13 13 13 0 0 13 13
54 WV 13 12 13 13 4 13 13
1 AL 12 11 9 4 4 12 12
7 CT 12 11 11 1 0 12 12
13 HI 12 12 1 1 0 12 12
15 ID 12 12 12 0 0 12 12
20 LA 12 12 11 0 4 12 12
23 ME 12 12 12 6 0 12 12
26 MO 12 12 7 0 5 12 12
28 MS 12 12 8 0 0 12 12
29 MT 12 12 12 5 0 12 12
31 ND 12 11 12 8 6 12 12
39 OK 12 12 11 11 0 12 12
43 RI 12 12 11 11 0 12 12
45 SD 12 12 12 12 1 12 12
48 UT 12 12 7 0 4 12 12
55 WY 12 12 3 1 0 12 12
3 AS 3 3 0 0 1 3 3
12 GU 3 3 3 0 0 3 3
27 MP 3 3 0 0 1 3 3
42 PR 3 3 3 2 0 3 3
50 VI 3 3 0 0 1 3 3
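%% Cell type:markdown id: tags:
The columns in the summary differ within a state because `count()` only tallies non-null values, which is why `pending` and `death` are often lower than `date`. A small sketch of the distinction between `count()` and `size()`, using a hypothetical three-row frame, `sample_df`, whose values are illustrative rather than taken from the dataset:
%% Cell type:code id: tags:
```python
import numpy as np
import pandas as pd

# Hypothetical sample with missing values in `pending`.
sample_df = pd.DataFrame({
    "state": ["AK", "AK", "AL"],
    "date": pd.to_datetime(["2020-03-18", "2020-03-17", "2020-03-18"]),
    "positive": [6.0, 3.0, 46.0],
    "pending": [np.nan, 5.0, np.nan],
})

# size() counts rows per group, regardless of missing values.
rows_per_state = sample_df.groupby("state").size()

# count() counts non-null entries per column within each group.
non_null_counts = sample_df.groupby("state").count()
```
%% Cell type:markdown id: tags:
If the goal is strictly "how many rows per state", `size()` answers that directly; `count()` additionally shows how complete each column is.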