Commit 0119c679 authored 4 years ago by Chandrasekhar Ramakrishnan
feat: process notebook to download distancing data
parent 88545a4d
1 merge request: !160 distancing
notebooks/process/download-distancing-data.ipynb (new file, mode 0 → 100644): 184 additions, 0 deletions
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"import requests\n",
"import os\n",
"from io import BytesIO\n",
"import pandas as pd"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"tags": [
"parameters"
]
},
"outputs": [],
"source": [
"out_folder = \"../data/distancing-metrics/\"\n",
"PAPERMILL_OUTPUT_PATH = None"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Download Apple Mobility\n",
"\n",
"Download the Apple data. The URL for this changes every day and needs to be updated. You can get the \n",
"current URL from https://www.apple.com/covid19/mobility by looking at the link of the *All Data CSV* button."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"url = 'https://covid19-static.cdn-apple.com/covid19-mobility-data/2007HotfixDev55/v2/en-us/applemobilitytrends-2020-05-10.csv'\n",
"r = requests.get(url, allow_redirects=True)\n",
"apple_csv = r.content"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# save the result\n",
"if PAPERMILL_OUTPUT_PATH:\n",
"    out_path = os.path.join(out_folder, 'mobility/apple/applemobilitytrends.csv')\n",
"    with open(out_path, 'wb') as f:\n",
"        f.write(apple_csv)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"apple_df = pd.read_csv(BytesIO(apple_csv))\n",
"print(len(apple_df), \"rows of data\")\n",
"apple_df.head(2)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Download Google Mobility\n",
"\n",
"Download the Google data."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"url = 'https://www.gstatic.com/covid19/mobility/Global_Mobility_Report.csv'\n",
"r = requests.get(url, allow_redirects=True)\n",
"google_csv = r.content"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# save the result\n",
"if PAPERMILL_OUTPUT_PATH:\n",
"    out_path = os.path.join(out_folder, 'mobility/google/Global_Mobility_Report.csv')\n",
"    with open(out_path, 'wb') as f:\n",
"        f.write(google_csv)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"google_df = pd.read_csv(BytesIO(google_csv))\n",
"print(len(google_df), \"rows of data\")\n",
"google_df.head(2)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Download BU US State Measures\n",
"\n",
"Download the BU US State Measures spreadsheet."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"url = 'https://docs.google.com/spreadsheets/d/1zu9qEWI8PsOI_i8nI_S29HDGHlIp2lfVMsGxpQ5tvAQ/gviz/tq?tqx=out:csv'\n",
"r = requests.get(url, allow_redirects=True)\n",
"bu_csv = r.content"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# save the result\n",
"if PAPERMILL_OUTPUT_PATH:\n",
"    out_path = os.path.join(out_folder, 'measures-us/bu-edu.csv')\n",
"    with open(out_path, 'wb') as f:\n",
"        f.write(bu_csv)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"bu_df = pd.read_csv(BytesIO(bu_csv))\n",
"print(len(bu_df), \"rows of data\")\n",
"bu_df.head(2)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.3"
}
},
"nbformat": 4,
"nbformat_minor": 4
}
%% Cell type:code id: tags:
```python
import requests
import os
from io import BytesIO
import pandas as pd
```
%% Cell type:code id: tags:parameters
```python
out_folder = "../data/distancing-metrics/"
PAPERMILL_OUTPUT_PATH = None
```
%% Cell type:markdown id: tags:
# Download Apple Mobility
Download the Apple data. The URL for this changes every day and needs to be updated. You can get the
current URL from https://www.apple.com/covid19/mobility by looking at the link of the *All Data CSV* button.
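Because the URL is hard-coded and carries a date in its file name, a small check can flag when it has gone stale. The following is a minimal sketch, not part of the commit: the helper names `apple_url_date` and `is_stale` and the `max_age_days` default are hypothetical, and it assumes the file name keeps the `applemobilitytrends-YYYY-MM-DD.csv` pattern seen in the notebook. It cannot discover the new URL, since other path segments may change as well.

```python
import re
from datetime import date, datetime
from typing import Optional

# Hypothetical helper, not part of the notebook: matches the YYYY-MM-DD stamp
# in a file name such as applemobilitytrends-2020-05-10.csv.
_DATE_IN_NAME = re.compile(r"applemobilitytrends-(\d{4}-\d{2}-\d{2})\.csv$")


def apple_url_date(url: str) -> Optional[date]:
    """Return the date embedded in the URL's file name, or None if absent."""
    match = _DATE_IN_NAME.search(url)
    if match is None:
        return None
    return datetime.strptime(match.group(1), "%Y-%m-%d").date()


def is_stale(url: str, today: date, max_age_days: int = 1) -> bool:
    """True when the URL has no date or its date is older than max_age_days."""
    stamped = apple_url_date(url)
    if stamped is None:
        return True
    return (today - stamped).days > max_age_days
```

Calling `is_stale(url, date.today())` before the download would give an early warning that the link needs to be refreshed by hand.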
%% Cell type:code id: tags:
```python
url = 'https://covid19-static.cdn-apple.com/covid19-mobility-data/2007HotfixDev55/v2/en-us/applemobilitytrends-2020-05-10.csv'
r = requests.get(url, allow_redirects=True)
apple_csv = r.content
```
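The download cell stores `r.content` without checking the HTTP status, so an outdated URL could yield an error page instead of CSV data. One possible hardening is sketched below; it is not what the commit does. The function name `fetch_bytes` and the 30-second timeout are assumptions, and `session` is assumed to be a `requests.Session` or any object with the same `.get()` interface.

```python
def fetch_bytes(url, session, timeout=30):
    """Download url and return the response body as bytes.

    `session` is assumed to be a requests.Session (or an object exposing the
    same .get() interface); `timeout` in seconds is an illustrative default.
    """
    response = session.get(url, allow_redirects=True, timeout=timeout)
    # Raise on 4xx/5xx instead of silently keeping an error page as "CSV".
    response.raise_for_status()
    return response.content
```

In the notebook this could be used as `apple_csv = fetch_bytes(url, requests.Session())`, and the same call would serve the Google and BU downloads.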
%% Cell type:code id: tags:
```python
# save the result
if PAPERMILL_OUTPUT_PATH:
    out_path = os.path.join(out_folder, 'mobility/apple/applemobilitytrends.csv')
    with open(out_path, 'wb') as f:
        f.write(apple_csv)
```
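The save cell assumes the target directory already exists; `open()` fails if it does not. A minimal sketch of a shared save step that creates missing directories first is shown here. The helper name `save_bytes` is hypothetical and not part of the commit.

```python
import os


def save_bytes(out_folder, relative_path, data):
    """Write data to out_folder/relative_path, creating missing directories."""
    out_path = os.path.join(out_folder, relative_path)
    # Create e.g. <out_folder>/mobility/apple/ if it is not there yet.
    os.makedirs(os.path.dirname(out_path), exist_ok=True)
    with open(out_path, "wb") as f:
        f.write(data)
    return out_path
```

Under this assumption, the three save cells would reduce to calls such as `save_bytes(out_folder, 'mobility/apple/applemobilitytrends.csv', apple_csv)` inside the `if PAPERMILL_OUTPUT_PATH:` guard.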
%% Cell type:code id: tags:
```python
apple_df = pd.read_csv(BytesIO(apple_csv))
print(len(apple_df), "rows of data")
apple_df.head(2)
```
%% Cell type:markdown id: tags:
# Download Google Mobility
Download the Google data.
%% Cell type:code id: tags:
```python
url = 'https://www.gstatic.com/covid19/mobility/Global_Mobility_Report.csv'
r = requests.get(url, allow_redirects=True)
google_csv = r.content
```
%% Cell type:code id: tags:
```python
# save the result
if PAPERMILL_OUTPUT_PATH:
    out_path = os.path.join(out_folder, 'mobility/google/Global_Mobility_Report.csv')
    with open(out_path, 'wb') as f:
        f.write(google_csv)
```
%% Cell type:code id: tags:
```python
google_df = pd.read_csv(BytesIO(google_csv))
print(len(google_df), "rows of data")
google_df.head(2)
```
%% Cell type:markdown id: tags:
# Download BU US State Measures
Download the BU US State Measures spreadsheet.
%% Cell type:code id: tags:
```python
url = 'https://docs.google.com/spreadsheets/d/1zu9qEWI8PsOI_i8nI_S29HDGHlIp2lfVMsGxpQ5tvAQ/gviz/tq?tqx=out:csv'
r = requests.get(url, allow_redirects=True)
bu_csv = r.content
```
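The BU URL combines a spreadsheet ID with a fixed CSV-export suffix. If the spreadsheet ever moves, separating the two makes the change a one-line edit. This is a small sketch only: `sheet_csv_url` is a hypothetical name, and the URL pattern is copied from the cell above rather than taken from any documented contract.

```python
def sheet_csv_url(sheet_id):
    """Build the CSV-export URL in the form used by the notebook."""
    return f"https://docs.google.com/spreadsheets/d/{sheet_id}/gviz/tq?tqx=out:csv"


# Spreadsheet ID taken from the notebook's URL.
BU_SHEET_ID = "1zu9qEWI8PsOI_i8nI_S29HDGHlIp2lfVMsGxpQ5tvAQ"
url = sheet_csv_url(BU_SHEET_ID)
```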
%% Cell type:code id: tags:
```python
# save the result
if PAPERMILL_OUTPUT_PATH:
    out_path = os.path.join(out_folder, 'measures-us/bu-edu.csv')
    with open(out_path, 'wb') as f:
        f.write(bu_csv)
```
%% Cell type:code id: tags:
```python
bu_df = pd.read_csv(BytesIO(bu_csv))
print(len(bu_df), "rows of data")
bu_df.head(2)
```