Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
D
democrasci_preprocWP1
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Marta Balode
democrasci_preprocWP1
Repository graph
Repository graph
You can move around the graph by using the arrow keys.
9bf1f3286c7a034fc0333235f14ed8846d9a8045
Select Git revision
Branches
14
ClassTextbox
add-language-to-06
annotate-xml-1981etc
backbutton_in_labeling
bill_segmentation
debug_tarreading
diff_secs
doc2vec
extractMetaData
extract_text_summary
master
default
protected
provide-complete-file-paths
renku/autosave/luis.salamanca/master/5b70b3a/5b70b3a
webApp_labelling_backend
14 results
Begin with the selected commit
Created with Raphaël 2.2.0
5
Feb
4
1
30
Jan
29
28
25
24
23
22
21
16
15
14
9
8
7
4
3
2
1
30
Dec
29
28
24
12
11
10
9
8
7
6
5
4
3
2
1
30
Nov
29
26
23
21
16
Lfs of file into run
renku run --isolation python src/python/run_correctxml.py data/AB//1976/02_extractedxml.tar.gz data/AB//1976/04_correctedxml.tar.gz
Lfs of file into run
Lfs of file into run
renku dataset add lists ../data_from_nlp-democracy/results_overlap/dict_overlaps.pickle
WIP: implement excluding overlaps
Merge branch 'update-mp-extraction' of renkulab.io:luis.salamanca/democrasci_preprocwp1 into update-mp-extraction
renku run python src/python/run_extract_origxml.py data/AB//1979/00_rawpdfs.tar.gz data/AB//1979/02_extractedxml.tar.gz
renku run python src/python/run_extract_origxml.py data/AB//1978/00_rawpdfs.tar.gz data/AB//1978/02_extractedxml.tar.gz
create output file with speakers
adapt utils_annot.py to 3 additonal infos
renku run --isolation python src/python/extractMPs.py data/politicians/Ratsmitglieder_1848_DE_corr.xlsx data/politicians/MPs_additionalInfo.csv data/politicians/MPs_after1890.csv data/politicians/lastnames/
fine-tuning of name disambiguation
Merge branch 'master' into 'update-mp-extraction'
Merge branch 'name-disambiguation' into 'master'
minor changes to runextractdiscussions
renku run python src/python/run_correctxml.py data/AB//1991/02_extractedxml.tar.gz data/AB//1991/04_correctedxml.tar.gz
renku run --isolation python src/python/extractMPs.py data/politicians/Ratsmitglieder_1848_DE_corr.xlsx data/politicians/MPs_additionalInfo.csv data/politicians/MPs_after1890.csv data/politicians/lastnames/
clean up before renku run
correct mistakes in two input files
find firstname before lastname
clean up before renku run
implement role assignment and disambiguation for Bundesrat
add missing entry for Debétaz as Nationalrat
name disambiguation fine-tuning
add CantonAbbreviation to MPs additional Info
corrected two entries in Ratsmitglieder file, (Frey Emil J.R. (BL), Scherer Victor Emil)
last name which is both canton and double now also finds person which is double (Schmid-Ruedin Philipp, 1951)
update gitignore, minor change to runextractdiscussions
renku run python src/python/run_extract_origxml.py data/AB//1991/00_rawpdfs.tar.gz data/AB//1991/02_extractedxml.tar.gz
renku run --isolation python src/python/extractMPs.py data/politicians/Ratsmitglieder_1848_DE_corr.xlsx data/politicians/MPs_additionalInfo.csv data/politicians/MPs_after1890.csv data/politicians/lastnames/
cleanup before renku update
fine-tuning of code
worked on notnames and additoinalinfo 1951, 1971
add notunique files to gitignore
fine-tuning
more notnames and additionalinfo 1891, 1925
one more entry to MPs additionalINfo
implemented council and date disambiguation
updated list not names 1893
Loading