Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
D
democrasci_preprocWP1
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Marta Balode
democrasci_preprocWP1
Graph
16697a3c35660544ffb079b38f508c6eef495fc7
Select Git revision
Branches
14
ClassTextbox
add-language-to-06
annotate-xml-1981etc
backbutton_in_labeling
bill_segmentation
debug_tarreading
diff_secs
doc2vec
extractMetaData
extract_text_summary
master
default
protected
provide-complete-file-paths
renku/autosave/luis.salamanca/master/5b70b3a/5b70b3a
webApp_labelling_backend
14 results
You can move around the graph by using the arrow keys.
Begin with the selected commit
Created with Raphaël 2.2.0
7
Jan
6
2
17
Dec
10
3
26
Nov
21
19
12
5
31
Oct
29
28
22
15
8
1
24
Sep
23
20
8
Jul
3
Jun
31
May
27
24
8
7
3
Apr
1
25
Mar
22
14
13
1
28
Feb
27
26
25
23
22
20
19
18
14
13
12
8
7
implemented back button
Merge branch 'feature_based_title_training_and_labelling' into 'master'
finished tutorial
Find Tagesordnung in AddP + old titlematching
MetaFile extraction
partially implemented page classification and documentation
added features of new years
adding documentation, LOCAL_CONSTANTS is now environment variable
first version
stuff
stuff
before refactoring the parsing infrastructure
Notebook duplicates
Merge branch 'master' of renkulab.io:luis.salamanca/democrasci_preprocwp1
Correcting the problem with repeated lines in two blocks
Merge branch 'bug_duplicated_linebbox' into 'master'
adding data
Merge branch 'bug_duplicated_linebbox' into feature_based_title_training_and_labelling
tiny bug
included the exclusion of the test set
refatored the get_list
ignore last page, merge duplicated textlines in pdf2xml
test case
add exploration notebook
In order to integrate the Constants stuff. Merge branch 'feature_based_title_training_and_labelling' into bug_duplicated_linebbox
textlines become duplicated between 02 and 04
added features to git
finished the train relabel loop
Notebooks for basic usage of additional protocols
Data of additional protocols, and notebook to parse them
Noteboojs
Merge branch 'master' of renkulab.io:luis.salamanca/democrasci_preprocwp1
Notebooks with basic for metadat usage
Update requirements.txt
Update Dockerfile
get_list bugfix human labelling stuff
stuff
training
Merge branch 'feature_based_title_classification' into 'master'
features and heuristic labels for title
Loading