From 1c5eddb73acfe7347b6addf9f173eb6a4f32bfc3 Mon Sep 17 00:00:00 2001 From: Luis Salamanca <luis.salamanca@sdsc.ethz.ch> Date: Wed, 21 Nov 2018 10:01:30 +0000 Subject: [PATCH] Project dev file --- projectdev_wp1.md | 55 +++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 55 insertions(+) create mode 100644 projectdev_wp1.md diff --git a/projectdev_wp1.md b/projectdev_wp1.md new file mode 100644 index 00000000..021e34dc --- /dev/null +++ b/projectdev_wp1.md @@ -0,0 +1,55 @@ +--- +output: + html_document: default +title: Democrasci Project Development +author: LS, LG, LB, RR +--- + +## Project Overview + +- [ ] Work package 1: Mining document structure and contents + - [x] 1.1 Check if OCR on PDF scans is sufficient + - [ ] 1.2 Extracting XMLs from PDF scans + - [ ] 1.X .. +- [ ] Work package 2: Knowledge graph construction +- [ ] Work package 3: Interactive data exploration and visualization + +## Reporting Style + +* Description of the problem/task + * date started + * brief description + * PDF-figures on what the problem is about +* Description of the solution/code + * date completed + * brief description + * proof output (PDF-figures on how the problem was solved) +* Code documentation + * input data + * code scripts + * output data + +## Detailed Project Overview + +### Work package 1: Mining document structure and contents + +#### Task 1.1: Check if OCR on PDF scans is sufficient + +Problem description. +How the check was made. +Code used to check. +Date finished. + +#### Task 1.2: Extracting XMLs from PDF scans + + + + + + + + + + + + -- GitLab