--- output: html_document: default title: Democrasci Project Development author: LS, LG, LB, RR --- ## Project Overview - [ ] Work package 1: Mining document structure and contents - [x] 1.1 Check if OCR on PDF scans is sufficient - [ ] 1.2 Extracting XMLs from PDF scans - [ ] 1.X .. - [ ] Work package 2: Knowledge graph construction - [ ] Work package 3: Interactive data exploration and visualization ## Reporting Style * Description of the problem/task * date started * brief description * PDF-figures on what the problem is about * Description of the solution/code * date completed * brief description * proof output (PDF-figures on how the problem was solved) * Code documentation * input data * code scripts * output data ## Detailed Project Overview ### Work package 1: Mining document structure and contents #### Task 1.1: Check if OCR on PDF scans is sufficient Problem description. How the check was made. Code used to check. Date finished. #### Task 1.2: Extracting XMLs from PDF scans