Forked from Luis Salamanca / democrasci_preprocWP1
785 commits behind the upstream repository.
output:
  html_document: default
title: Democrasci Project Development
author: LS, LG, LB, RR

Project Overview

  • Work package 1: Mining document structure and contents
    • 1.1 Check if OCR on PDF scans is sufficient
    • 1.2 Extracting XMLs from PDF scans
    • 1.X ..
  • Work package 2: Knowledge graph construction
  • Work package 3: Interactive data exploration and visualization

Reporting Style

  • Description of the problem/task
    • date started
    • brief description
    • PDF figures illustrating the problem
  • Description of the solution/code
    • date completed
    • brief description
    • proof of output (PDF figures showing how the problem was solved)
  • Code documentation
    • input data
    • code scripts
    • output data

Detailed Project Overview

Work package 1: Mining document structure and contents

Task 1.1: Check if OCR on PDF scans is sufficient

Problem description. How the check was made. Code used to check. Date finished.

Task 1.2: Extracting XMLs from PDF scans