cop-mining-participants/codemaster
cop-mining-participants/code
master
History Graph
History Graph
Commit | Author | Details | Committed | |||
---|---|---|---|---|---|---|
29e001a8ff57 | Jan Linder | IMPLEMENT PDF TO TXT CORRECTLY Now use pdfminer with the laparam argument to… | Oct 6 2020 | |||
66d3e48b2e2a | Jan Linder | the ocr now works correctly | Oct 5 2020 | |||
154caa57dcc3 | Jan Linder | blacklist | Oct 5 2020 | |||
2f69d7e02c12 | Jan Linder | whitelist ocr: not perfect | Oct 5 2020 | |||
9cc36860b206 | Jan Linder | Improve analysis for cop3 and cop4 | Oct 5 2020 | |||
0d36b15c21a8 | Jan Linder | Small corrections for OCR | Oct 5 2020 | |||
67d367f6243d | Jan Linder | finish modularization | Oct 5 2020 | |||
e2ddd912c5b8 | Jan Linder | File structure updated. (untested) | Oct 4 2020 | |||
26f03fd369c3 | Jan Linder | Begin with making a proper modularization | Oct 4 2020 | |||
2d93960b8990 | Jan Linder | inserted boxes for OCR and right parameters | Sep 30 2020 | |||
d37074c1d20f | Jan Linder | minor changes of process, added raw of cop3 | Sep 28 2020 | |||
5b6aa3b3ac8b | Jan Linder | progress on cop2-4 | Sep 28 2020 | |||
57cdc59e9210 | Jan Linder | Use of process_copX.py precised in README | Sep 27 2020 | |||
355f0c163915 | Jan Linder | Implemented processing of cop2-4. Works good for countries, but has major… | Sep 27 2020 | |||
a557a870c334 | Jan Linder | try with pypdf2 | Sep 27 2020 | |||
13691532eec5 | Jan Linder | implemented process cop for 5 - 25 with textract but there are major errors in… | Sep 26 2020 | |||
7f3f311f331c | Jan Linder | progress on the class and process script | Sep 23 2020 | |||
c747832a97fe | Jan Linder | Began the copx file | Sep 22 2020 | |||
aa92caa65e57 | Jan Linder | added raw txt for cop25 | Sep 22 2020 | |||
9514a37465f4 | Jan Linder | data complete | Sep 17 2020 | |||
77b17d5e7bd6 | Jan Linder | data complete | Sep 17 2020 | |||
45310b6c42f1 | Jan Linder | first part of the lists | Sep 17 2020 |
c4science · Help