classifier: support colons in file paths

Authored by Jan Aage Lavik <jan.age.lavik@cern.ch> on Apr 4 2015, 12:33.

Description

  • FIX Properly handles file paths containing a colon (:), avoiding bad text extraction that causes (1) wrong results and (2) much slower execution.
  • Improves the reporting of problems in the ontology.
  • Removes the check for whether the PDF text is English, as it is irrelevant.
  • Slightly refactors the code that downloads remote files.

Signed-off-by: Jan Aage Lavik <jan.age.lavik@cern.ch>
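
The commit message does not show how the colon handling was implemented. As an illustration only, a minimal sketch of one way to avoid mistaking a local path containing a colon for a remote URL might look like the following. The function name is_remote_source and the REMOTE_SCHEMES set are hypothetical and are not taken from the classifier code.

```python
from urllib.parse import urlparse

# Hypothetical allow-list of schemes treated as remote; not from the commit.
REMOTE_SCHEMES = {"http", "https", "ftp"}


def is_remote_source(source: str) -> bool:
    """Return True only for URLs with a known remote scheme and a host.

    A bare check such as `":" in source` would misclassify local paths
    like "/tmp/report:final.pdf", so the scheme is parsed explicitly.
    """
    parsed = urlparse(source)
    return parsed.scheme.lower() in REMOTE_SCHEMES and bool(parsed.netloc)


if __name__ == "__main__":
    for candidate in ("http://example.org/paper.pdf", "/tmp/report:final.pdf"):
        kind = "remote" if is_remote_source(candidate) else "local"
        print(candidate, "is", kind)
```

With an allow-list of schemes, anything that does not match is handled as a local file, so a colon in a file name never triggers a download attempt.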

Details

Committed
Jan Aage Lavik <jan.age.lavik@cern.ch> Apr 25 2015, 15:13
Parents
R3600:d64fffea5a05: classifier: fast_mode fix and task results
Branches
Unknown
Tags
Unknown

Event Timeline

Jan Aage Lavik <jan.age.lavik@cern.ch> committed R3600:4b56c071c54a: classifier: support colons in file paths (authored by Jan Aage Lavik <jan.age.lavik@cern.ch>). Apr 25 2015, 15:13