Homec4science

refextract: improve realign numeration

Description

refextract: improve realign numeration

  • Improve the realigning of numeration, across badly split affiliation lines (added config variable to specify an acceptable numeric gap between numerated affiliations, in the event of bad pdftotext conv.).
  • Remove -raw from pdftotext conversion, to do: use -layout instead.
  • Strip reference section from document when looking for auths/affs.
  • partially able to obtain standard authors through comma and numeration placement

Details

Committed
Tibor Simko <tibor.simko@cern.ch>Nov 23 2011, 00:33
Parents
R3600:b48c5ea50500: refextract: include author choice heuristics
Branches
Unknown
Tags
Unknown

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:f12b25aa95c4: refextract: improve realign numeration (authored by Christopher Hayward <christopher.james.hayward@cern.ch>).Nov 23 2011, 00:33