Homec4science

BibHarvest: add authorlist extraction post-process

Authored by Jan Aage Lavik <jan.age.lavik@cern.ch> on Aug 5 2011, 20:46.

Description

BibHarvest: add authorlist extraction post-process

  • Adds extration of huge collaboration authorlist post-process step (a) to harvesting workflow. Authorlists are extracted when an authorlist xml-file is found, indicating a collaboration paper. The extracted authors are added to the resulting MARCXML.
    • Adds conversion of LaTeX symbols for authornames in authorlists, using available functions in textutils.
    • Any UNDEFINED affiliations from authorlist extraction are also now extracted and added as a new RT ticket.
    • To do the convertion from authorlist XML to MARCXML a XSLT stylesheet is added.
  • Enhances BibTask log messages for OAI harvest jobs.
  • Status messages when editing/adding a harvesting source in OAI Harvest admin are now shown at the top.

Details

Committed
Tibor Simko <tibor.simko@cern.ch>Aug 11 2011, 17:00
Parents
R3600:3267d797c389: BibCatalog: fix bug with multi-line ticket body
Branches
Unknown
Tags
Unknown

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:ec706bc5917c: BibHarvest: add authorlist extraction post-process (authored by Jan Aage Lavik <jan.age.lavik@cern.ch>).Aug 11 2011, 17:00