Homec4science

BibAuthorID: backend and frontend updates

Authored by Samuele Carli <samuele.carli@cern.ch> on Aug 10 2011, 19:34.

Description

BibAuthorID: backend and frontend updates

  • New namestring to namestring comparison function. This takes into account gender and synonims dictionaries, as well as smart point assignment relative to initials and names order and levenshtein distance.
  • Fixed paper exclusion lists in config file
  • Addedd new makefiles to install names comparison dictionaries
  • bugfixes in personid
  • Max_threads and Max_processess incremented in config file
  • New personID garbage collector (update_personid_table_from_paper)
  • Minor changes and fixes
  • Bibauthorid: GC: table limitations updated Updated the maximum number of rows in memory per round from 1M to 10M.
  • Command line interface updated to allow easy run of prod regular updates through: --fast-update-personid [-r rec1,...,recN]

adds new and modified papers to personid tables

--personid-gc [-r rec1,...,recN]

removes deleted and stale papers from personid

Co-authored-by: Henning Weiler <henning.weiler@cern.ch>
Co-authored-by: Nikola Yolov <nikola.yolov@cern.ch>

Details

Committed
Tibor Simko <tibor.simko@cern.ch>Aug 29 2011, 13:26
Parents
R3600:4a72899341a4: bibdocfile: add --with-flags CLI parameter
Branches
Unknown
Tags
Unknown

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:f7182609ef22: BibAuthorID: backend and frontend updates (authored by Samuele Carli <samuele.carli@cern.ch>).Aug 29 2011, 13:26