Homec4science

BibIndex: wash input phrases for UTF-8

Authored by Tibor Simko <tibor.simko@cern.ch> on Dec 9 2010, 10:13.

Description

BibIndex: wash input phrases for UTF-8

  • Plugged wash_for_utf8() into all indexes; this should fix all the Unicode indexing troubles and exceptions at the price of some indexing performance due to repetitive washing.
  • Note that full-text index may not be washed very effectively either; this should be fixed in during the textification phase when washing all the ligatures and friends, which is a separate task (see #317).

Details

Committed
Tibor Simko <tibor.simko@cern.ch>Dec 9 2010, 10:17
Parents
R3600:246b1466dc42: WebSearch: more robust colon treatment
Branches
Unknown
Tags
Unknown

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:3ee31c4d53bc: BibIndex: wash input phrases for UTF-8 (authored by Tibor Simko <tibor.simko@cern.ch>).Dec 9 2010, 10:17