Homec4science

Added debugging code WRT encoding, just as for the bibrank word indexer. Now…

Authored by Tibor Simko <tibor.simko@cern.ch> on May 13 2008, 17:24.

Description

Added debugging code WRT encoding, just as for the bibrank word indexer. Now the bad non-UTF-8 words and simply ignored and the indexing process goes on. (And the admin is alerted about the record having the bad word.)

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:a3b4bad725ec: Added debugging code WRT encoding, just as for the bibrank word indexer. Now… (authored by Tibor Simko <tibor.simko@cern.ch>).May 13 2008, 17:24