Homec4science

get_words_from_phrase(): Use explicit non-alphanumeric separators because \w \W…

Authored by Tibor Simko <tibor.simko@cern.ch> on May 23 2003, 16:41.

Description

get_words_from_phrase(): Use explicit non-alphanumeric separators because \w \W exhibit some troubles for UTF-8, and locales didn't help either. The workaround now works fine for ASCII-like non-alphanumeric separators, but don't for e.g. Spanish reverse question mark. TODO: investigate why \w \W don't work well with UTF-8.

Event Timeline

Tibor Simko <tibor.simko@cern.ch> committed R3600:954665ff78b5: get_words_from_phrase(): Use explicit non-alphanumeric separators because \w \W… (authored by Tibor Simko <tibor.simko@cern.ch>).May 23 2003, 16:41