DocExtract: webinterface text box fix
- py2.4 compatibility patch for the webinterface decode() does not accept decode("utf-8", errors="ignore") and expects instead decode("utf-8", "ignore")
- Lazy regexps compiliing in authorextract_re.py do less at import time.