History Graph
Commit | Author | Details | Committed
---|---|---|---
3c3dc2e7fcc5 | Tibor Simko | kwalitee: adapt disable-msg to new pylint | Aug 17 2010
d277958c9e4d | Benoit Thiell/Tibor Simko | Removal of unused imports | Aug 17 2010
e2feb8f6e2ee | Benoit Thiell/Tibor Simko | BibIndex: raise meaningful exceptions | May 26 2010
372bfd0ce949 | Samuele Kaplun/Tibor Simko | BibIndex: fix remote full-text file indexing | May 25 2010
a5a8668bf77d | Samuele Kaplun/Tibor Simko | BibIndex: quick hack to index hidden files | Apr 14 2010
d54e912fa70e | Samuele Kaplun | BibIndex: added support for idxPAIR | Apr 13 2010
7b3804d29a49 | Samuele Kaplun | WebSubmit: new fulltext conversion tools | Apr 13 2010
bdd48973620d | Tibor Simko | BibIndex: created exact and fuzzy author indexes | Mar 28 2010
f4c91b2e7a13 | Tibor Simko | BibIndex: new get_words_from_date_tag() | Mar 8 2010
3a120cbd10ae | Samuele Kaplun/Tibor Simko | BibIndex: do not email exceptions when repairing | Nov 30 2009
98a733305a94 | Tibor Simko | BibIndex: more explicit messages during reindexing | Aug 25 2009
1c3f5175204b | Tibor Simko | htmlutils: new remove_html_markup() API function | Jun 26 2009
40394efc1646 | Tibor Simko | Global move from javadoc to epydoc docstring style | Jun 15 2009
dfa81009a5b7 | Tibor Simko | Amended and merged branch sam/idxPHRASE-deployment | Jan 8 2009
042fd85e1ace | Tibor Simko | Improved index corruption emergency help messages. | Dec 8 2008
e28cd2bc7cd5 | Samuele Kaplun | Disable phrase breaking algorithm. | Oct 31 2008
cc46dc12bf20 | Samuele Kaplun | Correctly checking for proper use of --reindex. | Oct 31 2008
ac97c25cc183 | Samuele Kaplun | Fixed wrong call to wordTable.update_last_updated. | Oct 31 2008
2025e6af3243 | Samuele Kaplun | Implemented reindexing via temporary table swapping. | Oct 31 2008
a40f144704d3 | Samuele Kaplun | Disabled stemming for idxPHRASE. | Oct 31 2008
d60b499c3006 | Samuele Kaplun | Plugged back idxPHRASE creation. | Oct 31 2008
51bfbcb9fa42 | Tibor Simko | Removed $Id$ from file preamble comments. | Oct 27 2008
919c5c448915 | Samuele Kaplun | Fixing bug (reported by Devin Bougie) triggered when filename has no extension… | Sep 9 2008
ac6d47d5fa26 | Samuele Kaplun | Escaped filenames. in get_words_from_fulltext | Aug 12 2008
6c4510b06391 | Tibor Simko | Catch exceptions of lower_index_term() and continue in spite of UTF-8 problems… | Jun 17 2008
c3b76be701ba | Tibor Simko | Cosmetics of output messages for the stemming info. | Jun 4 2008
62b87aa91c88 | Tibor Simko | Lower phrase-to-be-indexed in a Unicode-friendly manner. | May 29 2008
4dd00adc2995 | Tibor Simko | First implementation of the intelligent 'journal' index. This is an index… | May 16 2008
e12a89b824b5 | Samuele Kaplun | Better closed mkstemp-created files. | May 14 2008
6d17ad8b673e | Samuele Kaplun | Fixed mkstemp calls that leaved open files. | May 14 2008
a3b4bad725ec | Tibor Simko | Added debugging code WRT encoding, just as for the bibrank word indexer. Now… | May 13 2008
ffb2b3a00592 | Samuele Kaplun | Trying to fixing select/update vs select/insert bug. | May 12 2008
8e489b9e2ffd | Samuele Kaplun | get_word_tables now returns a list of tuples (index_id, index_tags) instead of… | May 11 2008
02c67d5a4a19 | Samuele Kaplun | Deployed task_sleep_now_if_required to bibindex. | Apr 3 2008
cc5418ab136d | Samuele Kaplun | New cleaner signal handling. By using task_sleep_now_if_required(can_stop_too)… | Apr 1 2008
3d880f427bbe | Samuele Kaplun | Fixed bibindex --reindex to work when no indexes are specified (it reindexes… | Apr 1 2008
0fb4ae583079 | Samuele Kaplun | New bibsched_low_level_task_submission for enqueing tasks via API. (changes a… | Mar 31 2008
c47f7c426b8d | Samuele Kaplun | Removed last spourious printed newline. | Mar 26 2008
91c19744a9d8 | Samuele Kaplun | Clened fulltext indexing logging: if extraction method does not exist for a… | Mar 26 2008
caec8e4f6e4e | Samuele Kaplun | Removed notImplemented get_words_from_local_fulltext: it will be anyway… | Mar 26 2008
67226ac65ffd | Tibor Simko | Fixed 175 cases of bad code indentation throughout the codebase. (Please set up… | Mar 25 2008
2b99705c3821 | Tibor Simko | Updated codebase to use CFG_SITE_URL instead of weburl everywhere. Updated… | Mar 12 2008
a0235b21a588 | Samuele Kaplun | Hopefully fixed "[must be sys.stdout or sys.stderr] spurious messages. | Mar 12 2008
8b53ea007afc | Tibor Simko | Fixed verbosity of some necessary output messages. | Mar 11 2008
b6d3dcd69474 | Samuele Kaplun | Different small fixes after pylint. | Mar 11 2008
6dc20e3ca587 | Tibor Simko | Updated conf files. Replaced old style variable names (e.g. WEBURL) with new… | Mar 10 2008
0035392b57e0 | Samuele Kaplun | Fixed bug about last_update_time of an index no more updated. | Feb 28 2008
abe7f722fb22 | Samuele Kaplun | Disabled idxPHRASE indexing until websearch will exploit it. | Feb 5 2008
eceb995a1b50 | Tibor Simko | Updated copyright years. | Feb 4 2008
c555c383303e | Samuele Kaplun | Made washing of terms configurable. idxPHRASEs should infact not wash terms (i. | Jan 18 2008
22223b2f9e3d | Samuele Kaplun | Implemented idxPHRASE indexes that should help when searching for exact phrases… | Jan 15 2008
0affefbcb29c | Samuele Kaplun | Moved stemming support from static index config variable structure to column… | Jan 10 2008
229092586df1 | Samuele Kaplun | Fixed stemming applyed to all the indexes, by adding… | Dec 7 2007
17c7f225d6e1 | Samuele Kaplun | Ported to bibdocfile, implemented -R,--reindex feature, directly reading of… | Dec 4 2007
1115e53b3879 | Tibor Simko | Get rid of dbquery's escape_string() and cStringIO factory, escaping SQL… | Nov 6 2007
e72ca5f81610 | Tibor Simko | Fixed truncation of 50+ bytes long index terms, respecting strictly UTF-8… | Nov 6 2007
3bbb7e469769 | Tibor Simko | Fixed syntax error in the fulltext indexing branch. | Oct 18 2007
6048dd37581e | Tibor Simko | Make HTM and HTML files also fulltext-indexable. This is needed for articles… | Oct 16 2007
cfdd1ea34eed | Samuele Kaplun | New intbitset with universe support. | Aug 9 2007
602cc38d3271 | Samuele Kaplun | Correctly supporting -V flag in bibtasks. | Aug 7 2007
7a3572e4228f | Samuele Kaplun | Friday commit: removed dependency to Numeric. Migrated HitSet & Co. to… | Aug 3 2007
7b2f8868dcd2 | Samuele Kaplun | Refactored bibupload, bibindex and bibreformat tasks to new bibtask cleaned… | Jun 8 2007
c9303324e339 | Samuele Kaplun | Ported to new BibTask class | Jun 6 2007
9657e2ace771 | Samuele Kaplun | Moved from the obsolete sre module to re. | May 22 2007
db71f1e58654 | Tibor Simko | Clarified emergency output messages in case unrepairable errors were found… | May 15 2007
f5dd8d538885 | Tibor Simko | Handle UnicodeEncodeError related problems when printing messages in… | Mar 27 2007
df75ba0b1feb | Tibor Simko | Updated copyright years (2007). | Feb 14 2007
7cbd91d4ea9b | Tibor Simko | When authentifying user-supplied email or nickname, use two SQL queries rather… | Nov 1 2006
af4a85dac604 | Tibor Simko | Fixed config wildcard import everywhere. Fixed some randomly spotted import… | Sep 20 2006
ec761f4dff4b | Tibor Simko | Harmonized CFG_PATH_* and CFG_WEBSEARCH_* config variable names. | Sep 18 2006
770420bc2854 | Tibor Simko | Renamed cfg_bibindex_* config variables to follow the uppercase model. | Sep 15 2006
2311f7764a03 | Tibor Simko | Fixed the kwalitee problem of re-exposing some of the global config variables… | Sep 13 2006
0674ca3c5a30 | Tibor Simko | Fixed the kwalitee problem of re-exposure of some of the global config… | Sep 13 2006
dbc8be0c4e5d | Tibor Simko | Used urllib2 instead of urllib, mkstemp() instead of mktemp(), and removed some… | Aug 29 2006
536257883e68 | Tibor Simko | Updated every BibSched task to follow the BibTaskEx example of handling task… | Aug 29 2006
31f485a1e848 | Nicholas Robinson | Fixed a bug relating to the indexation of fulltexts: When the contents of a PDF… | Aug 9 2006
5aac7386e47f | Tibor Simko | BibSched task authentication now accepts nicknames as usernames too. Modified… | Jun 29 2006
1fe7d8836f40 | Tibor Simko | Changed database access code in order not to depend on MySQLdb but rather on… | Jun 20 2006
18b100fe8575 | Tibor Simko | Implemented name change CDSware to CDS Invenio. Also, introduced new configure… | May 4 2006
d5b36a574e3c | Tibor Simko | Updated copyright years. | May 2 2006
6b1700acdd20 | Tibor Simko | Python imports are now done in an absolute way (from cdsware.foo import bar)… | Dec 20 2005
561825a6b79f | Tibor Simko | Quick fix in order to be able to fulltext-index Indico URLs that do not contain… | Dec 5 2005
d41001de533e | Tibor Simko | Getting rid of WML. | May 12 2005
e694f41220b7 | Tibor Simko | Fixed add_recIDs_by_date() in case user wants to index records based on… | Mar 29 2005
84b18f1b27c5 | Tibor Simko | Updated copyright years. | Jan 6 2005
b2dc486ac9ea | Tibor Simko | Fixed a bug when calculating index id from tablename for indexes greater than… | Jan 6 2005
89932a344b16 | Tibor Simko | Changes due to the move of cfg_* variables into the general config file. | Dec 16 2004
5689d701af1c | Tibor Simko | Changed logic when applying stemming (first) and stopword check (second). | Dec 15 2004
25b3ea26e6b4 | Tibor Simko | Fixed regexp for HTML removal. | Dec 15 2004
5f3193cec995 | Tibor Simko | Precompile all major regexps to speed up the execution. (About 10% faster… | Dec 15 2004
ceb6867786d0 | Trond Aksel Myklebust | Added stemming/stopword functionality. Check /lib/bibindex_engine_config.py for… | Nov 19 2004
bc726da81963 | Tibor Simko | Activate our fancy non-interactive URL opener for fail-safe fulltext-indexing… | Sep 27 2004
220b909f90a4 | Trond Aksel Myklebust | Modified error messages when not authorized | Sep 23 2004
e8185fb0dfaf | Tibor Simko | Eliminated excessive write_message() function. Reduced serialize/deserialize… | Sep 8 2004
87ea8a9e07df | Tibor Simko | Added option to fulltext-index local files only, i.e. remote URLs are skipped… | Jul 7 2004
5df73cfb8570 | Tibor Simko | BibIndex monolithic bin file split into a small bin file and large lib engine… | Jun 15 2004