bibclassify.in
File Metadata

Created
Tue, May 21, 04:43

#!@PYTHON@
## -*- mode: python; coding: utf-8; -*-
##
## This file is part of CDS Invenio.
## Copyright (C) 2002, 2003, 2004, 2005, 2006, 2007, 2008 CERN.
##
## CDS Invenio is free software; you can redistribute it and/or
## modify it under the terms of the GNU General Public License as
## published by the Free Software Foundation; either version 2 of the
## License, or (at your option) any later version.
##
## CDS Invenio is distributed in the hope that it will be useful, but
## WITHOUT ANY WARRANTY; without even the implied warranty of
## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
## General Public License for more details.
##
## You should have received a copy of the GNU General Public License
## along with CDS Invenio; if not, write to the Free Software Foundation, Inc.,
## 59 Temple Place, Suite 330, Boston, MA 02111-1307, USA.
"""CDS Invenio BibClassify.

It extracts keywords from a pdf or text file based on a thesaurus.

Usage: bibclassify [options]

Examples:
    bibclassify -f file.pdf -k thesaurus.txt -o TEXT
    bibclassify -t file.txt -K ontology.rdf -m SLOW

Specific options:
    -f, --pdffile=FILENAME    name of the pdf file to be classified
    -t, --textfile=FILENAME   name of the text file to be classified
    -k, --thesaurus=FILENAME  name of the text thesaurus (taxonomy)
    -K, --ontology=FILENAME   name of the RDF or OWL ontology (experimental)
    -o, --output=HTML|TEXT    output list of keywords in either HTML or text
    -n, --nkeywords=NUMBER    max number of keywords to be found
    -m, --mode=FAST|SLOW      processing mode: FAST (run on abstract and
                              selected pages), SLOW (run on whole document -
                              more accurate)

General options:
    -h, --help                print this help and exit
    -V, --version             print version and exit
"""
try:
    import sys
    from invenio.bibclassifylib import main
except ImportError, e:
    print "Error: %s" % e
    import sys
    sys.exit(1)

main()