WebSubmit: re-implementation of pdf2hocr2pdf
- Re-implemented pdf2hocr2pdf fixing several shortcomings.
- Uses pyPdf library to put the recognized text directly under the original pages, instead of assembling a new PDF from rasterized, rotated, deskewed images. (closes #17)