Removed content of <script> and <style> tags during the HTML-to-text processing…

Authored by Jerome Caffaro <jerome.caffaro@cern.ch> on Jun 22 2007, 08:26.

Description

Removed content of <script> and <style> tags during the HTML-to-text processing in function get_as_text(..). Stripped leading and trailing spaces from the generated text output. Removed newlines (\n) from the HTML input. Documented functions.
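
The three behaviours described above can be illustrated with a minimal sketch. This is not the code from this commit: the names html_to_text_sketch and _TextExtractor are hypothetical, the use of Python's standard html.parser module is an assumption, and newlines are assumed to be deleted outright rather than replaced by spaces.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes, skipping <script>/<style> content."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside a skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside <script> or <style>.
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        return "".join(self._chunks)


def html_to_text_sketch(html):
    """Hypothetical stand-in for the behaviour described in the commit message."""
    # Assumption: newlines in the HTML input are removed, not turned into spaces.
    html = html.replace("\n", "")
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    # Strip leading and trailing whitespace from the generated text.
    return parser.text().strip()
```

Calling html_to_text_sketch on a fragment that contains a <script> block returns only the visible text, with surrounding whitespace trimmed.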

Event Timeline

Jerome Caffaro <jerome.caffaro@cern.ch> committed R3600:c95739dfbceb: Removed content of <script> and <style> tags during the HTML-to-text processing… (authored by Jerome Caffaro <jerome.caffaro@cern.ch>). Jun 22 2007, 08:26