Fixed issue #137 Integrate with warc-hadoop-indexer for Shine
Description
Details
- Committed: lintool <jimmylin@umd.edu>, Jul 22 2015, 18:11
- Pushed: dportabella, Oct 19 2016, 16:29
- Parents:
  - R1473:fac3312c1785: Upgraded to a stable warc-hadoop-indexer artifact (2.2.0-BETA-5)
  - R1473:11dfe152d4cc: Added ExtractBoilerpipeText UDF
- Branches: Unknown
- Tags:
Merged Changes
Commit | Author | Details | Committed
---|---|---|---
fac3312c1785 | lintool | Upgraded to a stable warc-hadoop-indexer artifact (2.2.0-BETA-5) | Jul 22 2015
17393c207934 | lintool | Complete revamped cmdline parameter handling. | Jul 21 2015
0be900c474ef | lintool | Simplified specification of configs. | Jul 21 2015
243004be2ded | lintool | Config for WARCIndexer. | Jul 21 2015
b520faa88283 | lintool | Refactoring to simplify code. | Jul 21 2015
7936dd172aae | lintool | Fixed: https://github.com/ukwa/webarchive-discovery/issues/64 | Jul 14 2015
34c47c6ba9bc | lintool | Copy dependencies into solr home. | Jul 13 2015
38ac5fb5cba5 | lintool | Rename. | Jul 13 2015
151242d2d988 | lintool | Upgrade CDH; fixed broken tests due classpath conflict issue and Tika upgrade. | Jul 13 2015
a3eb65567484 | lintool | More code simplification and refactoring. | Jul 13 2015
46d7cef43ee3 | lintool | Refactoring to simplify code. | Jul 13 2015
befb3f9b39fb | lintool | Reformat code. | Jul 13 2015
04f416e56781 | lintool | Getting rid of external dependencies by checking in Solr configs. | Jul 13 2015
335d751b425c | lintool | Starting to incorporate Hadoop WARC indexing code. | Jul 13 2015