R1473/pom.xmlmaster
R1473/pom.xml
master
History Graph
History Graph
Commit | Author | Details | Committed | |||
---|---|---|---|---|---|---|
19be6edfa8cd | lintool | Upgraded to CDH 5.7.1. | Jun 24 2016 | |||
e8c43f1b9d24 | lintool | Cleaned up and simplified dependencies, etc. | Jun 17 2016 | |||
6e8740a55414 | lintool | Integrating warcbase-hbase artifact. | Jun 16 2016 | |||
f972206db516 | lintool | Created warcbase-core module. | Jun 16 2016 | |||
52eb2f96304d | lintool | downgraded guava to fix borked tests, cf https://issues.apache. | Mar 29 2016 | |||
2ffc60cc178e | Jeremy Wiebe | Merge branch 'graphx' | Mar 29 2016 | |||
19eefb5d7f2c | Jeremy Wiebe | Revert to old CDH (new version produces errors on cluster) | Mar 8 2016 | |||
997d2405a84e | Jeremy Wiebe | Basic graph analysis, initial check-in | Mar 5 2016 | |||
93686da4ff23 | lintool | Downgraded json4s, folds in parsing in the loader. | Mar 5 2016 | |||
f2abb2b5bbc2 | lintool | Initial pass at processing tweets in Warcbase. | Mar 5 2016 | |||
4b8f0fb7b482 | Jeremy Wiebe | Use shapeless to flatten tuples of any arity | Feb 13 2016 | |||
c44118522844 | Jeremy Wiebe | Added matchbox function to NER-classify and generate JSON for visualizer, per… | Nov 26 2015 | |||
dd643f9fae5c | Jeremy Wiebe | Added jackson-databind to pom.xml | Nov 25 2015 | |||
f5d8edf50506 | lintool | Killed all the Pig stuff. | Nov 24 2015 | |||
cc274ed73b60 | Alice-Z | Use Jackson JSON serializers to write to String | Nov 12 2015 | |||
9140519a9d50 | Alice-Z | Fix imports and enable tests on JUnitRunner | Nov 9 2015 | |||
3eb11b04e499 | Alice-Z | Commit clean-up | Nov 8 2015 | |||
8057c46945d0 | Alice-Z | Add Spark support | Nov 3 2015 | |||
ade10b26bca4 | lintool | Upgraded Spark to 1.3 in CDH 5.4.1 | Nov 1 2015 | |||
258a159c796e | lintool | Fixed issue #137 Integrate with warc-hadoop-indexer for Shine | Jul 22 2015 | |||
fac3312c1785 | lintool | Upgraded to a stable warc-hadoop-indexer artifact (2.2.0-BETA-5) | Jul 22 2015 | |||
7936dd172aae | lintool | Fixed: https://github.com/ukwa/webarchive-discovery/issues/64 | Jul 14 2015 | |||
34c47c6ba9bc | lintool | Copy dependencies into solr home. | Jul 13 2015 | |||
151242d2d988 | lintool | Upgrade CDH; fixed broken tests due classpath conflict issue and Tika upgrade. | Jul 13 2015 | |||
46d7cef43ee3 | lintool | Refactoring to simplify code. | Jul 13 2015 | |||
335d751b425c | lintool | Starting to incorporate Hadoop WARC indexing code. | Jul 13 2015 | |||
11dfe152d4cc | Jeremy Wiebe | Added ExtractBoilerpipeText UDF | Jun 30 2015 | |||
988a2354d84d | lintool | Fixed merge conflict that got committed to master accidentally. | Jun 9 2015 | |||
51abf7e86696 | ianmilligan1 | testing fork | Jun 9 2015 | |||
2f462979da0d | lintool | Removed JWAT jars. | Jun 7 2015 | |||
a75d58d5b0ee | lintool | whitespace | May 30 2015 | |||
649f1674352b | lintool | Initial experiments with Python converters to use Hadoop InputFormat from… | May 27 2015 | |||
1c013a7b064f | Jeremy Wiebe | Added Stanford NER UDF | May 26 2015 | |||
9acc8ca74c0a | lintool | Merge branch 'master' into extract-pdf-udf | May 24 2015 | |||
0fd8f3f72822 | lintool | Updated versions of some artifacts. | Dec 23 2014 | |||
5c2540099370 | lintool | Reformatting. | Dec 18 2014 | |||
cea1f0cae974 | rwolniak | Updated PIG Tika Parser with new code that should work. Awaiting test on Hadoop | Dec 9 2014 | |||
21397e4e4ff3 | lintool | upgraded to webarchive-commons 1.1.4. | Oct 18 2014 | |||
157df31c0f15 | lintool | Cleanup. | Sep 14 2014 | |||
159596e9b378 | lintool | Fixed build issues in upgrade to CDH 5.1.2. | Sep 13 2014 | |||
9cbd26b2e3bb | lintool | Minor tweaks. | Aug 19 2014 | |||
c5b4973aba96 | lintool | Merge branch 'master' into selenium | Aug 17 2014 | |||
4090318d2f99 | lintool | Cleaned up scripts. | Aug 17 2014 | |||
b02fd402870f | lintool | Cute Selenium browser to conduct a random walk through the archive. | Aug 17 2014 | |||
ff34480981dd | lintool | Restored some sanity to versions and transitive dependencies. | Aug 14 2014 | |||
b6431bed6f8b | lintool | Minor tweaks. | Aug 14 2014 | |||
72c1afbca77c | lintool | Merge branch 'master' into cneud-integration | Aug 14 2014 | |||
ae5581b740fc | lintool | Bumped up memory, fixed class renaming. | Aug 14 2014 | |||
0bc5d4876199 | lintool | Uri -> Url classes renaming. | Aug 14 2014 | |||
a14cda2217fa | lintool | Commented out LibmagicJnaWrapper functionality because the jar isn't generally… | Aug 12 2014 | |||
93f6d42f4ff4 | lintool | Fixed compile and broken test issues. | Aug 12 2014 | |||
8b5a6be9db61 | lintool | Quick and dirty switch over to webarchive-commons API; stores raw ARC records. | Aug 10 2014 | |||
60a9b96beebe | Milad Gholami | Merging with master. | Jun 26 2014 | |||
c7a7247d5fa2 | lintool | Appears to have fixed issue #49, starting work on admin tool, issue #45. | Jun 17 2014 | |||
37f9b1fce90f | milad621 | openwayback upgraded to 2.0.0.BETA.2 | Jun 16 2014 | |||
cad5eddceb19 | Jeffyrao | remove javacsv dependency, add opencsv dependency | May 25 2014 | |||
52b39cfecc82 | lintool | Minor refactoring. | May 6 2014 | |||
0856f3faddd1 | lintool | Merge branch 'master' of github.com:milad621/warcbase into display_issues | May 6 2014 | |||
627743751578 | milad621 | juniversalchardet removed. | Apr 19 2014 | |||
1e3b5543324b | Jeffyrao | add ExtractSiteLinks in pom.xml | Apr 16 2014 | |||
fd511b6eaf7b | milad621 | Fixed Issue 24. | Apr 6 2014 | |||
758948b463b9 | lintool | Light refactoring, fixed a few errors. | Mar 31 2014 | |||
b83b898c782c | lintool | Merge branch 'jar_upgrade' into disable_wal | Mar 17 2014 | |||
d543a4957916 | lintool | Merge branch 'master' into disable_wal | Mar 17 2014 | |||
8c649e5336af | lintool | Fixed Issue #7 | Mar 17 2014 | |||
5fb9d2a33a9e | Jinfeng Rao | remove assembly plugin for hadoop job | Mar 16 2014 | |||
19e2ff05074a | lintool | openwayback-core pulls in hadoop-core; excludes | Mar 16 2014 | |||
073fd3206185 | lintool | Simplified Maven dependencies; Moved from IA artifacts to openwayback artifacts. | Mar 16 2014 | |||
c8577eac7bbb | lintool | Upgraded to CDH 4.4 jars for HBase and ZK. | Mar 16 2014 | |||
1e0783ff8a6c | lintool | Merge branch 'new_hbase_structure' of https://github.com/milad621/warcbase into… | Mar 16 2014 | |||
7a93263997f3 | lintool | More POM tweaks to get MR jobs running correctly. | Mar 16 2014 | |||
2f51acbc5fbe | lintool | Tweaked POM | Mar 15 2014 | |||
3c1f4ccccc0c | Jinfeng Rao | modify UriMappingBuilder in pom.xml | Dec 9 2013 | |||
8a3cb8c12dca | Jinfeng Rao | build hadoop job using Maven Assembly Plugin, modified pom.xml, added hadoop… | Dec 8 2013 | |||
8e1cadb7e70e | Jeffyrao | Extract links and Lucene FST for URLs. | Dec 6 2013 | |||
944b138ee0d2 | milad621 | started a new branch to extract text from warcbase data and organize the data… | Dec 1 2013 | |||
7e76e19f2547 | milad621 | PrintAllUris add to appassembler which will output a urls.html file with all… | Nov 23 2013 | |||
b5b4e211ea05 | lintool | Hadoop InputFormats for ARC and WARC files + simple demos. | Nov 22 2013 | |||
c3e37d50e856 | milad621 | Created a new runnable to find a uri inside warc/arc files. | Nov 7 2013 | |||
01dff2a1f2a5 | milad621 | One runnable to process both arc and warc files in a folder. | Nov 5 2013 | |||
e9e3d1c9d940 | milad621 | IngestWarcFiles fixed. Now it uses jwat-warc to ingest warcfiles to hbase. | Oct 30 2013 | |||
b1bee7dc3a8e | milad621 | Arc Processing tools added. Not working with hbase yet. | Oct 17 2013 | |||
c860aef8368c | milad621 | URL style changed | Oct 2 2013 | |||
8706a55f8e36 | milad621 | header (navigation bar?) added. Pom update | Sep 17 2013 | |||
19ab27133471 | lintool | Refactored dependencies. | Aug 13 2013 | |||
3a2aec9c5698 | lintool | Cleaned up analysis code. Removed dead code. Added README. | Aug 13 2013 | |||
4fffd3905e9e | lintool | Refactoring of browser; each archive is now stored in its own separate table… | Aug 12 2013 | |||
0f580edb6186 | lintool | Interim check-in, refactoring of ingestion program. | Aug 12 2013 | |||
8c61d21d5167 | milad621 | pom.xml updated | Aug 6 2013 | |||
a65403484d61 | milad621 | pom updated | Jul 28 2013 | |||
e16c1d229b37 | milad621 | pom updated | Jul 28 2013 | |||
41ff1fe630d6 | lintool | Refactored Jetty server into WarcBrowser. | Jul 18 2013 | |||
3d448add8f08 | milad621 | pom.xml fixed | Jul 18 2013 | |||
f3387f289d05 | milad621 | pom | Jul 18 2013 | |||
54b8bb992ca6 | milad621 | testHbase program | Jul 18 2013 | |||
53ad25cf7c10 | milad621 | pom.xml updated | Jul 18 2013 | |||
1dd61995bdf1 | milad621 | pom.xml updated | Jul 17 2013 | |||
1dd7ac9455cc | milad621 | added hadoop-core to pom.xml | Jul 17 2013 | |||
336a5b5d3009 | milad621 | pom.xml updated | Jul 16 2013 | |||
12c390826639 | lintool | Conversion into Maven artifact. | Jul 13 2013 |
c4science · Help