Merge branch 'working'
Description
Details
- Committed: lintool <jimmylin@umd.edu>, Mar 16 2014, 00:23
- Pushed: dportabella, Oct 19 2016, 16:29
- Parents:
  - R1473:854e5d13f12c: Update README.
  - R1473:0e4d1592c244: Merge branch 'working'
- Branches: Unknown
- Tags
Merged Changes
Commit | Author | Details | Committed
---|---|---|---
854e5d13f12c | lintool | Update README. | Mar 16 2014
7a93263997f3 | lintool | More POM tweaks to get MR jobs running correctly. | Mar 16 2014
2f51acbc5fbe | lintool | Tweaked POM | Mar 15 2014
fdb75decbdf4 | lintool | Whitespace. | Mar 15 2014
d86dcb73c854 | lintool | Merge branch 'extract-links' of https://github.com/Jeffyrao/warcbase into… | Mar 15 2014
5d382f5fb4f7 | Jeffyrao | Update README.md | Jan 4 2014
142082d20eec | Jeffyrao | Update README.md | Jan 4 2014
3c1f4ccccc0c | Jinfeng Rao | modify UriMappingBuilder in pom.xml | Dec 9 2013
8a3cb8c12dca | Jinfeng Rao | build hadoop job using Maven Assembly Plugin, modified pom.xml, added hadoop… | Dec 8 2013
2b49ec425597 | Jeffyrao | check text/html type and modify Jsoup.parse charset as ISO-8859-1 | Dec 8 2013
2e7e0cb81775 | Jeffyrao | modify UriMappingBuilder to read all files under given directory | Dec 7 2013
c4248d61b8b7 | lintool | Simple MapReduce program to count number of unique URLs. | Dec 7 2013
8e1cadb7e70e | Jeffyrao | Extract links and Lucene FST for URLs. | Dec 6 2013
7ee8dbb88c3c | lintool | Improved error checking for dates. | Nov 25 2013
22dd0757c01e | lintool | Tweaked browser; added MR programs for simple content analysis. | Nov 23 2013
b5b4e211ea05 | lintool | Hadoop InputFormats for ARC and WARC files + simple demos. | Nov 22 2013