Homec4science

Merge branch 'extract-links' of https://github.com/Jeffyrao/warcbase into…

Authored by lintool <jimmylin@umd.edu> on Mar 15 2014, 20:51.

Description

Merge branch 'extract-links' of https://github.com/Jeffyrao/warcbase into working

Details

Event Timeline

lintool <jimmylin@umd.edu> committed R1473:d86dcb73c854: Merge branch 'extract-links' of https://github.com/Jeffyrao/warcbase into… (authored by lintool <jimmylin@umd.edu>).Mar 15 2014, 20:51

Merged Changes

CommitAuthorDetailsCommitted
5d382f5fb4f7Jeffyrao
Update README.md 
Jan 4 2014
142082d20eecJeffyrao
Update README.md 
Jan 4 2014
3c1f4ccccc0cJinfeng Rao
modify UriMappingBuilder in pom.xml 
Dec 9 2013
8a3cb8c12dcaJinfeng Rao
build hadoop job using Maven Assembly Plugin, modified pom.xml, added hadoop… 
Dec 8 2013
2b49ec425597Jeffyrao
check text/html type and modify Jsoup.parse charset as ISO-8859-1 
Dec 8 2013
2e7e0cb81775Jeffyrao
modify UriMappingBuilder to read all files under given directory 
Dec 7 2013
8e1cadb7e70eJeffyrao
Extract links and Lucene FST for URLs. 
Dec 6 2013