Homec4science

Merge branch 'master' of https://github.com/lintool/warcbase

Authored by Jeremy Wiebe <jwiebe@gmail.com> on Dec 8 2014, 19:57.

Description

Merge branch 'master' of https://github.com/lintool/warcbase

Conflicts:
pom.xml
src/main/java/org/warcbase/browser/WarcBrowserServlet.java
src/main/java/org/warcbase/data/HBaseTableManager.java

Event Timeline

Jeremy Wiebe <jwiebe@gmail.com> committed R1473:0379553cc55c: Merge branch 'master' of https://github.com/lintool/warcbase (authored by Jeremy Wiebe <jwiebe@gmail.com>).Dec 8 2014, 19:57

Merged Changes

CommitAuthorDetailsCommitted
df7f8711c64flintool
Fixed issues #96, #98, #99 
Oct 31 2014
b8915343f5c9lintool
Minor refactoring. 
Oct 31 2014
0145d74ed35elintool
Refactoring to create method that extracts MIME from WARC response records. 
Oct 22 2014
d491020c8287lintool
Merge branch 'pig' into warc 
Oct 19 2014
410cfd81a069lintool
WARC-related Hadoop bindings. 
Oct 19 2014
f4a249469cbclintool
Minor refactoring. 
Oct 19 2014
0c52caed50cflintool
Pig ArcLoader exports its own ResourceSchema. 
Oct 19 2014
e791bec61020lintool
Fixed issues #93, #94, #95 
Oct 19 2014
da7a94be2c70lintool
Added simple Pig script. 
Oct 19 2014
3af097ea655dlintool
Fixed test cases. 
Oct 19 2014
4ceefef78cdclintool
Pig loader materializes the actual content. 
Oct 19 2014
58df7da00880lintool
Refactored Pig loaded to use WAC API. 
Oct 18 2014
63e8352c664blintool
Fixed issues #90, #91, #92 
Oct 18 2014
722b6402b98blintool
Updates. 
Oct 18 2014
a48bb9781feblintool
Merge branch 'master' of github.com:rwolniak/warcbase into warc 
Oct 18 2014
21397e4e4ff3lintool
upgraded to webarchive-commons 1.1.4. 
Oct 18 2014
da890acb0be1rwolniak
Updated Building the URL Mapping Instructions to include instruction to move… 
Oct 13 2014
de4267bf28f1lintool
Updated documentation. 
Oct 13 2014
27b0038b631frwolniak
Testing commit and push + README update 
Oct 8 2014
457a71345d2blintool
Merge branch 'master' into warc 
Sep 15 2014
8fae0c067d68lintool
Fixed issue #87 
Sep 14 2014
05db518ccd83lintool
Added timing info. 
Sep 14 2014
157df31c0f15lintool
Cleanup. 
Sep 14 2014
85c5b4a2ecc5lintool
Added debug output; Fixed deprecated HBase APIs. 
Sep 14 2014
159596e9b378lintool
Fixed build issues in upgrade to CDH 5.1.2. 
Sep 13 2014
4798a4314b6elintool
Figured out how to extract MIME type and date from WARC. 
Aug 30 2014
f3516c7fd7f0lintool
Added test cases to try loading WARC records from a stream; back-ported same… 
Aug 29 2014
28c5c007f4fdlintool
Added simple test case. 
Aug 28 2014
8e67d49d44b3lintool
WARC sample from https://archive.org/details/ExampleArcAndWarcFiles 
Aug 28 2014