Prototype integration of Wayback/Warcbase via REST API on HBase.
Issues #53 and #54.
Merge branch 'wayback-integration'
lintool <jimmylin@umd.edu> | Aug 12 2014, 02:42 |
dportabella | Oct 19 2016, 16:29 |
Commit | Author | Details | Committed | |||
---|---|---|---|---|---|---|
eb4893e7201c | lintool | Merge branch 'cleanup' into wayback-integration | Aug 12 2014 | |||
c665a23bf1f3 | lintool | Fixes issue #58: Wayback reads directly from REST API instead of writing and… | Aug 12 2014 | |||
ffec5b2caa3d | lintool | Fixed issue #59: Unable to fetch URLs from archive with '?' in them | Aug 12 2014 | |||
0f4f541c24d7 | lintool | Fixed issues with fetching URLs with spaces in them. | Aug 12 2014 | |||
b5ccb86d330e | lintool | Better handling of errors: when REST API is unavailable, when URL isn't found… | Aug 11 2014 | |||
515bab098cff | lintool | Code cleanup for browser code; removed unneeded files and associated web files. | Aug 11 2014 | |||
74f3dece6c62 | lintool | Merge branch 'rest-api-bug-fix' of github.com:lintool/warcbase into wayback… | Aug 11 2014 | |||
3075ca19d76d | lintool | Converted host/port/table information to bean settings. | Aug 11 2014 | |||
37e97073d57c | lintool | Simplified code. | Aug 11 2014 | |||
bca5b6944ac7 | lintool | Refactoring; mostly reformatting. | Aug 11 2014 | |||
9e8765b4e3f0 | lintool | Minor fix for NPE when capture isn't in HBase. | Aug 11 2014 | |||
4f387e8952a7 | lintool | Initial check-in of Warcbase integration points with Open Wayback. | Aug 11 2014 | |||
03d5c5365481 | lintool | /*/ query returns MIME type. | Aug 11 2014 | |||
65b6dc5b7138 | lintool | Refactor to confirm to /*/ of Wayback to fetch list of available versions. | Aug 10 2014 | |||
096878f5f932 | lintool | Switched over to 14 digit dates for URLs to align with Wayback. Further… | Aug 10 2014 | |||
7f9764b1a793 | lintool | Cleaned up servlet fetch code. | Aug 10 2014 | |||
b685d65eba7b | lintool | Fixed 14 digit date parsing issue (now uses ArchiveUtils); was an issue with… | Aug 10 2014 | |||
bc6aab1ffd21 | lintool | Fixed a few minor ingestion issues. | Aug 10 2014 | |||
b20ef84e5df9 | lintool | Refactoring; removing WARC ingestion for now. | Aug 10 2014 | |||
cfe508a831f0 | lintool | Tweaks to ingest code. | Aug 10 2014 | |||
180b57fa5dc1 | lintool | Janky, but seems to work: ingesting and serving up raw ARC records. | Aug 10 2014 | |||
8b5a6be9db61 | lintool | Quick and dirty switch over to webarchive-commons API; stores raw ARC records. | Aug 10 2014 |