Merging with master.
Description
Details
- Committed: Milad Gholami <milad621@gmail.com>, Jun 26 2014, 16:16
- Pushed: dportabella, Oct 19 2016, 16:29
- Parents:
  - R1473:113332758f06: Fixed issues #45, #46, #49, #50
  - R1473:149ce6cf969f: Fixing git history.
- Branches: Unknown
- Tags
Merged Changes
Commit | Author | Details | Committed
---|---|---|---
113332758f06 | lintool | Fixed issues #45, #46, #49, #50 | Jun 18 2014
3b8484a944a8 | lintool | More work on the admin interface. | Jun 18 2014
f3015cd7ba4b | lintool | issue #50 | Jun 18 2014
10d60bd28c75 | lintool | Fixed broken merge. | Jun 17 2014
6c452cbb6b5d | lintool | Merge branch 'master' into admin | Jun 17 2014
71d5a4e0803e | lintool | fixed issue #43 and issue #48 | Jun 17 2014
781fe4247b31 | lintool | Fixed issue #48 | Jun 17 2014
5f62c2f6fb9d | lintool | Added comment. | Jun 17 2014
d17dddca8deb | lintool | Minor refactoring. | Jun 17 2014
ee7f7749a30a | lintool | Initial working version of anchor text inversion program: issue #43 | Jun 17 2014
02c26d6d8ea4 | lintool | Started working on issue #46 cleanup of org.warcbase.data.Util | Jun 17 2014
c7a7247d5fa2 | lintool | Appears to have fixed issue #49, starting work on admin tool, issue #45. | Jun 17 2014
59c50bb33254 | lintool | Fixed issue #42 and issue #44 | Jun 16 2014
a6be4375e0c7 | lintool | ExtractLinks using HBase appears to be working. | Jun 13 2014
21a07efb350e | lintool | Refactored HDFS extractor; HBase extractor still broken. | Jun 13 2014
bbc73ab64808 | lintool | Merge branch 'master' into refactoring | Jun 12 2014
6d50f37d6ada | lintool | Merge branch 'hbase_experiments' | Jun 12 2014
273e5969e943 | lintool | Light refactoring, pushed column family filter into scan. | Jun 12 2014
3283bb8512ef | lintool | Alternative implementation based on iterating over maps... slightly slower. | Jun 12 2014
dbfbcb0b3c7e | lintool | More light refactoring. | Jun 12 2014
34273fdb935a | Jeffyrao | add hbase option for ExtractLinks | Jun 12 2014
cfb89d2d3379 | Jeffyrao | reformat Jinfeng's code | Jun 12 2014
b02f33c9b43a | lintool | Refactoring, code cleanup. | Jun 11 2014
d4c29085ee12 | lintool | Fixed issue #39 | Jun 11 2014
c3a4348e1250 | lintool | Debugged HBase scan parameters so that they don't knock over region servers… | Jun 11 2014
5425313990da | lintool | Fixed Issues #31, #32, #40, #41 | Jun 5 2014
20ef503dbfe5 | lintool | Moving ExtractLinks and ExtractSiteLinks into analysis.graph package, per Issue… | Jun 5 2014
dc58365a9579 | lintool | Extracts values at different timestamps. | Jun 5 2014
60b827d1c174 | lintool | Refactored getIdRange method signature, add more test cases to UriMapping. | Jun 5 2014
a6b17787ccd1 | lintool | Merge branch 'extract-links' of github.com:Jeffyrao/warcbase into refactoring | Jun 5 2014
7fc1b92d60a7 | Jeffyrao | fix issue 40 that UriMapping prefix search should return empty result when no… | Jun 5 2014
a8953616248c | lintool | Refactoring, added test case (currently broken). | Jun 4 2014
6518836b7ea0 | Jeffyrao | fix issue 32, update ExtractSiteLinks code | Jun 3 2014
a17e3c74954a | Jeffyrao | fix issue 30, add ExtractSiteLinks code | May 29 2014
3f36eff4f429 | Jeffyrao | fix issue 31 | May 27 2014
b4ab2ea0d499 | lintool | Initial MapReduce over HBase demo. | May 26 2014
0598dcd91073 | Jeffyrao | more edits | May 25 2014
cad5eddceb19 | Jeffyrao | remove javacsv dependency, add opencsv dependency | May 25 2014
f0e36b46b872 | Jinfeng Rao | Merge remote-tracking branch 'upstream/master' into extract-links | May 25 2014
7b763c0aa80a | lintool | Successfully re-ingested c108 collection (both parts) Fixed issue #33 | May 24 2014
7c21e2df9267 | lintool | Fixed robustness and OOM issues when ingesting corrupt ARC files. | May 23 2014
a19dcc54adda | lintool | Fixed issues #15 | May 22 2014
fcc4acf98a6f | lintool | Simple program to find URL patterns in archives. | May 22 2014
24d7d8e08807 | Jeffyrao | modified ExtractSiteLinks.java, changed to read csv format of prefix input | Apr 20 2014