Homec4science

Merging with master.

Authored by Milad Gholami <milad621@gmail.com> on Jun 26 2014, 16:16.

Description

Merging with master.

Details

Committed
Milad Gholami <milad621@gmail.com>Jun 26 2014, 16:16
Pushed
dportabellaOct 19 2016, 16:29
Parents
R1473:113332758f06: Fixed issues #45, #46, #49, #50
R1473:149ce6cf969f: Fixing git history.
Branches
Unknown
Tags
Unknown

Event Timeline

Milad Gholami <milad621@gmail.com> committed R1473:60a9b96beebe: Merging with master. (authored by Milad Gholami <milad621@gmail.com>).Jun 26 2014, 16:16

Merged Changes

CommitAuthorDetailsCommitted
113332758f06lintool
Fixed issues #45, #46, #49, #50 
Jun 18 2014
3b8484a944a8lintool
More work on the admin interface. 
Jun 18 2014
f3015cd7ba4blintool
issue #50 
Jun 18 2014
10d60bd28c75lintool
Fixed broken merge. 
Jun 17 2014
6c452cbb6b5dlintool
Merge branch 'master' into admin 
Jun 17 2014
71d5a4e0803elintool
fixed issue #43 and issue #48 
Jun 17 2014
781fe4247b31lintool
Fixed issue #48 
Jun 17 2014
5f62c2f6fb9dlintool
Added comment. 
Jun 17 2014
d17dddca8deblintool
Minor refactoring. 
Jun 17 2014
ee7f7749a30alintool
Initial working version of anchor text inversion program: issue #43 
Jun 17 2014
02c26d6d8ea4lintool
Started working on issue #46 cleanup of org.warcbase.data.Util 
Jun 17 2014
c7a7247d5fa2lintool
Appears to have fixed issue #49, starting work on admin tool, issue #45. 
Jun 17 2014
59c50bb33254lintool
Fixed issue #42 and issue #44 
Jun 16 2014
a6be4375e0c7lintool
ExtractLinks using HBase appears to be working. 
Jun 13 2014
21a07efb350elintool
Refactored HDFS extractor; HBase extractor still broken. 
Jun 13 2014
bbc73ab64808lintool
Merge branch 'master' into refactoring 
Jun 12 2014
6d50f37d6adalintool
Merge branch 'hbase_experiments' 
Jun 12 2014
273e5969e943lintool
Light refactoring, pushed column family filter into scan. 
Jun 12 2014
3283bb8512eflintool
Alternative implementation based on iterating over maps... slightly slower. 
Jun 12 2014
dbfbcb0b3c7elintool
More light refactoring. 
Jun 12 2014
34273fdb935aJeffyrao
add hbase option for ExtractLinks 
Jun 12 2014
cfb89d2d3379Jeffyrao
reformat Jinfeng's code 
Jun 12 2014
b02f33c9b43alintool
Refactoring, code cleanup. 
Jun 11 2014
d4c29085ee12lintool
Fixed issue #39 
Jun 11 2014
c3a4348e1250lintool
Debugged HBase scan parameters so that they don't knock over region servers… 
Jun 11 2014
5425313990dalintool
Fixed Issues #31, #32, #40, #41 
Jun 5 2014
20ef503dbfe5lintool
Moving ExtractLinks and ExtractSiteLinks into analysis.graph package, per Issue… 
Jun 5 2014
dc58365a9579lintool
Extracts values at different timestamps. 
Jun 5 2014
60b827d1c174lintool
Refactored getIdRange method signature, add more test cases to UriMapping. 
Jun 5 2014
a6b17787ccd1lintool
Merge branch 'extract-links' of github.com:Jeffyrao/warcbase into refactoring 
Jun 5 2014
7fc1b92d60a7Jeffyrao
fix issue 40 that UriMapping prefix search should return empty result when no… 
Jun 5 2014
a8953616248clintool
Refactoring, added test case (currently broken). 
Jun 4 2014
6518836b7ea0Jeffyrao
fix issue 32, update ExtractSiteLinks code 
Jun 3 2014
a17e3c74954aJeffyrao
fix issue 30, add ExtractSiteLinks code 
May 29 2014
3f36eff4f429Jeffyrao
fix issue 31 
May 27 2014
b4ab2ea0d499lintool
Initial MapReduce over HBase demo. 
May 26 2014
0598dcd91073Jeffyrao
more edits 
May 25 2014
cad5eddceb19Jeffyrao
remove javacsv dependency, add opencsv dependency 
May 25 2014
f0e36b46b872Jinfeng Rao
Merge remote-tracking branch 'upstream/master' into extract-links 
May 25 2014
7b763c0aa80alintool
Successfully re-ingested c108 collection (both parts) Fixed issue #33 
May 24 2014
7c21e2df9267lintool
Fixed robustness and OOM issues when ingesting corrupt ARC files. 
May 23 2014
a19dcc54addalintool
Fixed issues #15 
May 22 2014
fcc4acf98a6flintool
Simple program to find URL patterns in archives. 
May 22 2014
24d7d8e08807Jeffyrao
modified ExtractSiteLinks.java, changed to read csv format of prefix input 
Apr 20 2014