History Graph
History Graph
Commit | Author | Details | Committed | |||
---|---|---|---|---|---|---|
1ee0455efdf6 | Alice-Z | remove extract methods | Nov 11 2015 | |||
2c2607867ef1 | Alice-Z | Add keepValidPages transformation and layer for counting in Spark | Nov 11 2015 | |||
e1be481cd782 | Alice-Z | Clean up extracting code, use pattern matching | Nov 10 2015 | |||
3957b2cce7fb | Alice-Z | Clean up enum to function mapping | Nov 9 2015 | |||
9140519a9d50 | Alice-Z | Fix imports and enable tests on JUnitRunner | Nov 9 2015 | |||
3eb11b04e499 | Alice-Z | Commit clean-up | Nov 8 2015 | |||
8057c46945d0 | Alice-Z | Add Spark support | Nov 3 2015 | |||
35e39da6e72d | Alice-Z | add extractCrawldateDomainUrlBody | Oct 23 2015 | |||
e5821091ecca | Alice-Z | update ArcRecords interface | Oct 21 2015 | |||
cfe76f600c7d | Alice-Z | extend RDD | Oct 15 2015 | |||
e77201892789 | Alice-Z | first commit | Oct 15 2015 | |||
7c1ebde17fb4 | lintool | Better integration of ARC readers in pyspark. | Jun 4 2015 | |||
b220b0cc175a | lintool | Converter to extract all metadata. | May 27 2015 | |||
649f1674352b | lintool | Initial experiments with Python converters to use Hadoop InputFormat from… | May 27 2015 |
c4science · Help