Homec4science

Port named entities extractor Pig script over to Spark as per issue #158

Authored by Alice-Z <alice.zhou@gmail.com> on Nov 23 2015, 02:05.

Description

Port named entities extractor Pig script over to Spark as per issue #158

Event Timeline

Alice-Z <alice.zhou@gmail.com> committed R1473:4a3d31b8babf: Port named entities extractor Pig script over to Spark as per issue #158 (authored by Alice-Z <alice.zhou@gmail.com>).Nov 23 2015, 02:05

Merged Changes

CommitAuthorDetailsCommitted
f9422c23efaeAlice-Z
ExtractEntities takes a classifier file path 
Nov 23 2015
99583f793bc5Alice-Z
Revert to object version of NER3Classifier 
Nov 23 2015
533b4152d534Alice-Z
Turn test off; classifier is too large to be included 
Nov 21 2015
582b21adefe6Alice-Z
Pass classifier class to ExtractEntities UDF 
Nov 21 2015
86733707e5a1Alice-Z
add test for ner3classifier 
Nov 21 2015
cc274ed73b60Alice-Z
Use Jackson JSON serializers to write to String 
Nov 12 2015
e9a3965e2389Alice-Z
Clean up string formatting 
Nov 12 2015
d56d62cf1415Alice-Z
Merge branch 'ner3classifier' of github.com:lintool/warcbase into ner3classifier 
Nov 12 2015
9f7fa9f26b54Alice-Z
Working extract entities with correct output string 
Nov 12 2015
c9ccf559fafcAlice-Z
WIP: port NER3 Classifier over to Java with example usage 
Nov 12 2015
31269caa44b6Alice-Z
WIP: port NER3 Classifier over to Java with example usage 
Nov 12 2015