You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by Russell Jurney <ru...@gmail.com> on 2013/02/19 00:16:50 UTC
Review Request: Review for PIG-3190, add LuceneTokenize and SnowballTokenize
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/9511/
-----------------------------------------------------------
Review request for pig, Alan Gates, Prashant Sharma, Jonathan Coveney, and Gunther Hagleitner.
Description
-------
PIG-3190 adds two 'sane' tokenizers to Pig
This addresses bug PIG-3190.
https://issues.apache.org/jira/browse/PIG-3190
Diffs
-----
ivy.xml 70e8d50
src/org/apache/pig/builtin/LuceneTokenize.java PRE-CREATION
src/org/apache/pig/builtin/SnowballTokenize.java PRE-CREATION
test/org/apache/pig/test/TestLuceneTokenize.java PRE-CREATION
test/org/apache/pig/test/TestSnowballTokenize.java PRE-CREATION
test/org/apache/pig/test/data/ExpectedLuceneTokens.txt PRE-CREATION
test/org/apache/pig/test/data/ExpectedSnowballTokens.txt PRE-CREATION
test/org/apache/pig/test/data/InputFiles/ten_enron_emails.txt PRE-CREATION
Diff: https://reviews.apache.org/r/9511/diff/
Testing
-------
Runs locally for me, two unit tests pass
Thanks,
Russell Jurney