You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Feynman Liang (JIRA)" <ji...@apache.org> on 2015/06/18 23:11:01 UTC

[jira] [Created] (SPARK-8457) Documentation for N-Gram feature transformer

Feynman Liang created SPARK-8457:
------------------------------------

             Summary: Documentation for N-Gram feature transformer
                 Key: SPARK-8457
                 URL: https://issues.apache.org/jira/browse/SPARK-8457
             Project: Spark
          Issue Type: Documentation
          Components: ML
            Reporter: Feynman Liang
            Priority: Trivial


Documentation for using n-gram feature transformer needs to be written. In particular, we need to describe the input (ArrayType(StringType)) and output (ArrayType(ArrayType(StringType)) formats, nullability constraints (input is nullable, output is not) and behaviors (null values are ignored in generating n-grams), as well as edge cases (e.g. n-gram length > input sequence length will  yield a single output n-gram equal to the input sequence).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org