Posted to issues@spark.apache.org by "Loic Descotte (JIRA)" <ji...@apache.org> on 2017/02/07 15:48:42 UTC

[jira] [Comment Edited] (SPARK-19492) Dataset, filter and pattern matching on elements

    [ https://issues.apache.org/jira/browse/SPARK-19492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15856208#comment-15856208 ] 

Loic Descotte edited comment on SPARK-19492 at 2/7/17 3:48 PM:
---------------------------------------------------------------

[~srowen] That is expected: in your example the compiler cannot infer the type on its own. But when the structure is typed, like a Seq[T], it should work. Since this works for RDDs and all Scala collections, I don't think there is anything special about Seq.
More explanation here: http://stackoverflow.com/a/12869583/591922

Edit: I just found out that it actually works with map, but not with filter.
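
In the meantime, two workarounds appear to avoid the error. The snippet below is only a minimal sketch: it uses a hypothetical stand-in class, Box, with an overloaded filter instead of a real Dataset, so that it runs in a plain Scala REPL without Spark. The Department definition is assumed, since the report does not show it, and the idea that the overloads of Dataset.filter are what prevent the inference is my assumption, not something confirmed in this thread.

```scala
// Assumed definition; the report does not show the case class.
final case class Department(id: Int, name: String)

// Hypothetical stand-in for a Dataset: `filter` is overloaded here,
// as Dataset.filter is (function, Column and String variants).
final class Box[T](val items: Seq[T]) {
  def filter(f: T => Boolean): Box[T] = new Box(items.filter(f))
  // Placeholder overload; it exists only to make `filter` overloaded.
  def filter(conditionExpr: String): Box[T] = this
}

val departments = new Box(Seq(Department(1, "hr"), Department(2, "it")))

// Workaround 1: name the parameter, then match on it explicitly.
val viaMatch = departments.filter { d =>
  d match { case Department(_, name) => name == "hr" }
}

// Workaround 2: give the pattern-matching function an explicit type first,
// so the compiler knows the parameter type before `filter` is resolved.
val isHr: Department => Boolean = { case Department(_, name) => name == "hr" }
val viaTypedFunction = departments.filter(isHr)
```

Both forms keep the destructuring and hand filter a function whose parameter type is already known, which is what the error message asks for.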



> Dataset, filter and pattern matching on elements
> ------------------------------------------------
>
>                 Key: SPARK-19492
>                 URL: https://issues.apache.org/jira/browse/SPARK-19492
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.0.2, 2.1.0
>            Reporter: Loic Descotte
>            Priority: Minor
>
> It seems to be impossible to use pattern matching to destructure the input parameter of functions passed to filter, map, etc. on Datasets.
> Example:
> This one works:
> {code}
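> // Assumed definition; the original report does not show it.
> case class Department(id: Int, name: String)
> // Needs `import spark.implicits._` in scope for .toDS
> val departments = Seq(
>     Department(1, "hr"),
>     Department(2, "it")
> ).toDS
> departments.filter { d =>
>   d.name == "hr"
> }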
> {code}
> but this one does not:
> {code}
> departments.filter { case Department(_, name) =>
>   name == "hr"
> }
> {code}
> Error:
> {code}
> error: missing parameter type for expanded function
> The argument types of an anonymous function must be fully known. (SLS 8.5)
> Expected type was: ?
>     departments.filter{ case Department(_, name)=>
> {code}
> This kind of pattern matching should work (since the element type of the departments Dataset is known), as it does with the filter function of Scala collections or of RDDs.


