You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (Jira)" <ji...@apache.org> on 2021/01/24 12:38:00 UTC

[jira] [Assigned] (SPARK-34214) Expose regexp_extract_all to PySpark

     [ https://issues.apache.org/jira/browse/SPARK-34214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-34214:
------------------------------------

    Assignee:     (was: Apache Spark)

> Expose regexp_extract_all to PySpark
> ------------------------------------
>
>                 Key: SPARK-34214
>                 URL: https://issues.apache.org/jira/browse/SPARK-34214
>             Project: Spark
>          Issue Type: Task
>          Components: PySpark
>    Affects Versions: 3.1.2
>            Reporter: André Sá de Mello
>            Priority: Major
>
> I've often come across use cases for regexp_extract_all while working with PySpark code, and UDFs implementing this functionality are very poorly performant. Given the size of the PySpark community, it would be very valuable to have it exposed in the PySpark API.
> All the rationale for why regexp_extract_all is useful is expressed SPARK-24884.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org