You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Jorge (Jira)" <ji...@apache.org> on 2020/08/23 15:12:00 UTC
[jira] [Created] (ARROW-9836) [Rust] [DataFusion] Improve API for
usage of UDFs
Jorge created ARROW-9836:
----------------------------
Summary: [Rust] [DataFusion] Improve API for usage of UDFs
Key: ARROW-9836
URL: https://issues.apache.org/jira/browse/ARROW-9836
Project: Apache Arrow
Issue Type: Improvement
Components: Rust, Rust - DataFusion
Reporter: Jorge
TL;DR; currently, users call UDFs through
{color:#000000}df.select(scalar_functions(“sqrt”, vec![col(“a”)], DataType::Float64)){color}
Proposal:
{color:#000000}let udf = df.registry()?;{color}
{color:#000000}df.select(udf(“sqrt”, vec![col(“a”)])?){color}
so that they do not have to remember the UDFs return type when using it.
This API will in the future allow to declare the UDF as part of the planning, like spark, instead of having to register it in the registry before using it (we just need to check if the UDF is registered or not before doing so).
See complete proposal here: [https://docs.google.com/document/d/1Kzz642ScizeKXmVE1bBlbLvR663BKQaGqVIyy9cAscY/edit?usp=sharing]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)