You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by we...@apache.org on 2023/02/15 13:00:40 UTC
[spark] branch master updated: [SPARK-42405][SQL] Improve array insert documentation

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
     new a14c6bb2710 [SPARK-42405][SQL] Improve array insert documentation
a14c6bb2710 is described below

commit a14c6bb2710cb7d43538e9754ca536f0269eb3c4
Author: Daniel Davies <dd...@palantir.com>
AuthorDate: Wed Feb 15 21:00:24 2023 +0800

    [SPARK-42405][SQL] Improve array insert documentation
    
    ### What changes were proposed in this pull request?
    
    Part of cleanup from existing PR https://github.com/apache/spark/pull/38867 - documentation on the scala class ArrayInsert should match the python array_insert function. See comment here: https://github.com/apache/spark/pull/38867#discussion_r1097054656.
    
    ### Why are the changes needed?
    
    See https://github.com/apache/spark/pull/38867#discussion_r1097054656.
    
    ### Does this PR introduce _any_ user-facing change?
    Yes- better documentation of the array_insert function
    
    ### How was this patch tested?
    Not applicable/ standard unit testing.
    
    Closes #39975 from Daniel-Davies/ddavies/SPARK-42405.
    
    Authored-by: Daniel Davies <dd...@palantir.com>
    Signed-off-by: Wenchen Fan <we...@databricks.com>
---
 python/pyspark/sql/functions.py                                    | 6 +++---
 .../spark/sql/catalyst/expressions/collectionOperations.scala      | 7 ++++++-
 2 files changed, 9 insertions(+), 4 deletions(-)

diff --git a/python/pyspark/sql/functions.py b/python/pyspark/sql/functions.py
index ac842101b28..b103af72e36 100644
--- a/python/pyspark/sql/functions.py
+++ b/python/pyspark/sql/functions.py
@@ -7680,9 +7680,9 @@ def array_distinct(col: "ColumnOrName") -> Column:
 def array_insert(arr: "ColumnOrName", pos: Union["ColumnOrName", int], value: Any) -> Column:
     """
     Collection function: adds an item into a given array at a specified array index.
-    Array indices start at 1 (or from the end if the index is negative).
-    Index specified beyond the size of the current array (plus additional element)
-    is extended with 'null' elements.
+    Array indices start at 1, or start from the end if index is negative.
+    Index above array size appends the array, or prepends the array if index is negative,
+    with 'null' elements.
 
     .. versionadded:: 3.4.0
 
diff --git a/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala b/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala
index 28c4a9eba68..289859d420b 100644
--- a/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala
+++ b/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala
@@ -4603,7 +4603,12 @@ case class ArrayExcept(left: Expression, right: Expression) extends ArrayBinaryL
 
 // scalastyle:off line.size.limit
 @ExpressionDescription(
-  usage = "_FUNC_(x, pos, val) - Places val into index pos of array x (array indices start at 1, or start from the end if start is negative).\",",
+  usage = """
+    _FUNC_(x, pos, val) - Places val into index pos of array x.
+      Array indices start at 1, or start from the end if index is negative.
+      Index above array size appends the array, or prepends the array if index is negative,
+      with 'null' elements.
+  """,
   examples = """
     Examples:
       > SELECT _FUNC_(array(1, 2, 3, 4), 5, 5);


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org