You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@impala.apache.org by st...@apache.org on 2024/01/31 00:23:04 UTC

(impala) 03/04: IMPALA-12756: [DOCS] Unicode column name support documentation

This is an automated email from the ASF dual-hosted git repository.

stigahuang pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/impala.git

commit ab445195b0d4eb78eda16c7501f49e1fa554530e
Author: pranavyl <pr...@cloudera.com>
AuthorDate: Thu Dec 14 12:46:16 2023 -0800

    IMPALA-12756: [DOCS] Unicode column name support documentation
    
    The patch focuses on documenting that Impala supports unicode
    column names, consistent with Hive's current support (as we use
    Hive MetaStore to store table metadata).
    
    Change-Id: I3d43d942a3ea069020f06adab6ea77e62ad5ffbe
    Reviewed-on: http://gerrit.cloudera.org:8080/20950
    Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
    Tested-by: Impala Public Jenkins <im...@cloudera.com>
---
 docs/topics/impala_identifiers.xml | 10 ++++++----
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git a/docs/topics/impala_identifiers.xml b/docs/topics/impala_identifiers.xml
index 5f407fa69..b8760e374 100644
--- a/docs/topics/impala_identifiers.xml
+++ b/docs/topics/impala_identifiers.xml
@@ -51,20 +51,22 @@ under the License.
 
       <li>
         <p>
-        The maximum length of an identifier is currently 128 characters, enforced by the metastore database.
+        The maximum length of an identifier is currently 128 characters except for column names which
+        can contain 767 characters, enforced by the metastore database.
         </p>
       </li>
 
       <li>
         <p>
-        An identifier must start with an alphanumeric or underscore character. Quoting the identifier with
-        backticks has no effect on the allowed characters in the name.
+        An identifier must start with an alphanumeric or underscore character except for column names which
+        can start with any unicode characters. Quoting the identifier with backticks has no effect on the allowed
+        characters in the name.
         </p>
       </li>
 
       <li>
         <p>
-        An identifier can contain only ASCII characters.
+        An identifier can contain only ASCII characters except for column names which can contain unicode characters.
         </p>
       </li>