You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Greg Rahn (JIRA)" <ji...@apache.org> on 2018/10/25 20:42:00 UTC

[jira] [Created] (IMPALA-7759) Add Levenshtein edit distance function

Greg Rahn created IMPALA-7759:
---------------------------------

             Summary: Add Levenshtein edit distance function
                 Key: IMPALA-7759
                 URL: https://issues.apache.org/jira/browse/IMPALA-7759
             Project: IMPALA
          Issue Type: New Feature
            Reporter: Greg Rahn
            Assignee: Greg Rahn


References:
 * [Netezza - (le_dst())|https://www.ibm.com/support/knowledgecenter/en/SSULQD_7.2.1/com.ibm.nz.dbu.doc/r_dbuser_functions_expressions_fuzzy_funcs.html]
 * [Postgres - levenshtein()|https://www.postgresql.org/docs/current/static/fuzzystrmatch.html#id-1.11.7.24.6]

One notable difference:
* Netezza: if either value is NULL, returns the length of non-NULL value
* Postgres: if either value is NULL, returns NULL 

Preference is to implement Postgres version due to ease of cross-system testing.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org