You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/09/09 05:22:00 UTC
[jira] [Commented] (IMPALA-11492) ExprTest.Utf8MaskTest fails when en_US.UTF-8 is not present
[ https://issues.apache.org/jira/browse/IMPALA-11492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17602120#comment-17602120 ]
ASF subversion and git services commented on IMPALA-11492:
----------------------------------------------------------
Commit 96fcc7286b59101f4aa2b10f5fbe66fe0c9065cb in impala's branch refs/heads/branch-4.1.1 from Joe McDonnell
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=96fcc7286 ]
IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)
Machines that don't have en_US.UTF-8 installed see
issues when running ExprTest.Utf8MaskTest.
This currently impacts the Docker-based tests.
This installs the appropriate language packs
to have en_US.UTF-8 installed.
Testing:
- Ran docker-based tests and verified that
ExprTest.Utf8MaskTest passes.
Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Reviewed-on: http://gerrit.cloudera.org:8080/18875
Tested-by: Impala Public Jenkins <im...@cloudera.com>
Reviewed-by: Quanlong Huang <hu...@gmail.com>
Reviewed-on: http://gerrit.cloudera.org:8080/18950
Reviewed-by: Joe McDonnell <jo...@cloudera.com>
Tested-by: Quanlong Huang <hu...@gmail.com>
> ExprTest.Utf8MaskTest fails when en_US.UTF-8 is not present
> -----------------------------------------------------------
>
> Key: IMPALA-11492
> URL: https://issues.apache.org/jira/browse/IMPALA-11492
> Project: IMPALA
> Issue Type: Bug
> Components: Backend
> Affects Versions: Impala 4.2.0
> Reporter: Joe McDonnell
> Assignee: Joe McDonnell
> Priority: Critical
> Fix For: Impala 4.2.0
>
>
> In the docker-based tests on Redhat 8 / Ubuntu 20, the ExprTest.Utf8MaskTest fails:
> {noformat}
> /home/impdev/Impala/be/src/exprs/expr-test.cc:369
> Value of: GetValue(expr, ColumnType(TYPE_STRING))
> Actual: "xxxx \xC3\xA1\xC3\xA4\xC3\xA8\xC3\xBC XXXX \xC3\x81\xC3\x84\xC3\x88\xC3\x9C"
> Expected: expected_result
> Which is: "xxxx xxxx XXXX XXXX"
> mask('abcd ABCD '){noformat}
> These come with the C.UTF-8 locale installed. This error goes away if I change bin/bootstrap_system.sh to install langpacks-us (Centos) or language-pack-en (Ubuntu), which installs the en_US.UTF-8 locale.
> This might be related to this code: [https://github.com/apache/impala/blob/master/be/src/exprs/mask-functions-ir.cc#L150]
> Installing the language packs is easy, but I'm not sure if users would have those installed.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org