You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@asterixdb.apache.org by "Ian Maxon (JIRA)" <ji...@apache.org> on 2017/10/11 18:02:01 UTC

[jira] [Created] (ASTERIXDB-2129) UTF8StringUtil key normalization failure

Ian Maxon created ASTERIXDB-2129:
------------------------------------

             Summary: UTF8StringUtil key normalization failure
                 Key: ASTERIXDB-2129
                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2129
             Project: Apache AsterixDB
          Issue Type: Bug
          Components: RT - Runtime, TYPE - Data Model
            Reporter: Ian Maxon


This query:

SELECT text,c
FROM(
SELECT h.text AS text, datetime_from_unix_time_in_ms(to_bigint(t.timestamp_ms)) as time
FROM aca_int AS t
UNNEST t.hashtags h
where t.isRelated = 1 and t.`SA-OM` is not missing and t.createdDate is not missing
) AS g
GROUP BY g.text AS text WITH c AS count(g.time)
ORDER BY c DESC;

Where the un-nested hashtag field text is in a closed schema, causes this failure:

{quote}{{Oct 11, 2017 7:10:05 AM org.apache.hyracks.control.cc.dataset.DatasetDirectoryService reportJobFailure
INFO: job JID:4 failed and is being reported to DatasetDirectoryService
org.apache.hyracks.api.exceptions.HyracksDataException: java.lang.IllegalArgumentException
↪   at org.apache.hyracks.api.exceptions.HyracksDataException.create(HyracksDataException.java:134)
↪   at org.apache.hyracks.control.common.utils.ExceptionUtils.setNodeIds(ExceptionUtils.java:63)
↪   at org.apache.hyracks.control.nc.Task.run(Task.java:362)
↪   at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
↪   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
↪   at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.IllegalArgumentException
↪   at org.apache.hyracks.util.string.UTF8StringUtil.charAt(UTF8StringUtil.java:60)
↪   at org.apache.hyracks.util.string.UTF8StringUtil.normalize(UTF8StringUtil.java:228)
↪   at org.apache.hyracks.dataflow.common.data.normalizers.UTF8StringNormalizedKeyComputerFactory$1.normalize(UTF8StringNormalizedKeyComputerFactory.java:33)
↪   at org.apache.asterix.dataflow.data.nontagged.keynormalizers.AWrappedAscNormalizedKeyComputerFactory$1.normalize(AWrappedAscNormalizedKeyComputerFactory.java:46)
↪   at org.apache.hyracks.dataflow.std.sort.AbstractFrameSorter.sort(AbstractFrameSorter.java:139)
↪   at org.apache.hyracks.dataflow.std.sort.AbstractSortRunGenerator.flushFramesToRun(AbstractSortRunGenerator.java:60)
↪   at org.apache.hyracks.dataflow.std.sort.AbstractSortRunGenerator.close(AbstractSortRunGenerator.java:50)
↪   at org.apache.hyracks.dataflow.std.sort.AbstractSorterOperatorDescriptor$SortActivity$1.close(AbstractSorterOperatorDescriptor.java:132)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:119)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.std.StreamSelectRuntimeFactory$1.close(StreamSelectRuntimeFactory.java:112)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:119)
↪   at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
↪   at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$2.close(AlgebricksMetaOperatorDescriptor.java:140)
↪   at org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.close(IndexSearchOperatorNodePushable.java:243)
↪   at org.apache.hyracks.algebricks.runtime.operators.std.EmptyTupleSourceRuntimeFactory$1.close(EmptyTupleSourceRuntimeFactory.java:65)
↪   at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$1.initialize(AlgebricksMetaOperatorDescriptor.java:104)
↪   at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable.lambda$runInParallel$1(SuperActivityOperatorNodePushable.java:204)
↪   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
↪   ... 3 more
}}{quote}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)