You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@asterixdb.apache.org by "Chen Luo (JIRA)" <ji...@apache.org> on 2017/10/05 18:45:02 UTC

[jira] [Created] (ASTERIXDB-2125) NotImplementedException when Bulk Load LSMRTree

Chen Luo created ASTERIXDB-2125:
-----------------------------------

             Summary: NotImplementedException when Bulk Load LSMRTree
                 Key: ASTERIXDB-2125
                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2125
             Project: Apache AsterixDB
          Issue Type: Bug
          Components: IDX - Indexes
            Reporter: Chen Luo


When using the twitter dataset (same as the one used by Cloudberry), creating a new LSM RTree index throws NotImplementedException (both for datasets using prefix or correlated merge policy).

Stack trace:
{code}
 org.apache.hyracks.algebricks.common.exceptions.NotImplementedException: Value provider for type missing is not implemented
org.apache.hyracks.api.exceptions.HyracksDataException: org.apache.hyracks.algebricks.common.exceptions.NotImplementedException: Value provider for type missing is not implemented
	at org.apache.hyracks.api.exceptions.HyracksDataException.create(HyracksDataException.java:134)
	at org.apache.hyracks.control.common.utils.ExceptionUtils.setNodeIds(ExceptionUtils.java:63)
	at org.apache.hyracks.control.nc.Task.run(Task.java:367)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
	at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.hyracks.algebricks.common.exceptions.NotImplementedException: Value provider for type missing is not implemented
	at org.apache.asterix.dataflow.data.nontagged.valueproviders.PrimitiveValueProviderFactory$1.getValue(PrimitiveValueProviderFactory.java:60)
	at org.apache.hyracks.storage.am.rtree.frames.RTreeNSMFrame.calculateMBRImpl(RTreeNSMFrame.java:131)
	at org.apache.hyracks.storage.am.rtree.frames.RTreeNSMFrame.adjustMBR(RTreeNSMFrame.java:152)
	at org.apache.hyracks.storage.am.rtree.impls.RTree$RTreeBulkLoader.propagateBulk(RTree.java:1047)
	at org.apache.hyracks.storage.am.rtree.impls.RTree$RTreeBulkLoader.add(RTree.java:948)
	at org.apache.hyracks.storage.am.lsm.common.impls.AbstractLSMDiskComponentBulkLoader.add(AbstractLSMDiskComponentBulkLoader.java:91)
	at org.apache.hyracks.storage.am.lsm.rtree.impls.LSMRTreeWithAntiMatterTuples$LSMRTreeWithAntiMatterTuplesBulkLoader.add(LSMRTreeWithAntiMatterTuples.java:292)
	at org.apache.hyracks.storage.am.common.dataflow.IndexBulkLoadOperatorNodePushable.nextFrame(IndexBulkLoadOperatorNodePushable.java:81)
	at org.apache.hyracks.api.dataflow.EnforceFrameWriter.nextFrame(EnforceFrameWriter.java:76)
	at org.apache.hyracks.dataflow.common.comm.io.AbstractFrameAppender.write(AbstractFrameAppender.java:93)
	at org.apache.hyracks.dataflow.common.comm.util.FrameUtils.appendToWriter(FrameUtils.java:121)
	at org.apache.hyracks.dataflow.std.sort.AbstractFrameSorter.flush(AbstractFrameSorter.java:172)
	at org.apache.hyracks.dataflow.std.sort.AbstractExternalSortRunMerger.process(AbstractExternalSortRunMerger.java:90)
	at org.apache.hyracks.dataflow.std.sort.AbstractSorterOperatorDescriptor$MergeActivity$1.initialize(AbstractSorterOperatorDescriptor.java:181)
	at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable.lambda$runInParallel$0(SuperActivityOperatorNodePushable.java:204)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	... 3 more
{code}

This should be a bug within RTree, because when we perform bulk load, we first filter out all entries with missing or null secondary keys.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)