You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Kouhei Sutou (Jira)" <ji...@apache.org> on 2022/10/20 05:00:00 UTC
[jira] [Updated] (ARROW-14314) [C++] Sorting dictionary array not implemented
[ https://issues.apache.org/jira/browse/ARROW-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kouhei Sutou updated ARROW-14314:
---------------------------------
Fix Version/s: 11.0.0
(was: 10.0.0)
> [C++] Sorting dictionary array not implemented
> ----------------------------------------------
>
> Key: ARROW-14314
> URL: https://issues.apache.org/jira/browse/ARROW-14314
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Neal Richardson
> Assignee: Ariana Villegas
> Priority: Major
> Labels: kernel, pull-request-available
> Fix For: 11.0.0
>
> Time Spent: 6h 10m
> Remaining Estimate: 0h
>
> From R, taking the stock {{mtcars}} dataset and giving it a dictionary type column:
> {code}
> mtcars %>%
> mutate(cyl = as.factor(cyl)) %>%
> Table$create() %>%
> arrange(cyl) %>%
> collect()
> Error: Type error: Sorting not supported for type dictionary<values=string, indices=int8, ordered=0>
> ../src/arrow/compute/kernels/vector_array_sort.cc:427 VisitTypeInline(type, this)
> ../src/arrow/compute/kernels/vector_sort.cc:148 GetArraySorter(*physical_type_)
> ../src/arrow/compute/kernels/vector_sort.cc:1206 sorter.Sort()
> ../src/arrow/compute/api_vector.cc:259 CallFunction("sort_indices", {datum}, &options, ctx)
> ../src/arrow/compute/exec/order_by_impl.cc:53 SortIndices(table, options_, ctx_)
> ../src/arrow/compute/exec/sink_node.cc:292 impl_->DoFinish()
> ../src/arrow/compute/exec/exec_plan.cc:297 iterator_.Next()
> ../src/arrow/record_batch.cc:318 ReadNext(&batch)
> ../src/arrow/record_batch.cc:329 ReadAll(&batches)
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)