You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/08/24 17:18:00 UTC

[jira] [Created] (ARROW-13737) [C++] hash_sum on scalar column segfaults

Neal Richardson created ARROW-13737:
---------------------------------------

             Summary: [C++] hash_sum on scalar column segfaults
                 Key: ARROW-13737
                 URL: https://issues.apache.org/jira/browse/ARROW-13737
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Neal Richardson
             Fix For: 6.0.0


Reproducer in R. Works fine without grouping, but segfaults if you group:

{code}
tab <- Table$create(a=1:4, b=6:9)

tab %>% 
  mutate(counter = 1) %>% 
  summarize(count = sum(counter))

Table
1 rows x 1 columns
$count <double>

tab %>% 
  mutate(counter = 1) %>% 
  group_by(a) %>% 
  summarize(count = sum(counter))

 *** caught segfault ***
address 0x0, cause 'invalid permissions'

Traceback:
 1: .Call(`_arrow_ExecPlan_run`, plan, final_node)
 2: ExecPlan_run(self, node)
 3: plan$Run(final_node)
 4: do_exec_plan(.data, group_vars = dplyr::group_vars(.data))
 5: do_arrow_summarize(.data, ...)
...
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)