You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Jonathan Keane (Jira)" <ji...@apache.org> on 2021/10/14 15:38:00 UTC

[jira] [Created] (ARROW-14321) [R] segfault when filter + arrange

Jonathan Keane created ARROW-14321:
--------------------------------------

             Summary: [R] segfault when filter + arrange
                 Key: ARROW-14321
                 URL: https://issues.apache.org/jira/browse/ARROW-14321
             Project: Apache Arrow
          Issue Type: Bug
          Components: R
            Reporter: Jonathan Keane


It appears to happen when one of the filter parts has no matching rows:

{code:r}
library(arrow)
library(dplyr)
first_date <- lubridate::ymd_hms("2015-04-29 03:12:39")

df1 <- tibble::tibble(
  int = 1:10,
  dbl = as.numeric(1:10),
  lgl = rep(c(TRUE, FALSE, NA, TRUE, FALSE), 2),
  chr = letters[1:10],
  fct = factor(LETTERS[1:10]),
  ts = first_date + lubridate::days(1:10)
)

ds <- InMemoryDataset$create(df1)

# works
ds %>% 
  filter(int < 8) %>%
  arrange(dbl) %>%
  collect()

# segfaults
ds %>% 
  filter(int < 8, int > 55) %>%
  arrange(dbl) %>%
  collect()

 segfaults
ds %>% 
  filter(int < 0) %>%
  arrange(dbl) %>%
  collect()
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)