You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/09/28 14:23:00 UTC

[jira] [Updated] (ARROW-10114) [R] arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'

     [ https://issues.apache.org/jira/browse/ARROW-10114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-10114:
---------------------------------
    Summary: [R] arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'  (was: arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer')

> [R] arrow::read_json_arrow gives Error in Table__to_dataframe(x, use_threads = option_use_threads()) :SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'
> -----------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-10114
>                 URL: https://issues.apache.org/jira/browse/ARROW-10114
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 1.0.1
>         Environment: > sessionInfo()
> R version 3.6.3 (2020-02-29)
> Platform: x86_64-pc-linux-gnu (64-bit)
> Running under: Linux Mint 19.3
> Matrix products: default
> BLAS:   /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.7.1
> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.7.1
> locale:
>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
>  [5] LC_MONETARY=sv_SE.UTF-8    LC_MESSAGES=en_US.UTF-8   
>  [7] LC_PAPER=sv_SE.UTF-8       LC_NAME=C                 
>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
> [11] LC_MEASUREMENT=sv_SE.UTF-8 LC_IDENTIFICATION=C       
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base     
> other attached packages:
> [1] arrow_1.0.1
> loaded via a namespace (and not attached):
>  [1] tidyselect_1.1.0 bit_4.0.4        compiler_3.6.3   magrittr_1.5    
>  [5] assertthat_0.2.1 R6_2.4.1         glue_1.4.1       Rcpp_1.0.5      
>  [9] bit64_4.0.2      vctrs_0.3.2      rlang_0.4.7      purrr_0.3.4     
>            Reporter: Markus Skyttner
>            Priority: Major
>
> A .jsonl file (newline separated JSON) created from open data available at [ftp://ftp.libris.kb.se/pub/spa/swepub-deduplicated-2019-12-29.zip] is used with the R package arrow (installed from CRAN) using the following statement:
> > arrow::read_json_arrow("~/.config/swepub/head.jsonl")
> It crashes RStudio with no error message. At the R prompt, the error message is:
> Error in Table__to_dataframe(x, use_threads = option_use_threads()) : 
>  SET_VECTOR_ELT() can only be applied to a 'list', not a 'integer'
> The file "head.jsonl" above was created from the extracted zip's .jsonl file with the *nix "head -1 $BIG_JSONL_FILE" command. It can be parsed with jsonlite and tidyjson.
> Also got this error message at one point:
> > arrow::read_json_arrow("head.jsonl", as_data_frame = TRUE)
> *** caught segfault ***
> address 0x8, cause 'memory not mapped'
> Traceback:
>  1: structure(x, extra_cols = colonnade[extra_cols], class = "pillar_squeezed_colonnade")
>  2: new_colonnade_sqeezed(out, colonnade = x, extra_cols = extra_cols)
>  3: pillar::squeeze(x$mcf, width = width)
>  4: format.trunc_mat(mat)
>  5: format(mat)
>  6: format.tbl(x, ..., n = n, width = width, n_extra = n_extra)
>  7: format(x, ..., n = n, width = width, n_extra = n_extra)
>  8: paste0(..., collapse = "\n")
>  9: cli::cat_line(format(x, ..., n = n, width = width, n_extra = n_extra))
> 10: print.tbl(x)
> 11: (function (x, ...) UseMethod("print"))(x)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)