Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/12/20 03:19:42 UTC

[GitHub] [arrow-rs] novemberkilo edited a comment on issue #1036: JSON input barfs on {"emptylist":[]}

novemberkilo edited a comment on issue #1036:
URL: https://github.com/apache/arrow-rs/issues/1036#issuecomment-997521410


   @nevi-me @alamb I started with `json2parquet` and captured the shape of the RecordBatch that corresponds to `{"emptylist": []}` (see below). That guided the test I've committed for now. I [get the same panic and error message](https://github.com/apache/arrow-rs/runs/4577227397?check_suite_focus=true#step:6:1648), so I think I am on the right track. Any suggestions for where the actual fix might be? I'm spelunking around, but if either of you (or anyone else familiar with the code here) can orient me, that would help.
   
   I ran `json2parquet` on `{"emptylist": []}` and placed a `dbg!` on what is sent to the writer:
   
   ```
   [src/main.rs:182] &batch = Ok(
       RecordBatch {
           schema: Schema {
               fields: [
                   Field {
                       name: "emptylist",
                       data_type: List(
                           Field {
                               name: "item",
                               data_type: Null,
                               nullable: true,
                               dict_id: 0,
                               dict_is_ordered: false,
                               metadata: None,
                           },
                       ),
                       nullable: true,
                       dict_id: 0,
                       dict_is_ordered: false,
                       metadata: None,
                   },
               ],
               metadata: {},
           },
           columns: [
               ListArray
               [
                 NullArray(0),
               ],
           ],
       },
   )
   thread 'main' panicked at 'Cannot filter indices on a non-primitive array, found List(true)', /home/navin/.cargo/registry/src/github.com-1ecc6299db9ec823/parquet-6.3.0/src/arrow/levels.rs:757:18
   ```
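   
   For orientation, the `ListArray [ NullArray(0) ]` in the dump above is a single row holding an empty list. Below is a minimal sketch of that layout, assuming Arrow's usual offsets-plus-child representation for lists. `ListColumn` and its methods are hypothetical names used only for illustration; they are not types from the `arrow` or `parquet` crates, and this does not reproduce the writer code that panics.
   
   ```rust
   /// Simplified stand-in for a list column (hypothetical, NOT arrow's ListArray).
   /// Row `i` spans child slots `offsets[i]..offsets[i + 1]`.
   pub struct ListColumn {
       /// One more entry than there are rows; always starts at 0.
       pub offsets: Vec<i32>,
       /// Total number of child values across all rows.
       pub child_len: usize,
   }
   
   impl ListColumn {
       /// Build the offsets from the length of each row's list.
       pub fn from_lengths(lengths: &[usize]) -> Self {
           let mut offsets = Vec::with_capacity(lengths.len() + 1);
           offsets.push(0);
           let mut total = 0usize;
           for &len in lengths {
               total += len;
               offsets.push(total as i32);
           }
           ListColumn { offsets, child_len: total }
       }
   
       /// Number of rows (lists) in the column.
       pub fn num_rows(&self) -> usize {
           self.offsets.len() - 1
       }
   
       /// Number of child values in the given row.
       pub fn row_len(&self, row: usize) -> usize {
           (self.offsets[row + 1] - self.offsets[row]) as usize
       }
   }
   ```
   
   Under this model, `ListColumn::from_lengths(&[0])` describes `{"emptylist": []}`: one row, offsets `[0, 0]`, and a child of length 0, which lines up with the `NullArray(0)` shown in the dump.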

