You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Samuel Schneck (Jira)" <ji...@apache.org> on 2022/10/31 23:43:00 UTC

[jira] [Created] (ARROW-18208) JS: tableFromJSON cannot handle nested objects containing strings

Samuel Schneck created ARROW-18208:
--------------------------------------

             Summary: JS: tableFromJSON cannot handle nested objects containing strings
                 Key: ARROW-18208
                 URL: https://issues.apache.org/jira/browse/ARROW-18208
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Samuel Schneck


```

$ node

const g = require('apache-arrow')

g.tableFromJSON([\{a: [ { b: "hi" } ]}])

```

 

The dictionary types:

 

TYPE Dictionary \{indices: Int32, dictionary: Utf8, isOrdered: false, id: 12}dictionary: Utf8 {}id: 12indices: Int32 \{isSigned: true, bitWidth: 32}isOrdered: falseArrayType: (...)children: (...)typeId: (...)valueType: (...)[[Prototype]]: Dictionary
typecomparator.ts:191 OTHER 

 

OTHER Dictionary \{indices: Int32, dictionary: Utf8, isOrdered: false, id: 14}dictionary: Utf8typeId: (...)[[Prototype]]: Utf8id: 14indices: Int32 \{isSigned: true, bitWidth: 32}isOrdered: falseArrayType: (...)children: (...)typeId: (...)valueType: (...)[[Prototype]]: Dictionary

 

This happens here:

    else if (arraysCount + nullsCount === value.length) {
        const array = value;
        const childType = inferType(array[array.findIndex((ary) => ary != null)]);
        if (array.every((ary) => ary == null || (0, typecomparator_js_1.compareTypes)(childType, inferType(ary)))) {
            return new dtypes.List(new schema_js_1.Field('', childType, true));
        }
    }

 

So we're always instantiating a new dictionary type, with a new id, when we do inferType(ary), so this is never going to succeed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)