You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Quanlong Huang (JIRA)" <ji...@apache.org> on 2019/01/02 09:07:00 UTC

[jira] [Assigned] (ORC-452) Support converting MAP column from JSON to ORC

     [ https://issues.apache.org/jira/browse/ORC-452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang reassigned ORC-452:
----------------------------------

    Assignee: Quanlong Huang

> Support converting MAP column from JSON to ORC
> ----------------------------------------------
>
>                 Key: ORC-452
>                 URL: https://issues.apache.org/jira/browse/ORC-452
>             Project: ORC
>          Issue Type: New Feature
>          Components: Java, tools
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The JSON convert tool has not supported MAP column yet.
> {code:java}
> java -jar orc-tools-1.5.4-uber.jar convert -s "struct<id:int,map_col:map<string,string>>" map.json 
> log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.Shell).
> log4j:WARN Please initialize the log4j system properly.
> log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
> Processing map.json
> Exception in thread "main" java.lang.IllegalArgumentException: Unhandled type map<string,string>
>         at org.apache.orc.tools.convert.JsonReader.createConverter(JsonReader.java:252)
>         at org.apache.orc.tools.convert.JsonReader.<init>(JsonReader.java:277)
>         at org.apache.orc.tools.convert.JsonReader.<init>(JsonReader.java:260)
>         at org.apache.orc.tools.convert.ConvertTool$FileInformation.getRecordReader(ConvertTool.java:149)
>         at org.apache.orc.tools.convert.ConvertTool.run(ConvertTool.java:203)
>         at org.apache.orc.tools.convert.ConvertTool.main(ConvertTool.java:165)
>         at org.apache.orc.tools.Driver.main(Driver.java:113)
> {code}
> In JsonReader.java, there's no converter for MAP type.
> We'd like to convert following JSON rows into an ORC file
> {code:java}
> {"id": 0, "map_col": {"k1": "v1", "k2": "v2"}}
> {"id": 1, "map_col": {"k3": "v3", "k4": "v4", "k5": "v5"}}{code}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)