You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Sudheesh Katkam (JIRA)" <ji...@apache.org> on 2015/11/18 19:42:11 UTC
[jira] [Comment Edited] (DRILL-4102) Only one row found in a JSON
document that contains multiple items.
[ https://issues.apache.org/jira/browse/DRILL-4102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15011627#comment-15011627 ]
Sudheesh Katkam edited comment on DRILL-4102 at 11/18/15 6:41 PM:
------------------------------------------------------------------
This is actually a duplicate of DRILL-2583, allowing `KVGEN(\*)` resolves the issue you are facing.
was (Author: sudheeshkatkam):
This is actually a duplicate of DRILL-2583, allowing `KVGEN(*)` resolves the issue you are facing.
> Only one row found in a JSON document that contains multiple items.
> -------------------------------------------------------------------
>
> Key: DRILL-4102
> URL: https://issues.apache.org/jira/browse/DRILL-4102
> Project: Apache Drill
> Issue Type: Bug
> Environment: OS X, Drill embedded, v1.1.0 installed via HomeBrew
> Reporter: aditya menon
>
> I tried to analyse a JSON file that had the following (sample) structure:
> {code:json}
> {
> "Key1": {
> "htmltags": "<htmltag attr1='bravo' /><htmltag attr2='delta' /><htmltag attr3='charlie' />"
> },
> "Key2": {
> "htmltags": "<htmltag attr1='kilo' /><htmltag attr2='lima' /><htmltag attr3='mike' />"
> },
> "Key3": {
> "htmltags": "<htmltag attr1='november' /><htmltag attr2='foxtrot' /><htmltag attr3='sierra' />"
> }
> }
> {code}
> (Apologies for the obfuscation, I am unable to publish the original dataset. But the structure is exactly the same. Note especially how the keys and other data points *differ* in some places, and remain identical in others.)
> When I run a {code:sql}SELECT * FROM DataFile.json{code} what I get is a single row listed under three columns: {code:html}"<htmltag attr1='bravo' /><htmltag attr2='delta' /><htmltag attr3='charlie' />"{code} [i.e., only the entry `Key1.htmltags`] .
> Ideally, I should see three rows, each with entries from Key1..Key3, listed under the correct respective column.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)