Posted to issues@drill.apache.org by "Scott Wilburn (JIRA)" <ji...@apache.org> on 2018/04/23 19:37:00 UTC
[jira] [Commented] (DRILL-3202) Count(*) fails on JSON wrapped up in single array - JSON parsing error
[ https://issues.apache.org/jira/browse/DRILL-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16448752#comment-16448752 ]
Scott Wilburn commented on DRILL-3202:
--------------------------------------
Is there any workaround for this issue? It's hard to believe that I can't count objects in a JSON file using Drill.
> Count(*) fails on JSON wrapped up in single array - JSON parsing error
> ----------------------------------------------------------------------
>
> Key: DRILL-3202
> URL: https://issues.apache.org/jira/browse/DRILL-3202
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - JSON
> Affects Versions: 1.0.0
> Reporter: Neeraja
> Assignee: Steven Phillips
> Priority: Major
> Fix For: Future
>
> Attachments: DRILL-3202.patch
>
>
> I have a JSON document as follows.
> [
> {
> "Category": "1,2",
> "Comments": "Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals",
> "Latitude": "34.800446",
> "Longitude": "-87.498242",
> "Name": "Alloys Co Park",
> "State": "AL",
> "Type": "cp",
> "URL": "http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground"
> }
> ]
> Drill has the ability to unwrap the array (without the user specifying it) and perform some SQL operations on it. However, count(*) specifically fails on these documents.
> 0: jdbc:drill:zk=local> select * from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10;
> +-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------+------------+-------+--------+-------+------+
> | Category | Comments | Latitude | Longitude | Name | State | Type | URL |
> +-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------+------------+-------+--------+-------+------+
> | 1,2 | Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals | 34.800446 | -87.498242 | Alloys Co Park | AL | cp | http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground |
> +-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------+------------+-------+--------+-------+------+
> 1 row selected (0.197 seconds)
> 0: jdbc:drill:zk=local> select distinct type from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10;
> +-------+
> | type |
> +-------+
> | cp |
> +-------+
> 1 row selected (0.193 seconds)
> 0: jdbc:drill:zk=local>
> 0: jdbc:drill:zk=local> select count(*) from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10;
> Error: DATA_READ ERROR: Error parsing JSON - Cannot read from the middle of a record. Current token was START_ARRAY
> File /Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json
> Record 1
> Fragment 0:0
> [Error Id: 4742f738-1d43-4fef-af48-110065c9dd83 on 172.16.1.82:31010] (state=,code=0)
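
One possible workaround, not confirmed anywhere in this thread, is to preprocess the file so that it no longer has a top-level array: write each object on its own line, which is the record layout Drill's JSON reader normally expects. The sketch below does that in Python and also returns the object count directly. The function name array_to_records and the file names wrapped.json and unwrapped.json are hypothetical, and whether count(*) then succeeds on the rewritten file is an assumption that would need to be checked against your Drill version.

```python
import json
import os
import tempfile


def array_to_records(src_path, dst_path):
    """Rewrite a top-level JSON array as one JSON object per line.

    Returns the number of objects written. The whole file is loaded into
    memory, so this sketch only suits files that fit in RAM.
    """
    with open(src_path, encoding="utf-8") as src:
        records = json.load(src)
    if not isinstance(records, list):
        raise ValueError("expected a top-level JSON array")
    with open(dst_path, "w", encoding="utf-8") as dst:
        for record in records:
            dst.write(json.dumps(record) + "\n")
    return len(records)


# Small demonstration using a trimmed copy of the record from the report.
sample = [{"Name": "Alloys Co Park", "State": "AL", "Type": "cp"}]
workdir = tempfile.mkdtemp()
src_path = os.path.join(workdir, "wrapped.json")      # hypothetical file name
dst_path = os.path.join(workdir, "unwrapped.json")    # hypothetical file name
with open(src_path, "w", encoding="utf-8") as f:
    json.dump(sample, f)

count = array_to_records(src_path, dst_path)
print(count)
```

The rewritten file could then be queried in place of the original, or the returned count used on its own if only the number of objects is needed.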