You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2015/05/07 22:20:00 UTC

[jira] [Created] (DRILL-2985) NPE seen for project distinct values from CSV

Khurram Faraaz created DRILL-2985:
-------------------------------------

             Summary: NPE seen for project distinct values from CSV
                 Key: DRILL-2985
                 URL: https://issues.apache.org/jira/browse/DRILL-2985
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Text & CSV
    Affects Versions: 1.0.0
         Environment: b856d0dd50a4495f95ee5d32ea3d396df56842ed | DRILL-2826: Simplify and centralize Operator Cleanup | 05.05.2015 @ 20:28:37 EDT
            Reporter: Khurram Faraaz
            Assignee: Steven Phillips


I am seeing a NPE when we project distinct values. Test was run on 4 node cluster on CentOS.

{code}
0: jdbc:drill:> select distinct type from `airports.csv`;
Error: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010] (state=,code=0)
{code}

Stacktrace from drillbit.log 

{code}
2015-05-07 20:03:21,790 [2ab43af5-b6a9-0578-c45b-454b1a1a7b35:frag:0:0] ERROR o.a.d.c.e.DrillRuntimeException - SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
        at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:465) ~[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:262) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_75]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_75]
        at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
Caused by: java.lang.NullPointerException: null
        at org.apache.drill.exec.store.easy.text.compliant.CompliantTextRecordReader.cleanup(CompliantTextRecordReader.java:147) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ScanBatch.<init>(ScanBatch.java:104) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.store.dfs.easy.EasyFormatPlugin.getReaderBatch(EasyFormatPlugin.java:189) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.store.dfs.easy.EasyReaderBatchCreator.getBatch(EasyReaderBatchCreator.java:35) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.store.dfs.easy.EasyReaderBatchCreator.getBatch(EasyReaderBatchCreator.java:28) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:140) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getRootExec(ImplCreator.java:96) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.physical.impl.ImplCreator.getExec(ImplCreator.java:77) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:175) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        ... 4 common frames omitted
2015-05-07 20:03:21,792 [2ab43af5-b6a9-0578-c45b-454b1a1a7b35:frag:0:0] INFO  o.a.drill.exec.work.foreman.Foreman - State change requested.  RUNNING --> FAILED
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
        at org.apache.drill.exec.work.foreman.QueryManager$1.statusUpdate(QueryManager.java:409) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:389) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:90) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:86) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:266) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_75]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_75]
        at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
{code}

Snippet of data from CSV file

{code}
[root@centos-01 logs]# hadoop fs -cat /tmp/airports.csv
"id","ident","type","name","latitude_deg","longitude_deg","elevation_ft","continent","iso_country","iso_region","municipality","scheduled_service","gps_code","iata_code","local_code","home_link","wikipedia_link","keywords"
6523,"00A","heliport","Total Rf Heliport",40.07080078125,-74.9336013793945,11,"NA","US","US-PA","Bensalem","no","00A",,"00A",,,
6524,"00AK","small_airport","Lowell Field",59.94919968,-151.695999146,450,"NA","US","US-AK","Anchor Point","no","00AK",,"00AK",,,
6525,"00AL","small_airport","Epps Airpark",34.8647994995117,-86.7703018188477,820,"NA","US","US-AL","Harvest","no","00AL",,"00AL",,,
6526,"00AR","heliport","Newport Hospital & Clinic Heliport",35.608699798584,-91.2548980712891,237,"NA","US","US-AR","Newport","no","00AR",,"00AR",,,
6527,"00AZ","small_airport","Cordes Airport",34.3055992126465,-112.165000915527,3810,"NA","US","US-AZ","Cordes","no","00AZ",,"00AZ",,,
6528,"00CA","small_airport","Goldstone /Gts/ Airport",35.3504981995,-116.888000488,3038,"NA","US","US-CA","Barstow","no","00CA",,"00CA",,,
6529,"00CO","small_airport","Cass Field",40.622200012207,-104.34400177002,4830,"NA","US","US-CO","Briggsdale","no","00CO",,"00CO",,,

{code}

details of different columns in the file

{code}
0: jdbc:drill:> select * from `airports.csv` limit 10;
+------------+
|  columns   |
+------------+
| ["id","ident","type","name","latitude_deg","longitude_deg","elevation_ft","continent","iso_country","iso_region","municipality","scheduled_service","gps_code","iata_code","local_code","home_link","wikipedia_link","keywords"] |
| ["6523","00A","heliport","Total Rf Heliport","40.07080078125","-74.9336013793945","11","NA","US","US-PA","Bensalem","no","00A","","00A","","",""] |
| ["6524","00AK","small_airport","Lowell Field","59.94919968","-151.695999146","450","NA","US","US-AK","Anchor Point","no","00AK","","00AK","","",""] |
| ["6525","00AL","small_airport","Epps Airpark","34.8647994995117","-86.7703018188477","820","NA","US","US-AL","Harvest","no","00AL","","00AL","","",""] |
| ["6526","00AR","heliport","Newport Hospital & Clinic Heliport","35.608699798584","-91.2548980712891","237","NA","US","US-AR","Newport","no","00AR","","00AR","","",""] |
| ["6527","00AZ","small_airport","Cordes Airport","34.3055992126465","-112.165000915527","3810","NA","US","US-AZ","Cordes","no","00AZ","","00AZ","","",""] |
| ["6528","00CA","small_airport","Goldstone /Gts/ Airport","35.3504981995","-116.888000488","3038","NA","US","US-CA","Barstow","no","00CA","","00CA","","",""] |
| ["6529","00CO","small_airport","Cass Field","40.622200012207","-104.34400177002","4830","NA","US","US-CO","Briggsdale","no","00CO","","00CO","","",""] |
| ["6531","00FA","small_airport","Grass Patch Airport","28.6455001831055","-82.2190017700195","53","NA","US","US-FL","Bushnell","no","00FA","","00FA","","",""] |
| ["6532","00FD","heliport","Ringhaver Heliport","28.8465995788574","-82.3453979492188","25","NA","US","US-FL","Riverview","no","00FD","","00FD","","",""] |
+------------+
10 rows selected (0.264 seconds)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)