Posted to user@drill.apache.org by Mahesh Sankaran <sa...@gmail.com> on 2015/04/01 14:27:46 UTC
CSV header issue
Hi,
I am currently using Apache Drill to analyse CSV files. My
problem is that if a CSV file has a header row, we cannot run a sum query
on it. The query fails with the following error.
0: jdbc:drill:zk=nn01:2181,dn02:2181,dn03:218> select sum(cast(columns[6]
as int)) from HDFS.`/test.csv` limit 10;
Query failed: RemoteRpcException: Failure while running fragment., rcvdbyte
[ 584925d6-dab6-42ce-8eb3-fa7abfb0e0f2 on nn01:31010 ]
[ 584925d6-dab6-42ce-8eb3-fa7abfb0e0f2 on nn01:31010 ]
Error: exception while executing query: Failure while executing query.
(state=,code=0)
*But the above query works fine on files without headers. Is there any way to
sum columns in CSV files that have headers in Apache Drill?*
*This is our example file:*
0: jdbc:drill:zk=nn01:2181,dn02:2181,dn03:218> select * from
HDFS.`/test.csv` limit 10;
+------------+------------+
| columns | dir0 |
+------------+------------+
| ["date1","time1","srcip","dstip","service","sentbyte","rcvdbyte"] | nn01:9000 |
| ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","0","193"] | nn01:9000 |
| ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","0","166"] | nn01:9000 |
| ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","60","359"] | nn01:9000 |
| ["2015-01-01","00:00:00","10.10.50.195","106.10.193.45","php","717","359","0","0"] | nn01:9000 |
| ["2015-01-01","00:00:00","111.123.180.44","117.239.67.36","9064","0","0"] | nn01:9000 |
| ["2015-01-01","00:00:00","111.123.180.44","117.239.67.37","9064","0","0"] | nn01:9000 |
| ["2015-01-01","00:00:00","111.123.180.44","117.239.67.38","9064","0","0"] | nn01:9000 |
| ["2015-01-01","00:00:00","111.123.180.44","117.239.67.34","9064","0","0"] | nn01:9000 |
| ["2015-01-01","00:00:00","111.123.180.44","117.239.67.44","9064","0","0"] | nn01:9000 |
Thanks and Regards,
Mahesh Sankaran