You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@drill.apache.org by Mahesh Kumar <ma...@outlook.com> on 2015/04/02 08:06:58 UTC

CSV header Problem

Hi ,         I am currently working in Apache Drill to analyse CSV files.My problem is, If the CSV file has headers means we cant do any sum query.It shows the following errors.
0: jdbc:drill:zk=nn01:2181,dn02:2181,dn03:218> select sum(cast(columns[6] as int)) from HDFS.`/test.csv` limit 10;Query failed: RemoteRpcException: Failure while running fragment., rcvdbyte [ 584925d6-dab6-42ce-8eb3-fa7abfb0e0f2 on nn01:31010 ][ 584925d6-dab6-42ce-8eb3-fa7abfb0e0f2 on nn01:31010 ]

Error: exception while executing query: Failure while executing query. (state=,code=0)
But the above query is working well without headers.There is any way to sum the columns in CSV files with headers in Apache Drill.
This is our example file:0: jdbc:drill:zk=nn01:2181,dn02:2181,dn03:218> select * from HDFS.`/test.csv` limit 10;+------------+------------+|  columns   |    dir0    |+------------+------------+| ["date1","time1","srcip","dstip","service","sentbyte","rcvdbyte"] | nn01:9000  || ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","0","193"] | nn01:9000  || ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","0","166"] | nn01:9000  || ["2015-01-01","00:00:00","10.10.100.74","192.168.0.12","DNS","60","359"] | nn01:9000  || ["2015-01-01","00:00:00","10.10.50.195","106.10.193.45","php","717","359","0","0"] | nn01:9000  || ["2015-01-01","00:00:00","111.123.180.44","117.239.67.36","9064","0","0"] | nn01:9000  || ["2015-01-01","00:00:00","111.123.180.44","117.239.67.37","9064","0","0"] | nn01:9000  || ["2015-01-01","00:00:00","111.123.180.44","117.239.67.38","9064","0","0"] | nn01:9000  || ["2015-01-01","00:00:00","111.123.180.44","117.239.67.34","9064","0","0"] | nn01:9000  || ["2015-01-01","00:00:00","111.123.180.44","117.239.67.44","9064","0","0"] | nn01:9000  |

Thanks and Regards,
Mahesh Sankaran