Posted to user@hive.apache.org by RSD <ro...@gmail.com> on 2009/07/09 01:25:10 UTC
Odd error (Hive 0.17)
I've gotten new equipment to do an upgrade, but I need to keep my Hadoop cluster
pushing data. :-)
I am getting the following:
Job Submission failed with exception 'Input path doesnt exist :
hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver
My hive.log:
2009-07-08 16:08:49,559 INFO exec.ExecDriver
(ExecDriver.java:execute(141)) - Adding input file
/tmp/hive-username/183603784.10002
2009-07-08 16:08:49,705 ERROR exec.ExecDriver
(SessionState.java:printError(242)) - Job Submission failed with
exception 'Input path doesnt exist :
hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
org.apache.hadoop.mapred.InvalidInputException: Input path doesnt
exist : hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002
What does this error indicate? The files I want to work on are in
Hadoop. I've checked permissions, as my first thought was that
I could not write to /tmp on the local machine, but that is not the
case. Does anyone have a clue?
Re: Odd error (Hive 0.17)
Posted by RSD <ro...@gmail.com>.
I will look into using fewer conditionals. Thank you for the insight.
On Wed, Jul 8, 2009 at 6:57 PM, Frederick Oko<fr...@gmail.com> wrote:
> That error had been thrown in cases where a preceding phase of the Hive plan
> produced no results (even though it did find data to scan). In your case that
> would be the filter conditions applied in the first of the 2 MR jobs. Are you
> certain there are records that match your day and game_id conditions? I can't
> validate right now whether that behavior has been made more user-friendly.
Re: Odd error (Hive 0.17)
Posted by Frederick Oko <fr...@gmail.com>.
That error had been thrown in cases where a preceding phase of the Hive plan
produced no results (even though it did find data to scan). In your case that
would be the filter conditions applied in the first of the 2 MR jobs. Are you
certain there are records that match your day and game_id conditions? I can't
validate right now whether that behavior has been made more user-friendly.
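One way to check this, as a rough sketch only: run the same filter with a plain
count and no GROUP BY, and see whether any rows match. The table, columns, and
literal values come from the query in this thread. Whether this simpler form
avoids the same failure on your Hive version is an assumption.

```sql
-- Sanity check: count the rows that survive the WHERE clause of the failing query.
-- A result of 0 would be consistent with the first MR job producing no output.
SELECT count(1)
FROM game_start a
WHERE a.day >= '2009-06-01' AND a.day < '2009-06-03'
  AND (a.game_id = 501 OR a.game_id = 502 OR a.game_id = 504
       OR a.game_id = 505 OR a.game_id = 563);
```

If the count is nonzero, the empty-intermediate-result explanation probably does
not apply and something else is going on.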
On Jul 8, 2009 5:10 PM, "RSD" <ro...@gmail.com> wrote:
hive> describe game_start;
recordtime string
user_id int
session_id string
host string
release string
source string
ip string
ip_country string
game_id int
challenge int
suggestion_rank int
language string
day string
hive> select a.game_id, count(distinct(a.user_id)), count(1) from
game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
(a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
505 or a.game_id = 563) group by a.game_id;
Total MapReduce jobs = 2
Starting Job = job_200907071612_0568, Tracking URL =
http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
-Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
...
Ended Job = job_200907071612_0568
Job Submission failed with exception 'Input path doesnt exist :
hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver
There is a local directory /tmp/hive-someuser (and I am logging to
/tmp/someuser/hive.log).
Is there something wrong with the syntax of the query?
Re: Odd error (Hive 0.17)
Posted by Raghu Murthy <rm...@facebook.com>.
> Job Submission failed with exception 'Input path doesnt exist :
> hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
Again, it looks like the query is trying to access HDFS rather than a local
directory. Can you paste the output of describe extended game_start?
Re: Odd error (Hive 0.17)
Posted by RSD <ro...@gmail.com>.
hive> describe game_start;
recordtime string
user_id int
session_id string
host string
release string
source string
ip string
ip_country string
game_id int
challenge int
suggestion_rank int
language string
day string
hive> select a.game_id, count(distinct(a.user_id)), count(1) from
game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
(a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
505 or a.game_id = 563) group by a.game_id;
Total MapReduce jobs = 2
Starting Job = job_200907071612_0568, Tracking URL =
http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
-Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
...
Ended Job = job_200907071612_0568
Job Submission failed with exception 'Input path doesnt exist :
hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver
There is a local directory /tmp/hive-someuser (and I am logging to
/tmp/someuser/hive.log).
Is there something wrong with the syntax of the query?
Re: Odd error (Hive 0.17)
Posted by Raghu Murthy <rm...@facebook.com>.
The path indicates that it's looking at HDFS. If you are loading data that
is on your local file system, you should do something like LOAD DATA LOCAL
INPATH '/path/in/local/fs'.
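As a sketch of the full statement: the target table name game_start is an
assumption taken from elsewhere in this thread, and the path is only a
placeholder. This also assumes the table is not partitioned; a partitioned
table would need a PARTITION clause as well.

```sql
-- LOCAL tells Hive to read the file from the client machine's file system.
-- Without LOCAL, the path is resolved in HDFS.
-- '/path/in/local/fs' is a placeholder and game_start is an assumed target table.
LOAD DATA LOCAL INPATH '/path/in/local/fs' INTO TABLE game_start;
```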