You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by RSD <ro...@gmail.com> on 2009/07/09 01:25:10 UTC

Odd error (Hive 0.17)

I've gotten new equipment to do an upgrade, but I need to keep my Hadoop cluster
pushing data. :-)

I am getting the following:

Job Submission failed with exception 'Input path doesnt exist :
hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver

My hive.log:
009-07-08 16:08:49,559 INFO  exec.ExecDriver
(ExecDriver.java:execute(141)) - Adding input file
/tmp/hive-username/183603784.10002
2009-07-08 16:08:49,705 ERROR exec.ExecDriver
(SessionState.java:printError(242)) - Job Submission failed with
exception 'Input path doesnt exist :
hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
org.apache.hadoop.mapred.InvalidInputException: Input path doesnt
exist : hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002


What does this error indicate? The files are in Hadoop that I want to
work on; I've checked permissions, as my first thought was that
I could not write to /tmp on the local machine, but that is not the
case. Any one have a clue?

Re: Odd error (Hive 0.17)

Posted by RSD <ro...@gmail.com>.
I will look into less conditionals. Thank you for the insight.

On Wed, Jul 8, 2009 at 6:57 PM, Frederick Oko<fr...@gmail.com> wrote:
> That error had been thrown in cases where a preceding phase of the Hive plan
> produced no results (yet it did find data to scan) which in your case would
> be the filter conditions ocurring in the first of 2 MR -- r u certain there
> r records which would match your day and game_id conditions? I can't
> validate right now whether that behavior has been made more user friendly.
>
> On Jul 8, 2009 5:10 PM, "RSD" <ro...@gmail.com> wrote:
>
> hive> describe game_start;
>
>
>                          recordtime    string
> user_id int
> session_id      string
> host    string
> release string
> source  string
> ip      string
> ip_country      string
> game_id int
> challenge       int
> suggestion_rank int
> language        string
> day     string
> hive> select a.game_id, count(distinct(a.user_id)), count(1) from
> game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
> (a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
> 505 or a.game_id = 563) group by a.game_id;
> Total MapReduce jobs = 2
> Starting Job = job_200907071612_0568, Tracking URL =
> http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
> Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
> -Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
> ...
> Ended Job = job_200907071612_0568
>
> Job Submission failed with exception 'Input path doesnt exist :
>
> hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
>
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.ExecDriver
>
> there is a local directory /tmp/hive-someuser  (and in
> /tmp/someuser/hive.log is where i am logging)...
>
> is there something wrong with the syntax of the query?
>

Re: Odd error (Hive 0.17)

Posted by Frederick Oko <fr...@gmail.com>.
That error had been thrown in cases where a preceding phase of the Hive plan
produced no results (yet it did find data to scan) which in your case would
be the filter conditions ocurring in the first of 2 MR -- r u certain there
r records which would match your day and game_id conditions? I can't
validate right now whether that behavior has been made more user friendly.

On Jul 8, 2009 5:10 PM, "RSD" <ro...@gmail.com> wrote:

hive> describe game_start;


                         recordtime    string
user_id int
session_id      string
host    string
release string
source  string
ip      string
ip_country      string
game_id int
challenge       int
suggestion_rank int
language        string
day     string
hive> select a.game_id, count(distinct(a.user_id)), count(1) from
game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
(a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
505 or a.game_id = 563) group by a.game_id;
Total MapReduce jobs = 2
Starting Job = job_200907071612_0568, Tracking URL =
http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
-Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
...
Ended Job = job_200907071612_0568

Job Submission failed with exception 'Input path doesnt exist :
hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'

FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver
there is a local directory /tmp/hive-someuser  (and in
/tmp/someuser/hive.log is where i am logging)...

is there something wrong with the syntax of the query?

Re: Odd error (Hive 0.17)

Posted by Raghu Murthy <rm...@facebook.com>.
> Job Submission failed with exception 'Input path doesnt exist :
> hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'

Again, looks like the query is trying to access hdfs rather than local
directory. Can you paste the output of describe extended game_start?


On 7/8/09 5:10 PM, "RSD" <ro...@gmail.com> wrote:

> hive> describe game_start;
> 
> 
>                           recordtime    string
> user_id int    
> session_id      string
> host    string 
> release string 
> source  string 
> ip      string 
> ip_country      string
> game_id int    
> challenge       int
> suggestion_rank int
> language        string
> day     string 
> hive> select a.game_id, count(distinct(a.user_id)), count(1) from
> game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
> (a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
> 505 or a.game_id = 563) group by a.game_id;
> Total MapReduce jobs = 2
> Starting Job = job_200907071612_0568, Tracking URL =
> http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
> Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
> -Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
> ...
> Ended Job = job_200907071612_0568
> Job Submission failed with exception 'Input path doesnt exist :
> hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.ExecDriver
> 
> there is a local directory /tmp/hive-someuser  (and in
> /tmp/someuser/hive.log is where i am logging)...
> 
> is there something wrong with the syntax of the query?


Re: Odd error (Hive 0.17)

Posted by RSD <ro...@gmail.com>.
hive> describe game_start;


                          recordtime	string
user_id	int	
session_id	string	
host	string	
release	string	
source	string	
ip	string	
ip_country	string	
game_id	int	
challenge	int	
suggestion_rank	int	
language	string	
day	string	
hive> select a.game_id, count(distinct(a.user_id)), count(1) from
game_start a where a.day >= '2009-06-01' and a.day < '2009-06-03' and
(a.game_id = 501 or a.game_id = 502 or a.game_id = 504 or a.game_id =
505 or a.game_id = 563) group by a.game_id;
Total MapReduce jobs = 2
Starting Job = job_200907071612_0568, Tracking URL =
http://somehost:50030/jobdetails.jsp?jobid=job_200907071612_0568
Kill Command = /home/analytics/hadoop/dist/current/bin/hadoop job
-Dmapred.job.tracker=somehost:9001 -kill job_200907071612_0568
...
Ended Job = job_200907071612_0568
Job Submission failed with exception 'Input path doesnt exist :
hdfs://somehost:9000/tmp/hive-someuser/183603784.10002'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver

there is a local directory /tmp/hive-someuser  (and in
/tmp/someuser/hive.log is where i am logging)...

is there something wrong with the syntax of the query?

Re: Odd error (Hive 0.17)

Posted by Raghu Murthy <rm...@facebook.com>.
The path indicates that its looking at hdfs. If you are loading data which
is on your local file system, you should do something like LOAD DATA LOCAL
INPATH '/path/in/local/fs'.


On 7/8/09 4:25 PM, "RSD" <ro...@gmail.com> wrote:

> I've gotten new equipment to do an upgrade, but I need to keep my Hadoop
> cluster
> pushing data. :-)
> 
> I am getting the following:
> 
> Job Submission failed with exception 'Input path doesnt exist :
> hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.ExecDriver
> 
> My hive.log:
> 009-07-08 16:08:49,559 INFO  exec.ExecDriver
> (ExecDriver.java:execute(141)) - Adding input file
> /tmp/hive-username/183603784.10002
> 2009-07-08 16:08:49,705 ERROR exec.ExecDriver
> (SessionState.java:printError(242)) - Job Submission failed with
> exception 'Input path doesnt exist :
> hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002'
> org.apache.hadoop.mapred.InvalidInputException: Input path doesnt
> exist : hdfs://XXXXXXXXXXXXXXX:9000/tmp/hive-username/183603784.10002
> 
> 
> What does this error indicate? The files are in Hadoop that I want to
> work on; I've checked permissions, as my first thought was that
> I could not write to /tmp on the local machine, but that is not the
> case. Any one have a clue?