Posted to user@hadoop.apache.org by Robin Verlangen <ro...@us2.nl> on 2013/06/19 07:54:24 UTC

Unexpected end of input stream: how to locate related file(s)?

Hi there,

How can I locate the files that cause these errors in my Map/Reduce jobs?

java.io.IOException: java.io.EOFException: Unexpected end of input stream
	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121)
	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77)
	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:276)
	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79)
	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33)
	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:108)
	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:215)
	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:200)
	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
	at org.apache.hadoop


Best regards,

Robin Verlangen
Data Architect

W http://www.robinverlangen.nl
E robin@us2.nl

What is CloudPelican? <http://goo.gl/HkB3D>

Disclaimer: The information contained in this message and attachments is
intended solely for the attention and use of the named addressee and may be
confidential. If you are not the intended recipient, you are reminded that
the information remains the property of the sender. You must not use,
disclose, distribute, copy, print or rely on this e-mail. If you have
received this message in error, please contact the sender immediately and
irrevocably delete this message and any copies.

Re: Unexpected end of input stream: how to locate related file(s)?

Posted by Nitin Pawar <ni...@gmail.com>.
I think they are using Hive, so he could look at the "INPUT__FILE__NAME" virtual column in Hive.

I have never tried it myself, so please pardon me if I am wrong.
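
Something along these lines might work to see which files a query actually reads (the table name is made up, and as I said I have not tried it):

    hive -e "SELECT INPUT__FILE__NAME, COUNT(*) FROM my_table GROUP BY INPUT__FILE__NAME;"

If the query dies on the broken file you would still need the task log, but the file names it does return should at least narrow down where to look.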


On Wed, Jun 19, 2013 at 11:58 AM, Arun C Murthy <ac...@hortonworks.com> wrote:

> Robin,
>
> On Jun 18, 2013, at 11:12 PM, Robin Verlangen <ro...@us2.nl> wrote:
>
> Hi Arun,
>
> Thank you for your reply. We run Hadoop 2.0.0 with MapReduce 0.20 packaged
> by Cloudera.
>
> Do you know where to find the log files related to a specific task, is
> that also in the folder /var/log/hadoop-0.20-mapreduce/userlogs/job_ID/
>
>
> Unfortunately it's hard to say, personally - try looking
> in /var/log/hadoop-0.20-mapreduce/userlogs/job_ID/taskAttemptID/syslog.
>
> At least on hadoop-1.2+ and hadoop-2.x you should see the following log
> msg at the top of the map-tasks' log files:
>
> LOG.info("Processing split: " + inputSplit);
>
> hth,
> Arun
>
>
> Best regards,
>
> Robin Verlangen
> *Data Architect*
> *
> *
> W http://www.robinverlangen.nl
> E robin@us2.nl
>
> <http://goo.gl/Lt7BC>
> *What is CloudPelican? <http://goo.gl/HkB3D>*
>
> Disclaimer: The information contained in this message and attachments is
> intended solely for the attention and use of the named addressee and may be
> confidential. If you are not the intended recipient, you are reminded that
> the information remains the property of the sender. You must not use,
> disclose, distribute, copy, print or rely on this e-mail. If you have
> received this message in error, please contact the sender immediately and
> irrevocably delete this message and any copies.
>
>
> On Wed, Jun 19, 2013 at 7:59 AM, Arun C Murthy <ac...@hortonworks.com> wrote:
>
>> What version of MapReduce are you using? At the beginning of the log-file
>> you should be able to see a log msg with the input-split file name for the
>> map.
>>
>> thanks,
>> Arun
>>
>> On Jun 18, 2013, at 10:54 PM, Robin Verlangen <ro...@us2.nl> wrote:
>>
>> Hi there,
>>
>> How can I locate the files that cause these errors in my Map/Reduce jobs?
>>
>> java.io.IOException: java.io.EOFException: Unexpected end of input stream
>> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121)
>> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77)
>> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:276)
>> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79)
>> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33)
>> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:108)
>> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:215)
>> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:200)
>> 	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
>> 	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
>> 	at org.apache.hadoop
>>
>>
>> Best regards,
>>
>> Robin Verlangen
>> *Data Architect*
>> *
>> *
>> W http://www.robinverlangen.nl
>> E robin@us2.nl
>>
>> <http://goo.gl/Lt7BC>
>> *What is CloudPelican? <http://goo.gl/HkB3D>*
>>
>> Disclaimer: The information contained in this message and attachments is
>> intended solely for the attention and use of the named addressee and may be
>> confidential. If you are not the intended recipient, you are reminded that
>> the information remains the property of the sender. You must not use,
>> disclose, distribute, copy, print or rely on this e-mail. If you have
>> received this message in error, please contact the sender immediately and
>> irrevocably delete this message and any copies.
>>
>>
>>  --
>> Arun C. Murthy
>> Hortonworks Inc.
>> http://hortonworks.com/
>>
>>
>>
>
> --
> Arun C. Murthy
> Hortonworks Inc.
> http://hortonworks.com/
>
>
>


-- 
Nitin Pawar


Re: Unexpected end of input stream: how to locate related file(s)?

Posted by Arun C Murthy <ac...@hortonworks.com>.
Robin,

On Jun 18, 2013, at 11:12 PM, Robin Verlangen <ro...@us2.nl> wrote:

> Hi Arun,
> 
> Thank you for your reply. We run Hadoop 2.0.0 with MapReduce 0.20 packaged by Cloudera.
> 
> Do you know where to find the log files related to a specific task, is that also in the folder /var/log/hadoop-0.20-mapreduce/userlogs/job_ID/

Unfortunately it's hard to say; personally, I would try looking in /var/log/hadoop-0.20-mapreduce/userlogs/job_ID/taskAttemptID/syslog.

At least on hadoop-1.2+ and hadoop-2.x you should see the following log msg at the top of the map-tasks' log files:
LOG.info("Processing split: " + inputSplit);
hth,
Arun

> 
> Best regards, 
> 
> Robin Verlangen
> Data Architect
> 
> W http://www.robinverlangen.nl
> E robin@us2.nl
> 
> 
> What is CloudPelican?
> 
> Disclaimer: The information contained in this message and attachments is intended solely for the attention and use of the named addressee and may be confidential. If you are not the intended recipient, you are reminded that the information remains the property of the sender. You must not use, disclose, distribute, copy, print or rely on this e-mail. If you have received this message in error, please contact the sender immediately and irrevocably delete this message and any copies.
> 
> 
> On Wed, Jun 19, 2013 at 7:59 AM, Arun C Murthy <ac...@hortonworks.com> wrote:
> What version of MapReduce are you using? At the beginning of the log-file you should be able to see a log msg with the input-split file name for the map.
> 
> thanks,
> Arun
> 
> On Jun 18, 2013, at 10:54 PM, Robin Verlangen <ro...@us2.nl> wrote:
> 
>> Hi there,
>> 
>> How can I locate the files that cause these errors in my Map/Reduce jobs?
>> 
>> java.io.IOException: java.io.EOFException: Unexpected end of input stream
>> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121)
>> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77)
>> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:276)
>> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79)
>> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33)
>> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:108)
>> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:215)
>> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:200)
>> 	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
>> 	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
>> 	at org.apache.hadoop
>> 
>> Best regards, 
>> 
>> Robin Verlangen
>> Data Architect
>> 
>> W http://www.robinverlangen.nl
>> E robin@us2.nl
>> 
>> 
>> What is CloudPelican?
>> 
>> Disclaimer: The information contained in this message and attachments is intended solely for the attention and use of the named addressee and may be confidential. If you are not the intended recipient, you are reminded that the information remains the property of the sender. You must not use, disclose, distribute, copy, print or rely on this e-mail. If you have received this message in error, please contact the sender immediately and irrevocably delete this message and any copies.
> 
> --
> Arun C. Murthy
> Hortonworks Inc.
> http://hortonworks.com/
> 
> 
> 

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/



Re: Unexpected end of input stream: how to locate related file(s)?

Posted by Robin Verlangen <ro...@us2.nl>.
Hi Arun,

Thank you for your reply. We run Hadoop 2.0.0 with MapReduce 0.20 packaged
by Cloudera.

Do you know where to find the log files for a specific task? Are they also in
the folder /var/log/hadoop-0.20-mapreduce/userlogs/job_ID/?

Best regards,

Robin Verlangen
Data Architect

W http://www.robinverlangen.nl
E robin@us2.nl

What is CloudPelican? <http://goo.gl/HkB3D>

Disclaimer: The information contained in this message and attachments is
intended solely for the attention and use of the named addressee and may be
confidential. If you are not the intended recipient, you are reminded that
the information remains the property of the sender. You must not use,
disclose, distribute, copy, print or rely on this e-mail. If you have
received this message in error, please contact the sender immediately and
irrevocably delete this message and any copies.


On Wed, Jun 19, 2013 at 7:59 AM, Arun C Murthy <ac...@hortonworks.com> wrote:

> What version of MapReduce are you using? At the beginning of the log-file
> you should be able to see a log msg with the input-split file name for the
> map.
>
> thanks,
> Arun
>
> On Jun 18, 2013, at 10:54 PM, Robin Verlangen <ro...@us2.nl> wrote:
>
> Hi there,
>
> How can I locate the files that cause these errors in my Map/Reduce jobs?
>
> java.io.IOException: java.io.EOFException: Unexpected end of input stream
> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121)
> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77)
> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:276)
> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79)
> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33)
> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:108)
> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:215)
> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:200)
> 	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
> 	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
> 	at org.apache.hadoop
>
>
> Best regards,
>
> Robin Verlangen
> *Data Architect*
> *
> *
> W http://www.robinverlangen.nl
> E robin@us2.nl
>
> <http://goo.gl/Lt7BC>
> *What is CloudPelican? <http://goo.gl/HkB3D>*
>
> Disclaimer: The information contained in this message and attachments is
> intended solely for the attention and use of the named addressee and may be
> confidential. If you are not the intended recipient, you are reminded that
> the information remains the property of the sender. You must not use,
> disclose, distribute, copy, print or rely on this e-mail. If you have
> received this message in error, please contact the sender immediately and
> irrevocably delete this message and any copies.
>
>
> --
> Arun C. Murthy
> Hortonworks Inc.
> http://hortonworks.com/
>
>
>

Re: Unexpected end of input stream: how to locate related file(s)?

Posted by Arun C Murthy <ac...@hortonworks.com>.
What version of MapReduce are you using? At the beginning of the log-file you should be able to see a log msg with the input-split file name for the map.

thanks,
Arun

On Jun 18, 2013, at 10:54 PM, Robin Verlangen <ro...@us2.nl> wrote:

> Hi there,
> 
> How can I locate the files that cause these errors in my Map/Reduce jobs?
> 
> java.io.IOException: java.io.EOFException: Unexpected end of input stream
> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121)
> 	at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77)
> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:276)
> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79)
> 	at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33)
> 	at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:108)
> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:215)
> 	at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:200)
> 	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
> 	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
> 	at org.apache.hadoop
> 
> Best regards, 
> 
> Robin Verlangen
> Data Architect
> 
> W http://www.robinverlangen.nl
> E robin@us2.nl
> 
> 
> What is CloudPelican?
> 
> Disclaimer: The information contained in this message and attachments is intended solely for the attention and use of the named addressee and may be confidential. If you are not the intended recipient, you are reminded that the information remains the property of the sender. You must not use, disclose, distribute, copy, print or rely on this e-mail. If you have received this message in error, please contact the sender immediately and irrevocably delete this message and any copies.

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/


