You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-user@hadoop.apache.org by Varsha Raveendran <va...@gmail.com> on 2013/04/01 08:43:39 UTC
Word count on cluster configuration
Hello!
I did the setup for a cluster configuration of Hadoop. After running the
word count example the output shown in the part-r-00000 file is as shown :
hduser@MT2012158:/usr/local/hadoop$ head
/tmp/gutenberg-output/gutenberg-output
40
2
4
��� � � � �@�� 2
��� � � � �@�@�� 1
���� � � � �@�@�� 1
P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
Leonardo 1
P�������� � � � � �������� ���������������������EXTH � t 1
�P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
Leonardo 1
�P�������� � � � ������������ � � � � �����EXTH � t 1
Can you please tell me why this is happening?
--
*-Varsha *
Re: Word count on cluster configuration
Posted by Varsha Raveendran <va...@gmail.com>.
Thanks!
On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <ye...@hotmail.com> wrote:
> because many of the “words” are unicode, check the next blog.
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
> *From:* Varsha Raveendran <va...@gmail.com>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* user@hadoop.apache.org
> *Subject:* Word count on cluster configuration
>
> Hello!
>
> I did the setup for a cluster configuration of Hadoop. After running the
> word count example the output shown in the part-r-00000 file is as shown :
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
> 40
> 2
> 4
> ��� � � � �@�� 2
> ��� � � � �@�@�� 1
> ���� � � � �@�@�� 1
> P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
> Leonardo 1
> P�������� � � � � �������� ���������������������EXTH � t 1
> �P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
> Leonardo 1
> �P�������� � � � ������������ � � � � �����EXTH � t 1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>
--
*-Varsha *
Re: Word count on cluster configuration
Posted by Varsha Raveendran <va...@gmail.com>.
Thanks!
On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <ye...@hotmail.com> wrote:
> because many of the “words” are unicode, check the next blog.
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
> *From:* Varsha Raveendran <va...@gmail.com>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* user@hadoop.apache.org
> *Subject:* Word count on cluster configuration
>
> Hello!
>
> I did the setup for a cluster configuration of Hadoop. After running the
> word count example the output shown in the part-r-00000 file is as shown :
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
> 40
> 2
> 4
> ��� � � � �@�� 2
> ��� � � � �@�@�� 1
> ���� � � � �@�@�� 1
> P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
> Leonardo 1
> P�������� � � � � �������� ���������������������EXTH � t 1
> �P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
> Leonardo 1
> �P�������� � � � ������������ � � � � �����EXTH � t 1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>
--
*-Varsha *
Re: Word count on cluster configuration
Posted by Varsha Raveendran <va...@gmail.com>.
Thanks!
On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <ye...@hotmail.com> wrote:
> because many of the “words” are unicode, check the next blog.
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
> *From:* Varsha Raveendran <va...@gmail.com>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* user@hadoop.apache.org
> *Subject:* Word count on cluster configuration
>
> Hello!
>
> I did the setup for a cluster configuration of Hadoop. After running the
> word count example the output shown in the part-r-00000 file is as shown :
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
> 40
> 2
> 4
> ��� � � � �@�� 2
> ��� � � � �@�@�� 1
> ���� � � � �@�@�� 1
> P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
> Leonardo 1
> P�������� � � � � �������� ���������������������EXTH � t 1
> �P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
> Leonardo 1
> �P�������� � � � ������������ � � � � �����EXTH � t 1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>
--
*-Varsha *
Re: Word count on cluster configuration
Posted by Varsha Raveendran <va...@gmail.com>.
Thanks!
On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <ye...@hotmail.com> wrote:
> because many of the “words” are unicode, check the next blog.
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
> *From:* Varsha Raveendran <va...@gmail.com>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* user@hadoop.apache.org
> *Subject:* Word count on cluster configuration
>
> Hello!
>
> I did the setup for a cluster configuration of Hadoop. After running the
> word count example the output shown in the part-r-00000 file is as shown :
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
> 40
> 2
> 4
> ��� � � � �@�� 2
> ��� � � � �@�@�� 1
> ���� � � � �@�@�� 1
> P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
> Leonardo 1
> P�������� � � � � �������� ���������������������EXTH � t 1
> �P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
> Leonardo 1
> �P�������� � � � ������������ � � � � �����EXTH � t 1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>
--
*-Varsha *
Re: Word count on cluster configuration
Posted by Wenming Ye <ye...@hotmail.com>.
because many of the “words” are unicode, check the next blog.
http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
From: Varsha Raveendran
Sent: Sunday, March 31, 2013 11:43 PM
To: user@hadoop.apache.org
Subject: Word count on cluster configuration
Hello!
I did the setup for a cluster configuration of Hadoop. After running the word count example the output shown in the part-r-00000 file is as shown :
hduser@MT2012158:/usr/local/hadoop$ head /tmp/gutenberg-output/gutenberg-output
40
2
4
��� � � � �@�� 2
��� � � � �@�@�� 1
���� � � � �@�@�� 1
P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d Leonardo 1
P�������� � � � � �������� ���������������������EXTH � t 1
�P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d Leonardo 1
�P�������� � � � ������������ � � � � �����EXTH � t 1
Can you please tell me why this is happening?
--
-Varsha
Re: Word count on cluster configuration
Posted by Wenming Ye <ye...@hotmail.com>.
because many of the “words” are unicode, check the next blog.
http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
From: Varsha Raveendran
Sent: Sunday, March 31, 2013 11:43 PM
To: user@hadoop.apache.org
Subject: Word count on cluster configuration
Hello!
I did the setup for a cluster configuration of Hadoop. After running the word count example the output shown in the part-r-00000 file is as shown :
hduser@MT2012158:/usr/local/hadoop$ head /tmp/gutenberg-output/gutenberg-output
40
2
4
��� � � � �@�� 2
��� � � � �@�@�� 1
���� � � � �@�@�� 1
P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d Leonardo 1
P�������� � � � � �������� ���������������������EXTH � t 1
�P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d Leonardo 1
�P�������� � � � ������������ � � � � �����EXTH � t 1
Can you please tell me why this is happening?
--
-Varsha
Re: Word count on cluster configuration
Posted by Wenming Ye <ye...@hotmail.com>.
because many of the “words” are unicode, check the next blog.
http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
From: Varsha Raveendran
Sent: Sunday, March 31, 2013 11:43 PM
To: user@hadoop.apache.org
Subject: Word count on cluster configuration
Hello!
I did the setup for a cluster configuration of Hadoop. After running the word count example the output shown in the part-r-00000 file is as shown :
hduser@MT2012158:/usr/local/hadoop$ head /tmp/gutenberg-output/gutenberg-output
40
2
4
��� � � � �@�� 2
��� � � � �@�@�� 1
���� � � � �@�@�� 1
P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d Leonardo 1
P�������� � � � � �������� ���������������������EXTH � t 1
�P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d Leonardo 1
�P�������� � � � ������������ � � � � �����EXTH � t 1
Can you please tell me why this is happening?
--
-Varsha
Re: Word count on cluster configuration
Posted by Wenming Ye <ye...@hotmail.com>.
because many of the “words” are unicode, check the next blog.
http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
From: Varsha Raveendran
Sent: Sunday, March 31, 2013 11:43 PM
To: user@hadoop.apache.org
Subject: Word count on cluster configuration
Hello!
I did the setup for a cluster configuration of Hadoop. After running the word count example the output shown in the part-r-00000 file is as shown :
hduser@MT2012158:/usr/local/hadoop$ head /tmp/gutenberg-output/gutenberg-output
40
2
4
��� � � � �@�� 2
��� � � � �@�@�� 1
���� � � � �@�@�� 1
P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d Leonardo 1
P�������� � � � � �������� ���������������������EXTH � t 1
�P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d Leonardo 1
�P�������� � � � ������������ � � � � �����EXTH � t 1
Can you please tell me why this is happening?
--
-Varsha