You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by James Brown <jb...@buzzlogic.com> on 2010/12/06 20:15:54 UTC

RE: Encoding byte code 254 in pig.

in the bash script that calls the pig job, replace the interpreter definition:

#!/bin/bash 

with this:

#!/bin/bash -l



-----Original Message-----
From: Marilson Campos [mailto:mbc_active@yahoo.com] 
Sent: Wednesday, November 10, 2010 3:53 PM
To: user@pig.apache.org
Subject: Re: Encoding byte code 254 in pig.

Hey,I've got it working fine in the shell. When I installed on the crontab i get this error.






p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Consolas}



2010-11-10 12:29:02,684 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2999: Unexpected internal error. could not instantiate 'PigStorage' with arguments '[??]'
Free consulting. :-) Great!
Marilson Campos

650-222-7369

--- On Wed, 11/10/10, Stefan Will <st...@gmx.net> wrote:

From: Stefan Will <st...@gmx.net>
Subject: Re: Encoding byte code 254 in pig.
To: user@pig.apache.org
Date: Wednesday, November 10, 2010, 3:47 PM

I don't think character sequences actually work in Pig scripts. IIRC there
is even a bug reported for it. So using the unescaped character might be
your only choice.

-- Stefan

On 11/11/10 12:34 AM, "Marilson Campos" <mb...@yahoo.com> wrote:

>Hi,
>I have a file that has the char (254) as a separator. I can force the
>character into the file, but wanted to
>
>
>
>
>
>
>p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px Monaco}
>
>
>
>LOAD 'file.log.gz' USING PigStorage('þ')
>
>when encoding as UTF-8
>LOAD 'file.log.gz' USING PigStorage('\u00FE')
>It does not parse. I believe \u notation works only with chars that take
>1 byte.
>I need  to encode the xFE byte
>
>
>Thanks for the help in advance.
>Marilson