Posted to dev@pig.apache.org by "Amir Youssefi (JIRA)" <ji...@apache.org> on 2008/04/09 18:25:27 UTC

[jira] Created: (PIG-197) bz2 doesn't work

bz2 doesn't work
----------------

                 Key: PIG-197
                 URL: https://issues.apache.org/jira/browse/PIG-197
             Project: Pig
          Issue Type: Bug
            Reporter: Amir Youssefi


Storing a file in bzip2 doesn't work. Example:

a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';

or, to include a reducer/combiner:

a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (PIG-197) bz2 doesn't work

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olga Natkovich reassigned PIG-197:
----------------------------------

    Assignee: Benjamin Reed

Ben will be looking at this issue today and will provide an update.

> bz2 doesn't work
> ----------------
>
>                 Key: PIG-197
>                 URL: https://issues.apache.org/jira/browse/PIG-197
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Amir Youssefi
>            Assignee: Benjamin Reed
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in the following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or, to include a reducer/combiner:
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08



[jira] Commented: (PIG-197) bz2 doesn't work

Posted by "Amir Youssefi (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591985#action_12591985 ] 

Amir Youssefi commented on PIG-197:
-----------------------------------

Loading a bz2 file doesn't work either.

grunt> a = load 'test.txt.bz2';
grunt> dump a;
(BZh91AY&SY???
              ?0r?8 "??=Lj?E??}!RW??N$1?)


bzcat test.txt.bz2 shows correct results:
a1      1       5700
b1      2       2001
c2      2
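
The dump output above starts with "BZh91AY&SY", which is the standard bzip2 stream header followed by the first block marker, so the loader appears to have passed the compressed bytes through without decoding them. The following is a minimal sketch in Python (not Pig code) that reproduces that header with the standard bz2 module and shows a header-based check. The function names are hypothetical, and sniffing the header is only an illustration; according to the later comment in this thread, Pig's own logic relied on the file extension.

```python
import bz2

# Sample rows copied from the bzcat output above (tab-separated).
ROWS = b"a1\t1\t5700\nb1\t2\t2001\nc2\t2\n"

BZIP2_MAGIC = b"BZh"  # every bzip2 stream starts with these three bytes


def looks_like_bzip2(data):
    """Return True if data starts with a bzip2 stream header (BZh + level 1-9)."""
    return len(data) >= 4 and data[:3] == BZIP2_MAGIC and data[3:4] in b"123456789"


def read_rows(data):
    """Hypothetical loader: decompress if the header says bzip2, else read as text."""
    if looks_like_bzip2(data):
        data = bz2.decompress(data)
    return data.decode("utf-8").splitlines()


compressed = bz2.compress(ROWS)  # default compression level is 9
print(compressed[:10])           # b'BZh91AY&SY'
print(read_rows(compressed))     # the three original rows, decoded
```

Reading the raw bytes as text without the decompression step would produce output like the garbled dump shown above.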





[jira] Commented: (PIG-197) bz2 doesn't work

Posted by "Benjamin Reed (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12592298#action_12592298 ] 

Benjamin Reed commented on PIG-197:
-----------------------------------

I tracked the problem down to a runtime change in Hadoop that broke our code for determining the extension of the output directory. The fix and test cases for it are in PIG-151.
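
As a rough illustration of the kind of extension check described above, here is a minimal sketch in Python rather than Pig's Java. It does not reproduce Pig's code or the PIG-151 fix; the names wants_bzip2 and store, and the part file name, are hypothetical. It only shows how a writer that chooses compression from the output location's suffix silently falls back to plain text when the suffix it sees is not ".bz2".

```python
import bz2
import os
import tempfile


def wants_bzip2(location):
    """Hypothetical check: does the output location name end in .bz2?"""
    return location.rstrip("/").endswith(".bz2")


def store(rows, location, part_name="part-00000"):
    """Write rows as tab-separated text under location.

    The part file is bzip2-compressed only when the location ends in .bz2.
    """
    os.makedirs(location, exist_ok=True)
    path = os.path.join(location, part_name)
    opener = bz2.open if wants_bzip2(location) else open
    with opener(path, "wt", encoding="utf-8") as out:
        for row in rows:
            out.write("\t".join(str(field) for field in row) + "\n")
    return path


with tempfile.TemporaryDirectory() as tmp:
    path = store([("a1", 1, 5700)], os.path.join(tmp, "any_output.bz2"))
    with open(path, "rb") as f:
        print(f.read(3))  # b'BZh'
```

If the code performing this check is handed a location whose name no longer ends in ".bz2", the same call writes uncompressed text, which matches the symptom in the issue description. Whether that is exactly what the Hadoop change caused is an assumption here, not something the thread states.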




[jira] Resolved: (PIG-197) bz2 doesn't work

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olga Natkovich resolved PIG-197.
--------------------------------

    Resolution: Fixed

This issue is resolved by PIG-151. 




[jira] Updated: (PIG-197) bz2 doesn't work

Posted by "Amir Youssefi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amir Youssefi updated PIG-197:
------------------------------

    Description: 
Storing a file in bzip2 doesn't work. It stores output in regular plain text in the following examples:

a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';

or, to include a reducer/combiner:

a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';

--

pig -version
Apache Pig version 0.1.0-dev (r6455)
compiled Apr 03 2008, 16:10:08


  was:
Storing a file in bzip2 doesn't work. Example:

a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';

or, to include a reducer/combiner:

a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';



