You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Amir Youssefi (JIRA)" <ji...@apache.org> on 2008/04/09 18:25:27 UTC
[jira] Created: (PIG-197) bz2 doesn't work
bz2 doesn't work
----------------
Key: PIG-197
URL: https://issues.apache.org/jira/browse/PIG-197
Project: Pig
Issue Type: Bug
Reporter: Amir Youssefi
Storing a file in bzip2 doesn't work. Example:
a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';
or to have reducer/combiner
a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Assigned: (PIG-197) bz2 doesn't work
Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Olga Natkovich reassigned PIG-197:
----------------------------------
Assignee: Benjamin Reed
Ben will be looking at this issue today and will provide update
> bz2 doesn't work
> ----------------
>
> Key: PIG-197
> URL: https://issues.apache.org/jira/browse/PIG-197
> Project: Pig
> Issue Type: Bug
> Reporter: Amir Youssefi
> Assignee: Benjamin Reed
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or to have reducer/combiner
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-197) bz2 doesn't work
Posted by "Amir Youssefi (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591985#action_12591985 ]
Amir Youssefi commented on PIG-197:
-----------------------------------
loading a bz2 doesn't work either.
grunt> a = load 'test.txt.bz2';
grunt> dump a;
(BZh91AY&SY???
?0r?8 "??=Lj?E??}!RW??N$1?)
bzcat test.txt.bz2 shows correct results:
a1 1 5700
b1 2 2001
c2 2
> bz2 doesn't work
> ----------------
>
> Key: PIG-197
> URL: https://issues.apache.org/jira/browse/PIG-197
> Project: Pig
> Issue Type: Bug
> Reporter: Amir Youssefi
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or to have reducer/combiner
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-197) bz2 doesn't work
Posted by "Benjamin Reed (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12592298#action_12592298 ]
Benjamin Reed commented on PIG-197:
-----------------------------------
I track the problem down to a runtime change in Hadoop, which made our code to figure out the extension of the output directory break. The fix and test cases for it are in PIG-151.
> bz2 doesn't work
> ----------------
>
> Key: PIG-197
> URL: https://issues.apache.org/jira/browse/PIG-197
> Project: Pig
> Issue Type: Bug
> Reporter: Amir Youssefi
> Assignee: Benjamin Reed
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or to have reducer/combiner
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (PIG-197) bz2 doesn't work
Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Olga Natkovich resolved PIG-197.
--------------------------------
Resolution: Fixed
This issue is resolved by PIG-151.
> bz2 doesn't work
> ----------------
>
> Key: PIG-197
> URL: https://issues.apache.org/jira/browse/PIG-197
> Project: Pig
> Issue Type: Bug
> Reporter: Amir Youssefi
> Assignee: Benjamin Reed
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or to have reducer/combiner
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (PIG-197) bz2 doesn't work
Posted by "Amir Youssefi (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Amir Youssefi updated PIG-197:
------------------------------
Description:
Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';
or to have reducer/combiner
a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';
--
pig -version
Apache Pig version 0.1.0-dev (r6455)
compiled Apr 03 2008, 16:10:08
was:
Storing a file in bzip2 doesn't work. Example:
a = load 'any_input'; -- use any sample input
store a into 'any_output.bz2';
or to have reducer/combiner
a = load 'any_input';
b = group a all parallel 1;
c = foreach b generate group, COUNT(a);
store c into 'any_output.bz2';
> bz2 doesn't work
> ----------------
>
> Key: PIG-197
> URL: https://issues.apache.org/jira/browse/PIG-197
> Project: Pig
> Issue Type: Bug
> Reporter: Amir Youssefi
>
> Storing a file in bzip2 doesn't work. It stores output in regular plain text in following examples:
> a = load 'any_input'; -- use any sample input
> store a into 'any_output.bz2';
> or to have reducer/combiner
> a = load 'any_input';
> b = group a all parallel 1;
> c = foreach b generate group, COUNT(a);
> store c into 'any_output.bz2';
> --
> pig -version
> Apache Pig version 0.1.0-dev (r6455)
> compiled Apr 03 2008, 16:10:08
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.