You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Sriranjan Manjunath (JIRA)" <ji...@apache.org> on 2009/10/14 05:28:31 UTC

[jira] Commented: (PIG-1017) Converts strings to text in Pig

    [ https://issues.apache.org/jira/browse/PIG-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765380#action_12765380 ] 

Sriranjan Manjunath commented on PIG-1017:
------------------------------------------

Pigmix results before and after converting strings to text:

||Pigmix query||Trunk||Modified code||
|L1| 3:2|2:24|
|L2| 2:6|1:23|
|L3| 3:36|3:49|
|L4| 1:42|1:49|
|L5| 1:49|1:49|
|L6| 1:47|3:3|
|L7| 1:44|1:49|
|L8| 1:19|1:18|
|L9| 4:6|5:35|
|L10| 8:52|7:56|
|L11| 2:26|1:34|
|L12| 1:57|1:54|


> Converts strings to text in Pig
> -------------------------------
>
>                 Key: PIG-1017
>                 URL: https://issues.apache.org/jira/browse/PIG-1017
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Sriranjan Manjunath
>
> Strings in Java are UTF-16 and takes 2 bytes. Text (org.apache.hadoop.io.Text) stores the data in UTF-8 and could show significant reductions in memory.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.