You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2010/01/15 07:02:54 UTC

[jira] Resolved: (PIG-209) Indexes for accelerating joins

     [ https://issues.apache.org/jira/browse/PIG-209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alan Gates resolved PIG-209.
----------------------------

    Resolution: Won't Fix

At this point Pig is relying on storage formats such as Zebra to do indexing.  We have no near term plans to provide indexing inside Pig itself.

> Indexes for accelerating joins
> ------------------------------
>
>                 Key: PIG-209
>                 URL: https://issues.apache.org/jira/browse/PIG-209
>             Project: Pig
>          Issue Type: New Feature
>          Components: data
>            Reporter: John DeTreville
>
> Computing the inner join of a very large table (i.e., bag or mapping) with a smaller table can take time proportional to the size of the very large table. This time required can be greatly reduced if the very large table is indexed, taking time proportional to the size of the smaller table. It should be possible for clients to index tables for use by future joins.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.