Posted to dev@pig.apache.org by "Pradeep Kamath (JIRA)" <ji...@apache.org> on 2008/10/14 03:19:44 UTC

[jira] Created: (PIG-492) There should be a way for Loader to refer to the output of determineSchema() in the backend

There should be a way for Loader to refer to the output of determineSchema() in the backend
-------------------------------------------------------------------------------------------

                 Key: PIG-492
                 URL: https://issues.apache.org/jira/browse/PIG-492
             Project: Pig
          Issue Type: Bug
    Affects Versions: types_branch
            Reporter: Pradeep Kamath
             Fix For: types_branch


Currently, LoadFunc.determineSchema() is called only from LOLoad() at parse time in the front end. If the loader's getNext() needs to know what determineSchema() returned, it has no way to get that in the backend. There should be some way to access it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-492) There should be a way for Loader to refer to the output of determineSchema() in the backend

Posted by "Pradeep Kamath (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12639484#action_12639484 ] 

Pradeep Kamath commented on PIG-492:
------------------------------------

To add more to the initial description:
Here is a scenario showing the need for this: consider a loader which samples the first 100 records (say) to determine the schema and returns the schema {(long, chararray)}. At runtime, while constructing the tuple, the getNext() code might need to know that determineSchema() returned {(long, chararray)} at parse time, so that it can construct the tuple to consist of Long and chararray (String) fields. To be able to do this, the loader needs access to the schema that was returned by the initial determineSchema() call.
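
To make the scenario concrete, here is a minimal, self-contained sketch. It does not use the Pig API: FieldType, serializeSchema(), deserializeSchema(), and the simplified determineSchema()/getNext() signatures are hypothetical stand-ins, and carrying the schema to the backend as a string is only one assumed mechanism, not something Pig is known to provide.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SchemaAwareLoaderSketch {

    // Hypothetical stand-in for Pig field types; only the two from the example.
    enum FieldType { LONG, CHARARRAY }

    // Front end: a column is LONG only if every sampled value parses as a long.
    static List<FieldType> determineSchema(List<String[]> sample) {
        List<FieldType> schema = new ArrayList<>();
        if (sample.isEmpty()) {
            return schema;
        }
        int width = sample.get(0).length;
        for (int col = 0; col < width; col++) {
            FieldType type = FieldType.LONG;
            for (String[] row : sample) {
                if (col >= row.length || !isLong(row[col])) {
                    type = FieldType.CHARARRAY;
                    break;
                }
            }
            schema.add(type);
        }
        return schema;
    }

    static boolean isLong(String s) {
        try {
            Long.parseLong(s);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    // Assumption: the schema travels from front end to backend as a plain string.
    static String serializeSchema(List<FieldType> schema) {
        List<String> names = new ArrayList<>();
        for (FieldType t : schema) {
            names.add(t.name());
        }
        return String.join(",", names);
    }

    static List<FieldType> deserializeSchema(String s) {
        List<FieldType> schema = new ArrayList<>();
        if (s.isEmpty()) {
            return schema;
        }
        for (String name : s.split(",")) {
            schema.add(FieldType.valueOf(name));
        }
        return schema;
    }

    // Backend: build the tuple according to the schema decided at parse time.
    // A value that does not match the schema makes Long.valueOf throw here.
    static List<Object> getNext(String[] raw, List<FieldType> schema) {
        List<Object> tuple = new ArrayList<>();
        for (int i = 0; i < schema.size(); i++) {
            String value = i < raw.length ? raw[i] : null;
            if (value == null) {
                tuple.add(null);
            } else if (schema.get(i) == FieldType.LONG) {
                tuple.add(Long.valueOf(value));
            } else {
                tuple.add(value);
            }
        }
        return tuple;
    }

    public static void main(String[] args) {
        List<String[]> sample = Arrays.asList(
                new String[] {"1", "alice"}, new String[] {"2", "bob"});
        String carried = serializeSchema(determineSchema(sample)); // front end
        List<Object> tuple =
                getNext(new String[] {"42", "carol"}, deserializeSchema(carried)); // backend
        System.out.println(carried + " -> " + tuple);
    }
}
```

Without the carried schema, getNext() would have to re-sample the input or guess the type of each field, which is exactly the gap described above.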

