You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Pradeep Kamath (JIRA)" <ji...@apache.org> on 2008/10/14 03:19:44 UTC
[jira] Created: (PIG-492) There should be a way for Loader to refer
to the output of determineSchema() in the backend
There should be a way for Loader to refer to the output of determineSchema() in the backend
-------------------------------------------------------------------------------------------
Key: PIG-492
URL: https://issues.apache.org/jira/browse/PIG-492
Project: Pig
Issue Type: Bug
Affects Versions: types_branch
Reporter: Pradeep Kamath
Fix For: types_branch
Currently LoadFunc.determineSchema() is only called from LOLoad() at parse time in the front end. If the loader.getNext() needs to know what the output of determineSchema() was there is no way to get to it in the backend - there should be some way to get to it.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-492) There should be a way for Loader to
refer to the output of determineSchema() in the backend
Posted by "Pradeep Kamath (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12639484#action_12639484 ]
Pradeep Kamath commented on PIG-492:
------------------------------------
To add more to the initial description:
Here is a scenario showing the need for this: Consider a loader which samples the first 100 records (say) to determine the schema and returns a schema which is {(long, chararray)}. However at runtime while constructing the tuple, the getNext() code might need to know that determineSchema() had returned {(long, chararray)} at parse time and hence construct the tuple accordingly to consists of Long and chararray (String) fields. To be able to do this, the loader should somehow have access to the schema which was returned on the initial determineSchema() call.
> There should be a way for Loader to refer to the output of determineSchema() in the backend
> -------------------------------------------------------------------------------------------
>
> Key: PIG-492
> URL: https://issues.apache.org/jira/browse/PIG-492
> Project: Pig
> Issue Type: Bug
> Affects Versions: types_branch
> Reporter: Pradeep Kamath
> Fix For: types_branch
>
>
> Currently LoadFunc.determineSchema() is only called from LOLoad() at parse time in the front end. If the loader.getNext() needs to know what the output of determineSchema() was there is no way to get to it in the backend - there should be some way to get to it.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.