Posted to commits@pinot.apache.org by GitBox <gi...@apache.org> on 2022/01/14 21:22:44 UTC

[GitHub] [pinot] abhs50 opened a new issue #8027: Support batch ingestion of data from Hive table into Pinot

abhs50 opened a new issue #8027:
URL: https://github.com/apache/pinot/issues/8027


   Add support for loading data from a Hive table stored in the following format: org.apache.hadoop.mapred.TextInputFormat


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [pinot] kishoreg commented on issue #8027: Support batch ingestion of data from Hive table into Pinot

Posted by GitBox <gi...@apache.org>.
kishoreg commented on issue #8027:
URL: https://github.com/apache/pinot/issues/8027#issuecomment-1013502408


   @abhs50 The problem with TextInputFormat is that the reader only provides a line number and a byte[]. That byte[] can be JSON, CSV, etc., depending on the program that generated the file. So we still need a deserializer that knows how to turn the byte[] into a Pinot GenericRow.
   
   Do you know what the underlying format is in your case?
   
   




[GitHub] [pinot] abhs50 commented on issue #8027: Support batch ingestion of data from Hive table into Pinot

Posted by GitBox <gi...@apache.org>.
abhs50 commented on issue #8027:
URL: https://github.com/apache/pinot/issues/8027#issuecomment-1013517597


   It's CSV (tab-separated) in our case.
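
   To illustrate the deserialization step discussed above, here is a minimal Python sketch of turning one raw line (bytes) into a column-to-value mapping for the tab-separated case. This is not Pinot or Hive code: the function name `decode_line`, the column names, and the UTF-8 encoding are all hypothetical, and treating `\N` as the null marker is an assumption about how the table was written.

```python
from typing import Dict, List, Optional


def decode_line(
    raw: bytes,
    columns: List[str],
    delimiter: str = "\t",       # assumption: fields are tab-separated
    null_marker: str = "\\N",    # assumption: the writer used \N for NULL
    encoding: str = "utf-8",     # assumption: the file is UTF-8 text
) -> Dict[str, Optional[str]]:
    """Decode one text line into a {column: value} mapping.

    The reader hands over only bytes; the column names must come from
    somewhere else (for example, the table schema).
    """
    # Drop the trailing line terminator before splitting.
    text = raw.decode(encoding).rstrip("\r\n")
    fields = text.split(delimiter)
    if len(fields) != len(columns):
        raise ValueError(
            f"expected {len(columns)} fields, got {len(fields)}: {text!r}"
        )
    # Map the null marker to None; keep everything else as a string.
    return {
        name: (None if value == null_marker else value)
        for name, value in zip(columns, fields)
    }


# Hypothetical schema and input line, for illustration only.
row = decode_line(b"1\talice\t\\N\n", ["id", "name", "city"])
```

   The same bytes would need a different decoder if the writer had produced JSON or comma-separated text, which is why the input format alone is not enough to build a row.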

