You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/07/02 17:48:10 UTC

[jira] [Commented] (ORC-69) Add batch option support in orc-contents and orc-scan tools.

    [ https://issues.apache.org/jira/browse/ORC-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15360265#comment-15360265 ] 

ASF GitHub Bot commented on ORC-69:
-----------------------------------

Github user omalley commented on the issue:

    https://github.com/apache/orc/pull/38
  
    I buy that we may want to test the batch size for performance tests, but orc-scan is the important tool there since you don't really want to benchmark the conversion into JSON.
    
    With that in mind, I've made a variant of this patch that:
    * reverts the change to orc-contents
    * switches orc-scan to use getopt_long
    * adds some test infrastructure so that we can test the executables
    * adds tests of the nominal and off-nominal invocations of orc-scan.
    
    Please see my changes on https://github.com/omalley/orc/tree/orc-69


> Add batch option support in orc-contents and orc-scan tools. 
> -------------------------------------------------------------
>
>                 Key: ORC-69
>                 URL: https://issues.apache.org/jira/browse/ORC-69
>             Project: Orc
>          Issue Type: Improvement
>          Components: tools
>            Reporter: hongwu
>            Assignee: hongwu
>            Priority: Trivial
>
> The batchSize in FileScan.cc and FileContents.cc is hard coded with 1000. I add option named --batch to support input batchSize from command line.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)