Posted to issues@arrow.apache.org by "Hatem Helal (JIRA)" <ji...@apache.org> on 2019/02/22 13:12:00 UTC

[jira] [Updated] (ARROW-4661) [C++] Consolidate random string generators for use in benchmarks and unittests

     [ https://issues.apache.org/jira/browse/ARROW-4661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hatem Helal updated ARROW-4661:
-------------------------------
    Description: 
This was discussed here:

[https://github.com/apache/arrow/pull/3721]

For testing and benchmarking dictionary encoding, it's useful to control the number of repeated values, and it would also be good to optionally include null values.  The ability to provide a custom alphabet would be handy for generating strings with Unicode characters.

  was:
This was discussed here:

[https://github.com/apache/arrow/pull/3721]

For testing and benchmarking dictionary encoding, it's useful to control the number of repeated values, and it would also be good to optionally include null values.


> [C++] Consolidate random string generators for use in benchmarks and unittests
> ------------------------------------------------------------------------------
>
>                 Key: ARROW-4661
>                 URL: https://issues.apache.org/jira/browse/ARROW-4661
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Hatem Helal
>            Assignee: Hatem Helal
>            Priority: Minor
>             Fix For: 0.14.0
>
>
> This was discussed here:
> [https://github.com/apache/arrow/pull/3721]
> For testing and benchmarking dictionary encoding, it's useful to control the number of repeated values, and it would also be good to optionally include null values.  The ability to provide a custom alphabet would be handy for generating strings with Unicode characters.
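
To make the three requested knobs concrete (number of repeated values, optional nulls, custom alphabet), here is a minimal standalone sketch. It is not Arrow code and does not reflect what the linked pull request implements: RandomStringOptions, GenerateRandomStrings, and every field name are hypothetical, and it uses only the C++17 standard library. Nulls are modelled with std::optional rather than an Arrow validity bitmap, and the alphabet is a list of UTF-8 encoded symbols so that multi-byte characters are never split.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <random>
#include <string>
#include <vector>

// Hypothetical options struct; none of these names come from Arrow.
struct RandomStringOptions {
  std::size_t length = 100;      // number of values to generate
  std::size_t num_unique = 10;   // size of the pool the values are drawn from
  std::size_t min_symbols = 1;   // minimum symbols per string
  std::size_t max_symbols = 8;   // maximum symbols per string
  double null_probability = 0.0; // chance that a given slot is null
  // Each entry is one UTF-8 encoded symbol (possibly several bytes).
  std::vector<std::string> alphabet = {"a", "b", "c", "d"};
  std::uint32_t seed = 42;       // fixed seed for reproducible runs
};

// Draws `length` values from a pool of at most `num_unique` random strings.
// A small pool relative to `length` yields many repeats, which is the case
// of interest for dictionary encoding. The pool may contain accidental
// duplicates, so `num_unique` is an upper bound on the distinct count.
std::vector<std::optional<std::string>> GenerateRandomStrings(
    const RandomStringOptions& opts) {
  // Preconditions, checked in debug builds only.
  assert(!opts.alphabet.empty());
  assert(opts.num_unique > 0);
  assert(opts.min_symbols <= opts.max_symbols);

  std::mt19937 rng(opts.seed);
  std::uniform_int_distribution<std::size_t> pick_symbol(
      0, opts.alphabet.size() - 1);
  std::uniform_int_distribution<std::size_t> pick_count(opts.min_symbols,
                                                        opts.max_symbols);

  // Build the pool of candidate strings from the alphabet.
  std::vector<std::string> pool;
  pool.reserve(opts.num_unique);
  for (std::size_t i = 0; i < opts.num_unique; ++i) {
    std::string value;
    const std::size_t count = pick_count(rng);
    for (std::size_t j = 0; j < count; ++j) {
      value += opts.alphabet[pick_symbol(rng)];
    }
    pool.push_back(std::move(value));
  }

  // Fill the output, replacing slots with nulls at the requested rate.
  std::uniform_int_distribution<std::size_t> pick_value(0, pool.size() - 1);
  std::bernoulli_distribution is_null(opts.null_probability);
  std::vector<std::optional<std::string>> out;
  out.reserve(opts.length);
  for (std::size_t i = 0; i < opts.length; ++i) {
    if (is_null(rng)) {
      out.push_back(std::nullopt);
    } else {
      out.push_back(pool[pick_value(rng)]);
    }
  }
  return out;
}
```

A benchmark might set num_unique far below length to stress the dictionary path, and a unit test might pass an alphabet of multi-byte symbols to exercise Unicode handling. A consolidated helper inside Arrow itself would presumably return an Arrow array rather than a std::vector; that part is left out here.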



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)