You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Jacek Pliszka (Jira)" <ji...@apache.org> on 2021/05/03 19:19:00 UTC
[jira] [Updated] (ARROW-12637) Range type (similar to Pandas
intervals)
[ https://issues.apache.org/jira/browse/ARROW-12637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jacek Pliszka updated ARROW-12637:
----------------------------------
Description:
This is wish for range(interval) extension type.
Type would describe range/interval with defined start and end and would have metadata
whether range is open/close/left-open/right-open.
Considerations:
* should it focus on integers and like or on floats as well?
* are open/semi-open cases worth the effort? Maybe not.
The most useful case might be closed range of 32bit/64bit and maybe 8bit/16bit unsigned integers packed in twice larger integer.
This would allow computations like:
* overlap joins (see R foverlaps)
* range algebra: unions, intersections, differences
* classification in Allen's interval algebra
* conversion to/from 2 columns/rows
* exploding (see Pandas explode)
was:
This is wish for range(interval) extension type.
Type would describe range/interval with defined start and end and would have metadata
whether range is open/close/left-open/right-open.
Considerations:
* should it focus on integers and like or on floats as well?
* are open/semi-open cases worth the effort? Maybe not.
The most useful case might be closed range of 32bit/64bit and maybe 8bit/16bit unsigned integers packed in twice larger integer.
This would allow computation kernels like:
* overlap joins (see R foverlaps)
* range algebra: unions, intersections, differences
* classification in Allen's interval algebra
* conversion to/from 2 columns/rows
* exploding (see Pandas explode)
> Range type (similar to Pandas intervals)
> ----------------------------------------
>
> Key: ARROW-12637
> URL: https://issues.apache.org/jira/browse/ARROW-12637
> Project: Apache Arrow
> Issue Type: Wish
> Components: C, C++
> Reporter: Jacek Pliszka
> Priority: Major
>
> This is wish for range(interval) extension type.
>
> Type would describe range/interval with defined start and end and would have metadata
> whether range is open/close/left-open/right-open.
>
> Considerations:
> * should it focus on integers and like or on floats as well?
> * are open/semi-open cases worth the effort? Maybe not.
>
> The most useful case might be closed range of 32bit/64bit and maybe 8bit/16bit unsigned integers packed in twice larger integer.
>
> This would allow computations like:
>
> * overlap joins (see R foverlaps)
> * range algebra: unions, intersections, differences
> * classification in Allen's interval algebra
> * conversion to/from 2 columns/rows
> * exploding (see Pandas explode)
>
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)