Posted to jira@arrow.apache.org by "Jacek Pliszka (Jira)" <ji...@apache.org> on 2021/05/04 18:56:00 UTC

[jira] [Comment Edited] (ARROW-12637) Range type (anchored variable-length interval)

    [ https://issues.apache.org/jira/browse/ARROW-12637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17339223#comment-17339223 ] 

Jacek Pliszka edited comment on ARROW-12637 at 5/4/21, 6:55 PM:
----------------------------------------------------------------

I've just realized that what I would like to propose differs from pandas: in pandas, IntervalIndex is fixed length.

I would like to propose variable-length intervals, for example

(0, 1], (1, 5], (7, 10]

with gaps possible in between.

 

The representation would be a pair of numbers: one for the left bound and one for the right bound.
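
A minimal Python sketch of this pair representation, using the example intervals above. The names Interval, contains and gaps are hypothetical helpers made up for illustration, not part of Arrow or pandas, and the left-open/right-closed reading is assumed from the "(a, b]" notation.

```python
from typing import List, Tuple

# Hypothetical illustration: an interval is a (left, right) pair,
# read as left-open, right-closed, i.e. (left, right].
Interval = Tuple[int, int]


def contains(interval: Interval, x: int) -> bool:
    """True if x lies in (left, right]."""
    left, right = interval
    return left < x <= right


def gaps(intervals: List[Interval]) -> List[Interval]:
    """Uncovered stretches between consecutive non-overlapping intervals,
    expressed in the same (left, right] convention."""
    ordered = sorted(intervals)
    return [
        (prev_right, next_left)
        for (_, prev_right), (next_left, _) in zip(ordered, ordered[1:])
        if prev_right < next_left
    ]


intervals = [(0, 1), (1, 5), (7, 10)]

# Each interval has its own length; nothing forces them to be equal.
lengths = [right - left for left, right in intervals]

# Only (5, 7] is left uncovered by the example intervals.
uncovered = gaps(intervals)
```

Storing only the two bounds keeps every interval independent of its neighbours, which is what permits both differing lengths and gaps.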

 


was (Author: jacek.pliszka):
I've just realized that what I would like to propose differs from pandas - in pandas IntervalIndex is fixed length.

And I would like to propose flexible length like

(0, 1],  (1, 7], (7, 10]

> Range type (anchored variable-length interval)
> ----------------------------------------------
>
>                 Key: ARROW-12637
>                 URL: https://issues.apache.org/jira/browse/ARROW-12637
>             Project: Apache Arrow
>          Issue Type: Wish
>          Components: C, C++
>            Reporter: Jacek Pliszka
>            Priority: Major
>
> This is a wish for a range (interval) extension type.
>  
> The type would describe a range/interval with a defined start and end, and would carry metadata
> indicating whether the range is open, closed, left-open, or right-open.
>  
> Considerations:
>  * should it focus on integers and the like, or on floats as well?
>  * are open/semi-open cases worth the effort? Maybe not.
>  
> The most useful case (a big low-hanging fruit) might be a closed range of 32-bit/64-bit, and maybe 8-bit/16-bit, unsigned integers packed into an integer twice as large, which would allow use with dates/times via ordinals.
>  
> This would allow computations like:
>  
>  * overlap joins (see R foverlaps)
>  * range algebra: unions, intersections, differences
>  * classification in Allen's interval algebra
>  * conversion to/from 2 columns/rows
>  * exploding (see Pandas explode)
>  
>  
>  
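
The closed-range case from the issue description (two unsigned 32-bit bounds packed into one 64-bit integer, with dates mapped through ordinals) could look roughly like the Python sketch below. The function names pack, unpack and intersect are hypothetical, and placing the start in the high 32 bits is an assumption made here, not something the issue specifies.

```python
from datetime import date
from typing import Optional, Tuple

MASK32 = 0xFFFFFFFF  # largest unsigned 32-bit value


def pack(start: int, end: int) -> int:
    """Pack a closed range [start, end] of unsigned 32-bit integers
    into a single 64-bit integer (start in the high bits: an assumption)."""
    if not (0 <= start <= end <= MASK32):
        raise ValueError("need 0 <= start <= end <= 2**32 - 1")
    return (start << 32) | end


def unpack(packed: int) -> Tuple[int, int]:
    """Recover (start, end) from a packed range."""
    return packed >> 32, packed & MASK32


def intersect(a: int, b: int) -> Optional[int]:
    """Intersection of two packed closed ranges, or None if they are disjoint."""
    a_start, a_end = unpack(a)
    b_start, b_end = unpack(b)
    start, end = max(a_start, b_start), min(a_end, b_end)
    return pack(start, end) if start <= end else None


# Dates become integers through their proleptic Gregorian ordinals.
first = pack(date(2021, 5, 1).toordinal(), date(2021, 5, 4).toordinal())
second = pack(date(2021, 5, 3).toordinal(), date(2021, 5, 10).toordinal())

overlap = intersect(first, second)
overlap_dates = tuple(date.fromordinal(n) for n in unpack(overlap))
```

With the start in the high bits, comparing packed values as plain integers orders ranges by start and then by end, which may be convenient for sort-based overlap joins.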



--
This message was sent by Atlassian Jira
(v8.3.4#803005)