You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Maarten Breddels (Jira)" <ji...@apache.org> on 2020/12/18 14:44:00 UTC

[jira] [Commented] (ARROW-10557) [C++] Add scalar string slicing/substring kernel

    [ https://issues.apache.org/jira/browse/ARROW-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17251793#comment-17251793 ] 

Maarten Breddels commented on ARROW-10557:
------------------------------------------

This would be easier to implement using the tools in utf8.h (e.g. UTF8FindIfReverse) in that PR

> [C++] Add scalar string slicing/substring kernel 
> -------------------------------------------------
>
>                 Key: ARROW-10557
>                 URL: https://issues.apache.org/jira/browse/ARROW-10557
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Maarten Breddels
>            Assignee: Maarten Breddels
>            Priority: Major
>
> This should implement slicing scalar string values of strings arrays with Python semantics with start, stop ,step arguments. This may seem similar to lists, or binary array, but the string length semantics enter into this kernel, which does not need to equal the number of bytes, nor the number of codepoints (accents, etc should be skipped).
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)