Posted to jira@arrow.apache.org by "Kouhei Sutou (Jira)" <ji...@apache.org> on 2021/10/17 20:17:00 UTC

[jira] [Commented] (ARROW-14088) [GLib][Ruby][Dataset] Add support for filter

    [ https://issues.apache.org/jira/browse/ARROW-14088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17429764#comment-17429764 ] 

Kouhei Sutou commented on ARROW-14088:
--------------------------------------

{noformat}
Arrow::Table.load("parquet_dataset", format: :parquet, filter: ["greater_equal", :a, 7])
{noformat}

{{filter}} uses an S-expression style for now. We'll improve the API in ARROW-14360 to support a form such as {{filter: ->() \{a >= 7\}}}.
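
To illustrate the S-expression shape ({{[function_name, field, value]}}), here is a minimal pure-Ruby sketch that evaluates such an expression against plain hashes. It is not how Red Arrow evaluates filters: the {{OPERATORS}} table and the {{match?}} helper are hypothetical names made up for this example, and only the comparison names listed in the table are handled.

```ruby
# Hypothetical sketch: evaluate an S-expression style filter against plain
# Ruby hashes. This is NOT Red Arrow's implementation; it only illustrates
# the [function_name, field, value] shape.

# Assumed mapping from comparison function names to Ruby operators.
OPERATORS = {
  "equal"         => :==,
  "less"          => :<,
  "less_equal"    => :<=,
  "greater"       => :>,
  "greater_equal" => :>=,
}.freeze

# Returns true when `row` (a Hash keyed by column name) satisfies `expression`.
def match?(expression, row)
  name, field, value = expression
  operator = OPERATORS.fetch(name) { raise ArgumentError, "unsupported function: #{name}" }
  row.fetch(field).public_send(operator, value)
end

rows = [{a: 5}, {a: 7}, {a: 9}]
# Keep only the rows where a >= 7.
selected = rows.select { |row| match?(["greater_equal", :a, 7], row) }
p selected
```

The array form keeps the filter as plain data, which is easy to pass through keyword arguments. A block-based form such as the one planned in ARROW-14360 would read more naturally but needs an expression builder behind it.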

> [GLib][Ruby][Dataset] Add support for filter
> --------------------------------------------
>
>                 Key: ARROW-14088
>                 URL: https://issues.apache.org/jira/browse/ARROW-14088
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: GLib, Ruby
>            Reporter: Dominic Sisneros
>            Assignee: Kouhei Sutou
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 6.0.0
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> In Python:
> dataset = ds.dataset(base / "parquet_dataset", format="parquet")
> dataset.files
> dataset.to_table(filter=ds.field('a') >= 7).to_pandas()
> I want to do the equivalent in Ruby.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)