Posted to jira@arrow.apache.org by "Kouhei Sutou (Jira)" <ji...@apache.org> on 2021/10/17 20:17:00 UTC
[jira] [Commented] (ARROW-14088) [GLib][Ruby][Dataset] Add support for filter
[ https://issues.apache.org/jira/browse/ARROW-14088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17429764#comment-17429764 ]
Kouhei Sutou commented on ARROW-14088:
--------------------------------------
{noformat}
Arrow::Table.load("parquet_dataset", format: :parquet, filter: ["greater_equal", :a, 7])
{noformat}
{{filter}} uses an S-expression style for now. We'll improve the API in ARROW-14360 so that a filter can be written as, for example, {{filter: ->() \{a >= 7\}}}.
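To show how the S-expression form reads (operator name first, then the field and the value), here is a minimal pure-Ruby sketch that applies such a filter to plain Hash rows. It is only an illustration and does not reflect how Red Arrow evaluates filters internally. The names {{OPERATORS}} and {{match_row?}}, the operator names other than "greater_equal", and the nested "and"/"or" forms are assumptions made for this example.

```ruby
# Hypothetical illustration only: NOT Red Arrow's implementation.
# Reads an S-expression-style filter such as ["greater_equal", :a, 7]
# as [operator, field, value] and applies it to plain Hash rows.

# Operator names mapped to Ruby comparison methods. Only "greater_equal"
# appears in the comment above; the other names are assumed.
OPERATORS = {
  "equal"         => :==,
  "less"          => :<,
  "less_equal"    => :<=,
  "greater"       => :>,
  "greater_equal" => :>=,
}.freeze

# Returns true if the row (a Hash of column name => value) matches the filter.
def match_row?(filter, row)
  name, *args = filter
  case name
  when "and" then args.all? { |sub| match_row?(sub, row) } # assumed form
  when "or"  then args.any? { |sub| match_row?(sub, row) } # assumed form
  else
    method = OPERATORS.fetch(name) do
      raise ArgumentError, "unknown operator: #{name}"
    end
    field, value = args
    row.fetch(field).public_send(method, value)
  end
end

rows = [{ a: 5 }, { a: 7 }, { a: 9 }]
selected = rows.select { |row| match_row?(["greater_equal", :a, 7], row) }
```

Here {{selected}} keeps only the rows whose {{a}} is at least 7, which is the same condition as the {{filter:}} argument in the {{Arrow::Table.load}} call above.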
> [GLib][Ruby][Dataset] Add support for filter
> --------------------------------------------
>
> Key: ARROW-14088
> URL: https://issues.apache.org/jira/browse/ARROW-14088
> Project: Apache Arrow
> Issue Type: Improvement
> Components: GLib, Ruby
> Reporter: Dominic Sisneros
> Assignee: Kouhei Sutou
> Priority: Major
> Labels: pull-request-available
> Fix For: 6.0.0
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> In Python:
> dataset = ds.dataset(base / "parquet_dataset", format="parquet")
> dataset.files
> dataset.to_table(filter=ds.field('a') >= 7).to_pandas()
> I want to do the equivalent in Ruby.