Posted to issues@arrow.apache.org by "JerAguilon (via GitHub)" <gi...@apache.org> on 2024/02/16 08:20:49 UTC

[I] [Pyarrow] Support non-scalar filtering [arrow]

JerAguilon opened a new issue, #40099:
URL: https://github.com/apache/arrow/issues/40099

   ### Describe the enhancement requested
   
   It would be fantastic to be able to filter a dataset with a non-scalar expression, i.e. one that includes an aggregation. As a dummy example, this would return all the rows in which column "foo" is greater than or equal to its last value:
   
   ```
   import pyarrow as pa
   import pyarrow.compute as pc
   
   table = pa.Table.from_arrays([pa.array([1, 5, 3, 4])], names=["foo"])
   # Compare every value of "foo" against an aggregate (the last value) of the same column.
   expr = pc.field('foo') >= pc.last(pc.field('foo'))
   result = table.filter(expr)
   
   # expected:  pa.Table.from_arrays([pa.array([5, 4])], names=["foo"])
   ```
   
   Today, you'd get something like:
   
   ```
   ArrowInvalid: ExecuteScalarExpression cannot Execute non-scalar expression (foo == last(foo))
   ```
   
   Is there a reason we can only execute scalar expressions? Is there a way today to interweave aggregations in one filter query?
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Pyarrow] Support scalar aggregate filtering [arrow]

Posted by "JerAguilon (via GitHub)" <gi...@apache.org>.
JerAguilon commented on issue #40099:
URL: https://github.com/apache/arrow/issues/40099#issuecomment-1949128116

   I realize this is actually a C++ request for `ExecuteScalarExpression`. Changing the tags and opening a PR.




Re: [I] [Pyarrow] Support scalar aggregate filtering [arrow]

Posted by "JerAguilon (via GitHub)" <gi...@apache.org>.
JerAguilon closed issue #40099: [Pyarrow] Support scalar aggregate filtering
URL: https://github.com/apache/arrow/issues/40099

