You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "westonpace (via GitHub)" <gi...@apache.org> on 2023/02/25 02:11:39 UTC

[GitHub] [arrow] westonpace opened a new pull request, #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

westonpace opened a new pull request, #34348:
URL: https://github.com/apache/arrow/pull/34348

   This is just an starting point.  There are a number of different things that will hopefully be eventually added (support for more formats, different scanning scenarios, running multiple scans in parallel).
   
   It's currently exposed as a unit test and can maybe be a bit slow (takes ~30 seconds with one thread on my machine).  Eventually it might make more sense to change this to a nightly test that runs for a longer duration.  We should also probably have a version that runs on release mode.
   
   I also fixed an issue encountered with partition guarantee normalization.  I could separate that into a separate PR if desired.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] amol- commented on pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

Posted by "amol- (via GitHub)" <gi...@apache.org>.
amol- commented on PR #34348:
URL: https://github.com/apache/arrow/pull/34348#issuecomment-1490667674

   Closing because it has been untouched for a while, in case it's still relevant feel free to reopen and move it forward 👍


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #34348:
URL: https://github.com/apache/arrow/pull/34348#issuecomment-1444926000

   :warning: GitHub issue #34347 **has been automatically assigned in GitHub** to PR creator.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #34348:
URL: https://github.com/apache/arrow/pull/34348#issuecomment-1444925948

   * Closes: #34347


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] westonpace commented on pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

Posted by "westonpace (via GitHub)" <gi...@apache.org>.
westonpace commented on PR #34348:
URL: https://github.com/apache/arrow/pull/34348#issuecomment-1454231356

   These test failures are legimate.  I hacked at this a little today and it quickly started to get out of control :laughing: 
   
   The basic issue is that I am transferring to a model where all expressions (projections, filters, guarantees) get normalized as we start to work with them.  This allows us to cleanly handle duplicate column names and similar issues.  Unfortunately, it's exposing a number of places in the current datasets code where we are explicitly relying on named refs and I am needing to work around that.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] amol- closed pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner

Posted by "amol- (via GitHub)" <gi...@apache.org>.
amol- closed pull request #34348: GH-34347: [C++] Add an initial fuzz test for the scanner
URL: https://github.com/apache/arrow/pull/34348


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org