Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/02/22 04:53:35 UTC

[GitHub] [arrow-datafusion] matthewmturner opened a new issue #1870: Add a script for running db-benchmark

matthewmturner opened a new issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870


   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   I've been working on db-benchmark and think it would be valuable to automate the full process so that others can run the benchmarks as well.
   
   **Describe the solution you'd like**
   A new shell script in `benchmarks/db-benchmark` that runs the full db-benchmark process
   
   **Describe alternatives you've considered**
   
   **Additional context**
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [arrow-datafusion] matthewmturner commented on issue #1870: Add a script for running db-benchmark

Posted by GitBox <gi...@apache.org>.
matthewmturner commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1058821000


   Given that db-benchmark uses R scripts to download the source data, I'm wondering whether I should assume the user will take care of that or whether I should wrap this all up in a Docker container.
   
   @houqp do you have a preference?





[GitHub] [arrow-datafusion] matthewmturner commented on issue #1870: Add a script for running full db-benchmark suite

Posted by GitBox <gi...@apache.org>.
matthewmturner commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1062586064


   I hope it makes it easier to play with optimizations!





[GitHub] [arrow-datafusion] matthewmturner commented on issue #1870: Add a script for running db-benchmark

Posted by GitBox <gi...@apache.org>.
matthewmturner commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1059308544


   @Dandandan I'm also interested in your view on this.





[GitHub] [arrow-datafusion] matthewmturner commented on issue #1870: Add a script for running full db-benchmark suite

Posted by GitBox <gi...@apache.org>.
matthewmturner commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1059518563


   I just keep forgetting people.
   
   @realno I know you had expressed interest in benchmarking in the past as well, so I'm curious whether you have a preference on the above.
   
   @alamb I know you're busy, so no rush, but you initially raised #147, so I'm interested in your view as well.





[GitHub] [arrow-datafusion] realno commented on issue #1870: Add a script for running full db-benchmark suite

Posted by GitBox <gi...@apache.org>.
realno commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1059526470


   @matthewmturner I like the idea of using Docker; it should make dependency management easier.





[GitHub] [arrow-datafusion] yjshen closed issue #1870: Add a script for running full db-benchmark suite

Posted by GitBox <gi...@apache.org>.
yjshen closed issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870


   





[GitHub] [arrow-datafusion] matthewmturner edited a comment on issue #1870: Add a script for running db-benchmark

Posted by GitBox <gi...@apache.org>.
matthewmturner edited a comment on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1058821000


   Given that db-benchmark uses R scripts to download the source data, I'm wondering whether I should assume the user will take care of that or whether I should wrap this all up in a Docker container.
   
   My current preference is to wrap it all up in Docker, which I think should make it easier to use, but I don't have strong feelings either way.
   
   @houqp do you have a preference?





[GitHub] [arrow-datafusion] houqp commented on issue #1870: Add a script for running full db-benchmark suite

Posted by GitBox <gi...@apache.org>.
houqp commented on issue #1870:
URL: https://github.com/apache/arrow-datafusion/issues/1870#issuecomment-1062583742


   Agree with @realno, thanks @matthewmturner for taking care of this :)

