You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by Andy Grove <an...@gmail.com> on 2021/02/21 22:21:58 UTC

Looking for up-to-date info for Arrow talk this week

I'm giving a talk about Ballista [1], Rust, and Apache Arrow this week [2]
and I'd like to make sure I'm giving accurate information about other
efforts around building execution engines.

I know there is work happening in C++ but I haven't been following this.
Could someone point me to any documentation/news about this or maybe
provide a brief summary of the current status?

Are there other projects that I should be aware of and mention in this talk?

Thanks,

Andy.

[1] https://github.com/ballista-compute/ballista
[2] https://www.meetup.com/nyhackr/events/276261812/

Re: Looking for up-to-date info for Arrow talk this week

Posted by Andy Grove <an...@gmail.com>.
Thanks, Wes. That is very helpful.

On Sun, Feb 21, 2021 at 3:53 PM Wes McKinney <we...@gmail.com> wrote:

> hi Andy — my team at Ursa Computing is planning to contribute here
> significantly in 2021 and beyond (we're also hiring for people to work
> on this), but we haven't put forward any new documents beyond the ones
> I wrote in the past AFAIK. We've been working on getting our general
> systems affairs in order in C++ (see the recent async, multithreading
> work) to provide a solid foundation for the buildout. The project in
> general has made a lot of progress on basic functions / expressions
> since the big refactor May/June last year. I think it's a good idea to
> have high quality implementations of canonical algorithms (hash
> aggregation, joins, sorting, etc.) so they can be community-maintained
> and -optimized and reused in other projects.
>
> Some extra-ASF data processing projects that are written in C++ and
> using Arrow to represent data:
>
> * Cylon (@ multiple unis) https://github.com/cylondata/cylon
> * Hustle (@ U Wisconsin) https://github.com/UWHustle/hustle
> * NoisePage (@ CMU) https://github.com/cmu-db/noisepage
> * RAPIDS / cuDF (https://github.com/rapidsai/cudf)
>
> There's probably some others that I'm not aware of.
>
> best,
> Wes
>
> On Sun, Feb 21, 2021 at 4:22 PM Andy Grove <an...@gmail.com> wrote:
> >
> > I'm giving a talk about Ballista [1], Rust, and Apache Arrow this week
> [2]
> > and I'd like to make sure I'm giving accurate information about other
> > efforts around building execution engines.
> >
> > I know there is work happening in C++ but I haven't been following this.
> > Could someone point me to any documentation/news about this or maybe
> > provide a brief summary of the current status?
> >
> > Are there other projects that I should be aware of and mention in this
> talk?
> >
> > Thanks,
> >
> > Andy.
> >
> > [1] https://github.com/ballista-compute/ballista
> > [2] https://www.meetup.com/nyhackr/events/276261812/
>

Re: Looking for up-to-date info for Arrow talk this week

Posted by Wes McKinney <we...@gmail.com>.
hi Andy — my team at Ursa Computing is planning to contribute here
significantly in 2021 and beyond (we're also hiring for people to work
on this), but we haven't put forward any new documents beyond the ones
I wrote in the past AFAIK. We've been working on getting our general
systems affairs in order in C++ (see the recent async, multithreading
work) to provide a solid foundation for the buildout. The project in
general has made a lot of progress on basic functions / expressions
since the big refactor May/June last year. I think it's a good idea to
have high quality implementations of canonical algorithms (hash
aggregation, joins, sorting, etc.) so they can be community-maintained
and -optimized and reused in other projects.

Some extra-ASF data processing projects that are written in C++ and
using Arrow to represent data:

* Cylon (@ multiple unis) https://github.com/cylondata/cylon
* Hustle (@ U Wisconsin) https://github.com/UWHustle/hustle
* NoisePage (@ CMU) https://github.com/cmu-db/noisepage
* RAPIDS / cuDF (https://github.com/rapidsai/cudf)

There's probably some others that I'm not aware of.

best,
Wes

On Sun, Feb 21, 2021 at 4:22 PM Andy Grove <an...@gmail.com> wrote:
>
> I'm giving a talk about Ballista [1], Rust, and Apache Arrow this week [2]
> and I'd like to make sure I'm giving accurate information about other
> efforts around building execution engines.
>
> I know there is work happening in C++ but I haven't been following this.
> Could someone point me to any documentation/news about this or maybe
> provide a brief summary of the current status?
>
> Are there other projects that I should be aware of and mention in this talk?
>
> Thanks,
>
> Andy.
>
> [1] https://github.com/ballista-compute/ballista
> [2] https://www.meetup.com/nyhackr/events/276261812/