You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by Julien Le Dem <ju...@dremio.com> on 2017/01/27 00:00:52 UTC

Next Arrow sync

The next Arrow sync will be Thursday February 2nd 10am PT on google hangout
https://plus.google.com/hangouts/_/dremio.com/arrow
notes will be posted to the list

-- 
Julien

Re: Next Arrow sync

Posted by Julien Le Dem <ju...@dremio.com>.
Ajay, Kirils: sorry, I closed the window before copying your emails. Could
you send them again to me?
Thank you.

On Thu, Feb 2, 2017 at 10:47 AM, Julien Le Dem <ju...@dremio.com> wrote:

> Notes:
>
> Attendance:
> - Ajay: (USA ET). here to listen and learn. Has been using storage formats
> at work.
> - Kirils: (Europe) memory alignment in Arrow. corresponding PR for Netty.
> - Uwe: (Europe) ready to make a 0.2 release in the next 2 weeks
> - Wes: (USA ET) 2sigma in NY. Working on C++/Python components. ready for
> 0.2 as well. Worked with Nong on the streaming formats with integration
> tests. with Uwe on Arrow-Parquet integration. Multi-threaded parquet reads
> etc. thread safe work. Spark-13534: convert from Spark datasets to arrow
> (file based) => spark summit Boston. Great speedups. Need to ship a release
> to get it merged.
> - Julien: (USA PT) Dremio in CA. discussed streaming with Nong, release
> 0.2
>
> - Memory alignment (ARROW-186, PR#98):
>    - Sometimes allocates too much memory.
>    - Netty PR: https://github.com/netty/netty/pull/6293
>    - need to find out when the next netty release comes out.
>    - optional for 0.2 arrow release
> - 0.2 release (ARROW-353):
>    - see blocker on that jira
>    - Spark-13534 depends on an Arrow release
>    - some code cleanup JIRAs
>    - integration test for binary data
>    - other units for timestamps in java.
>    - (optionally) c++: api for slicing arrays with 0 copy: adding an
> offset member in the array
>    - jemalloc for memory
>    - Julien to create a lira for some java api improvements.
>    - goal: close or move over JIRAS by end of next week. Friday 2/10 and
> make the release
>    - Uwe: release manager for 0.2 (will be the first release in pip python
> package manager).
> - 0.3
>    - integration tests for timestamps
>
>
>
>
>
> On Thu, Feb 2, 2017 at 10:00 AM, Julien Le Dem <ju...@dremio.com> wrote:
>
>> The arrow sync is starting now:
>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>
>> On Thu, Feb 2, 2017 at 8:38 AM, Julien Le Dem <ju...@dremio.com> wrote:
>>
>>> (I just sent this to the Parquet list but this applies to Arrow as well)
>>> Everybody interested is welcome.
>>> If there is more than one of you in the same location I'd recommend
>>> sharing the connection.
>>> The sync is every other week, lasts one hour and goes as follows:
>>>  - go around the "table" for everyone to quickly introduce themselves
>>> and state the agenda items they'd want discussed (if any). It could be
>>> letting others know of what they're planning to work on, helping reaching a
>>> consensus on a JIRA, reminding people to review something that's important
>>> to them...
>>>  - once the agenda is built from this first round we go over each item
>>> in order.
>>>  - at the end notes are sent to the list. They usually have a list of
>>> action items (follow up on jira, review PR #x, ...) and
>>> resolved/unresolved discussion points.
>>>
>>> Generally, discussions happen on the mailing list, JIRA or github PRs
>>> and the sync helps getting those to conclusion faster.
>>>
>>> On Thu, Feb 2, 2017 at 8:36 AM, Julien Le Dem <ju...@dremio.com> wrote:
>>>
>>>> Reminder that the next Arrow sync is today at 10am PT (in 1 hour 25
>>>> min) on google hangout:
>>>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>>>
>>>> On Thu, Jan 26, 2017 at 4:00 PM, Julien Le Dem <ju...@dremio.com>
>>>> wrote:
>>>>
>>>>> The next Arrow sync will be Thursday February 2nd 10am PT on google
>>>>> hangout
>>>>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>>>> notes will be posted to the list
>>>>>
>>>>> --
>>>>> Julien
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Julien
>>>>
>>>
>>>
>>>
>>> --
>>> Julien
>>>
>>
>>
>>
>> --
>> Julien
>>
>
>
>
> --
> Julien
>



-- 
Julien

Re: Next Arrow sync

Posted by Julien Le Dem <ju...@dremio.com>.
Notes:

Attendance:
- Ajay: (USA ET). here to listen and learn. Has been using storage formats
at work.
- Kirils: (Europe) memory alignment in Arrow. corresponding PR for Netty.
- Uwe: (Europe) ready to make a 0.2 release in the next 2 weeks
- Wes: (USA ET) 2sigma in NY. Working on C++/Python components. ready for
0.2 as well. Worked with Nong on the streaming formats with integration
tests. with Uwe on Arrow-Parquet integration. Multi-threaded parquet reads
etc. thread safe work. Spark-13534: convert from Spark datasets to arrow
(file based) => spark summit Boston. Great speedups. Need to ship a release
to get it merged.
- Julien: (USA PT) Dremio in CA. discussed streaming with Nong, release 0.2

- Memory alignment (ARROW-186, PR#98):
   - Sometimes allocates too much memory.
   - Netty PR: https://github.com/netty/netty/pull/6293
   - need to find out when the next netty release comes out.
   - optional for 0.2 arrow release
- 0.2 release (ARROW-353):
   - see blocker on that jira
   - Spark-13534 depends on an Arrow release
   - some code cleanup JIRAs
   - integration test for binary data
   - other units for timestamps in java.
   - (optionally) c++: api for slicing arrays with 0 copy: adding an offset
member in the array
   - jemalloc for memory
   - Julien to create a lira for some java api improvements.
   - goal: close or move over JIRAS by end of next week. Friday 2/10 and
make the release
   - Uwe: release manager for 0.2 (will be the first release in pip python
package manager).
- 0.3
   - integration tests for timestamps





On Thu, Feb 2, 2017 at 10:00 AM, Julien Le Dem <ju...@dremio.com> wrote:

> The arrow sync is starting now:
> https://plus.google.com/hangouts/_/dremio.com/arrow
>
> On Thu, Feb 2, 2017 at 8:38 AM, Julien Le Dem <ju...@dremio.com> wrote:
>
>> (I just sent this to the Parquet list but this applies to Arrow as well)
>> Everybody interested is welcome.
>> If there is more than one of you in the same location I'd recommend
>> sharing the connection.
>> The sync is every other week, lasts one hour and goes as follows:
>>  - go around the "table" for everyone to quickly introduce themselves and
>> state the agenda items they'd want discussed (if any). It could be letting
>> others know of what they're planning to work on, helping reaching a
>> consensus on a JIRA, reminding people to review something that's important
>> to them...
>>  - once the agenda is built from this first round we go over each item in
>> order.
>>  - at the end notes are sent to the list. They usually have a list of
>> action items (follow up on jira, review PR #x, ...) and
>> resolved/unresolved discussion points.
>>
>> Generally, discussions happen on the mailing list, JIRA or github PRs and
>> the sync helps getting those to conclusion faster.
>>
>> On Thu, Feb 2, 2017 at 8:36 AM, Julien Le Dem <ju...@dremio.com> wrote:
>>
>>> Reminder that the next Arrow sync is today at 10am PT (in 1 hour 25 min)
>>> on google hangout:
>>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>>
>>> On Thu, Jan 26, 2017 at 4:00 PM, Julien Le Dem <ju...@dremio.com>
>>> wrote:
>>>
>>>> The next Arrow sync will be Thursday February 2nd 10am PT on google
>>>> hangout
>>>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>>> notes will be posted to the list
>>>>
>>>> --
>>>> Julien
>>>>
>>>
>>>
>>>
>>> --
>>> Julien
>>>
>>
>>
>>
>> --
>> Julien
>>
>
>
>
> --
> Julien
>



-- 
Julien

Re: Next Arrow sync

Posted by Julien Le Dem <ju...@dremio.com>.
The arrow sync is starting now:
https://plus.google.com/hangouts/_/dremio.com/arrow

On Thu, Feb 2, 2017 at 8:38 AM, Julien Le Dem <ju...@dremio.com> wrote:

> (I just sent this to the Parquet list but this applies to Arrow as well)
> Everybody interested is welcome.
> If there is more than one of you in the same location I'd recommend
> sharing the connection.
> The sync is every other week, lasts one hour and goes as follows:
>  - go around the "table" for everyone to quickly introduce themselves and
> state the agenda items they'd want discussed (if any). It could be letting
> others know of what they're planning to work on, helping reaching a
> consensus on a JIRA, reminding people to review something that's important
> to them...
>  - once the agenda is built from this first round we go over each item in
> order.
>  - at the end notes are sent to the list. They usually have a list of
> action items (follow up on jira, review PR #x, ...) and
> resolved/unresolved discussion points.
>
> Generally, discussions happen on the mailing list, JIRA or github PRs and
> the sync helps getting those to conclusion faster.
>
> On Thu, Feb 2, 2017 at 8:36 AM, Julien Le Dem <ju...@dremio.com> wrote:
>
>> Reminder that the next Arrow sync is today at 10am PT (in 1 hour 25 min)
>> on google hangout:
>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>
>> On Thu, Jan 26, 2017 at 4:00 PM, Julien Le Dem <ju...@dremio.com> wrote:
>>
>>> The next Arrow sync will be Thursday February 2nd 10am PT on google
>>> hangout
>>> https://plus.google.com/hangouts/_/dremio.com/arrow
>>> notes will be posted to the list
>>>
>>> --
>>> Julien
>>>
>>
>>
>>
>> --
>> Julien
>>
>
>
>
> --
> Julien
>



-- 
Julien

Re: Next Arrow sync

Posted by Julien Le Dem <ju...@dremio.com>.
(I just sent this to the Parquet list but this applies to Arrow as well)
Everybody interested is welcome.
If there is more than one of you in the same location I'd recommend sharing
the connection.
The sync is every other week, lasts one hour and goes as follows:
 - go around the "table" for everyone to quickly introduce themselves and
state the agenda items they'd want discussed (if any). It could be letting
others know of what they're planning to work on, helping reaching a
consensus on a JIRA, reminding people to review something that's important
to them...
 - once the agenda is built from this first round we go over each item in
order.
 - at the end notes are sent to the list. They usually have a list of
action items (follow up on jira, review PR #x, ...) and resolved/unresolved
discussion points.

Generally, discussions happen on the mailing list, JIRA or github PRs and
the sync helps getting those to conclusion faster.

On Thu, Feb 2, 2017 at 8:36 AM, Julien Le Dem <ju...@dremio.com> wrote:

> Reminder that the next Arrow sync is today at 10am PT (in 1 hour 25 min)
> on google hangout:
> https://plus.google.com/hangouts/_/dremio.com/arrow
>
> On Thu, Jan 26, 2017 at 4:00 PM, Julien Le Dem <ju...@dremio.com> wrote:
>
>> The next Arrow sync will be Thursday February 2nd 10am PT on google
>> hangout
>> https://plus.google.com/hangouts/_/dremio.com/arrow
>> notes will be posted to the list
>>
>> --
>> Julien
>>
>
>
>
> --
> Julien
>



-- 
Julien

Re: Next Arrow sync

Posted by Julien Le Dem <ju...@dremio.com>.
Reminder that the next Arrow sync is today at 10am PT (in 1 hour 25 min) on
google hangout:
https://plus.google.com/hangouts/_/dremio.com/arrow

On Thu, Jan 26, 2017 at 4:00 PM, Julien Le Dem <ju...@dremio.com> wrote:

> The next Arrow sync will be Thursday February 2nd 10am PT on google hangout
> https://plus.google.com/hangouts/_/dremio.com/arrow
> notes will be posted to the list
>
> --
> Julien
>



-- 
Julien