Posted to dev@iceberg.apache.org by Ryan Blue <rb...@netflix.com.INVALID> on 2019/09/04 23:55:06 UTC

[DISCUSS] September report

Hi everyone,

Here's a draft of this month's report to the IPMC. Please reply with
comments if you'd like to add anything!

rb

## Iceberg

Iceberg is a table format for large, slow-moving tabular data.

Iceberg has been incubating since 2018-11-16.

### Three most important unfinished issues to address before graduating:

1. Make the first Apache release. (https://github.com/apache/incubator-iceberg/milestone/1)
2. Grow the Iceberg community
3. Add more committers and PPMC members

### Are there any issues that the IPMC or ASF Board need to be aware of?

No issues.

### How has the community developed since the last report?

The community continues to grow steadily. In the last month:
* 59 pull requests have been merged
* 17 people contributed the merged PRs
* 18 issues have been closed, 22 issues were opened

For comparison, the last report had 74 pull requests merged over 3 months.

### How has the project developed since the last report?

* License documentation has been completed for the Java project, unblocking
the first release
* Added more documentation to iceberg.apache.org
* Started vectorized read branch with significantly better performance
* Added metadata tables
* Added configuration to control statistics and truncate long values
* Improved Hive Metastore integration
* A working Python read path has been submitted in PRs

### How would you assess the podling's maturity?

  - [ ] Initial setup
  - [x] Working towards first release
  - [x] Community building
  - [x] Nearing graduation
  - [ ] Other:

### Date of last release:

* No release yet

### When were the last committers or PPMC members elected?

* Anton Okolnychyi was added 30 August 2019

### Have your mentors been helpful and responsive?

Yes

-- 
Ryan Blue
Software Engineer
Netflix

Re: [DISCUSS] September report

Posted by Owen O'Malley <ow...@gmail.com>.
On Wed, Sep 4, 2019 at 4:55 PM Ryan Blue <rb...@netflix.com.invalid> wrote:

> Hi everyone,
>
> Here's a draft of this month's report to the IPMC. Please reply with
> comments if you'd like to add anything!
>
> rb
>
> ## Iceberg
>
> Iceberg is a table format for large, slow-moving tabular data.
>
> Iceberg has been incubating since 2018-11-16.
>
> ### Three most important unfinished issues to address before graduating:
>
> 1. Make the first Apache release. (https://github.com/apache/incubator-iceberg/milestone/1)
> 2. Grow the Iceberg community
> 3. Add more committers and PPMC members
>
> ### Are there any issues that the IPMC or ASF Board need to be aware of?
>
> No issues.
>
> ### How has the community developed since the last report?
>
> The community continues to grow steadily. In the last month:
> * 59 pull requests have been merged
> * 17 people contributed the merged PRs
> * 18 issues have been closed, 22 issues were opened
>

Presentations were given at:
* Berlin Buzzwords (June 2019)
* ApacheCon NA (Sep 2019)

Iceberg is being used in production at Netflix on huge tables, up to 25
petabytes.


> For comparison, the last report had 74 pull requests merged over 3 months.
>
> ### How has the project developed since the last report?
>
> * License documentation has been completed for the Java project,
> unblocking the first release
> * Added more documentation to iceberg.apache.org
> * Started vectorized read branch with significantly better performance
> * Added metadata tables
> * Added configuration to control statistics and truncate long values
> * Improved Hive Metastore integration
> * A working Python read path has been submitted in PRs
>
> ### How would you assess the podling's maturity?
>
>   - [ ] Initial setup
>   - [x] Working towards first release
>   - [x] Community building
>   - [x] Nearing graduation
>   - [ ] Other:
>
> ### Date of last release:
>
> * No release yet
>
> ### When were the last committers or PPMC members elected?
>
> * Anton Okolnychyi was added 30 August 2019
>
> ### Have your mentors been helpful and responsive?
>
> Yes
>
+1 to the report.

-- 
> Ryan Blue
> Software Engineer
> Netflix
>

Re: [DISCUSS] September report

Posted by Daniel Weeks <dw...@netflix.com.INVALID>.
It would be good to include that PrestoSQL has a native Iceberg connector
<https://github.com/prestosql/presto/pull/1067>, which is significant.



On Fri, Sep 6, 2019 at 4:13 PM Carl Steinbach <cw...@apache.org> wrote:

> +1 to the report
> +1 to graduation for the same set of reasons mentioned by Owen.
>
> - Carl
>
> On Fri, Sep 6, 2019 at 12:04 PM Owen O'Malley <ow...@gmail.com>
> wrote:
>
>> On Fri, Sep 6, 2019 at 12:19 AM Justin Mclean <jm...@apache.org> wrote:
>>
>>> So why does the project think it's ready to graduate? Mentors, do you
>>> think the project is ready to graduate?
>>>
>>
>> It has to make a release or two, but I agree with Ryan that it is
>> approaching graduation. The project entered Apache with five Apache members
>> from different companies. It has grown the community to include a few more
>> companies. I think it is doing great.
>>
>> .. Owen
>>
>>

Re: [DISCUSS] September report

Posted by Carl Steinbach <cw...@apache.org>.
+1 to the report
+1 to graduation for the same set of reasons mentioned by Owen.

- Carl

On Fri, Sep 6, 2019 at 12:04 PM Owen O'Malley <ow...@gmail.com>
wrote:

> On Fri, Sep 6, 2019 at 12:19 AM Justin Mclean <jm...@apache.org> wrote:
>
>> So why does the project think it's ready to graduate? Mentors, do you
>> think the project is ready to graduate?
>>
>
> It has to make a release or two, but I agree with Ryan that it is approaching
> graduation. The project entered Apache with five Apache members from
> different companies. It has grown the community to include a few more
> companies. I think it is doing great.
>
> .. Owen
>
>

Re: [DISCUSS] September report

Posted by Owen O'Malley <ow...@gmail.com>.
On Fri, Sep 6, 2019 at 12:19 AM Justin Mclean <jm...@apache.org> wrote:

> So why does the project think it's ready to graduate? Mentors, do you think
> the project is ready to graduate?
>

It has to make a release or two, but I agree with Ryan that it is approaching
graduation. The project entered Apache with five Apache members from
different companies. It has grown the community to include a few more
companies. I think it is doing great.

.. Owen

Re: [DISCUSS] September report

Posted by Ryan Blue <rb...@netflix.com.INVALID>.
Hi Justin,

I checked the box that Iceberg is "nearing graduation", not that it is
ready to graduate. I think the numbers show that we've had good community
growth and we have added a PPMC member. Adding more and actually getting a
release out are the points that I've listed as the unfinished steps before
graduation.

As an ASF mentor, I think that this community is doing well adopting the
Apache Way, which is why I think it is nearing graduation. To call out a
couple: discussion is open and happens on the dev list or issue tracker;
and artifacts have license documentation that complies with ASF policy (the
major hurdle to the first release).

rb

On Fri, Sep 6, 2019 at 12:19 AM Justin Mclean <jm...@apache.org> wrote:

> Hi,
>
> I have a couple of concerns about the report. First off, I don't think any
> project would be ready to graduate if they have not made a release or added
> committers/PPMC members. And it would be great to see something other
> than meaningless stats on community growth in the report. Those numbers may
> mean something to people in the project, but they don't mean a lot outside
> of it. Try to write the report as if it was going to be read by someone
> who knew very little about your project and
>
> So why does the project think it's ready to graduate? Mentors, do you think
> the project is ready to graduate?
>
> Thanks,
> Justin
>


-- 
Ryan Blue
Software Engineer
Netflix

Re: [DISCUSS] September report

Posted by Justin Mclean <jm...@apache.org>.
Hi,

I have a couple of concerns about the report. First off, I don't think any project would be ready to graduate if they have not made a release or added committers/PPMC members. And it would be great to see something other than meaningless stats on community growth in the report. Those numbers may mean something to people in the project, but they don't mean a lot outside of it. Try to write the report as if it was going to be read by someone who knew very little about your project and

So why does the project think it's ready to graduate? Mentors, do you think the project is ready to graduate?

Thanks,
Justin