Posted to dev@iceberg.apache.org by Sam Redai <sa...@tabular.io> on 2022/07/28 16:30:39 UTC
Meeting Minutes from 07/27 Iceberg Sync
Hi Iceberg Community,
Below are the minutes and recording from our Iceberg Community Sync on *July
27th, 9am-10am PT*. Please remember that anyone is welcome to join the
discussion, so feel free to share the Iceberg-Sync
<https://groups.google.com/g/iceberg-sync> Google group with those seeking
an invite. The notes and the agenda are posted in the live doc
<https://docs.google.com/document/d/1YuGhUdukLP5gGiqCbk0A5_Wifqe2CZWgOd3TbhY3UQg/edit?usp=drive_web>,
which is also attached to the meeting invitation. It's a good place to add items
as you see fit so we can discuss them in the next community sync.
Meeting Recording
<https://drive.google.com/file/d/1G9Pp4AqmnF4tM7Pp2PJe2mOC2hQEUqvI/view?usp=sharing>
- Highlights
  - 0.14.0 is released
  - FLIP-27 Flink reader is in (Thanks, Steven!)
  - Added orphan file cleanup prefix mismatch modes (Thanks, Karuppaya!)
  - Added Python expression classes and binding (Thanks, Nick and Sam!)
  - Rewrite data files procedure now supports zorder (Thanks, Ajantha!)
- Releases
  - 1.0.0
    - Branch from 0.14.0
    - Apply spotless <https://github.com/apache/iceberg/pull/5312>
    - Remove deprecated APIs
    - Update docs
- Discussion
  - Spotless update
  - Views progress / status
    - API change PR: #4925 <https://github.com/apache/iceberg/pull/4925>
  - SQL Syntax for Branching and Tagging
    - Uncharted territory – nothing in the ANSI SQL spec
    - Proposal for syntax
      <https://docs.google.com/document/d/1tbATFPrKF3vNlzkgZQdaW8CAJmbjvryfrlg6C2Ci_aA/edit#heading=h.v8gsu2fe19q2>
  - Kafka Connect Sink Proposal
    <https://docs.google.com/document/d/1LQQXbi2gJ2CE5KltBiVj8WSDOVtO_wiVnGJeACIb5cU/edit?usp=sharing>
  - Roadmap <https://iceberg.apache.org/roadmap/> update
    - Add: FunctionCatalog and storage-partitioned joins in Spark
    - CDC reads
    - Partition stats (and indexes?)
    - Physical/file sequence number
    - REST APIs for planning scans and committing files
    - Partial commit to check a conflict (commit as much as possible)
    - Improving conflict detection – failing compaction due to position deletes
    - Concurrent MERGE and compaction
    - Pessimistic locking
    - Maintaining delete files
    - Rewriting to position deletes
    - Delete file compaction
    - Multi-table transactions
    - An updated roadmap proposal will be created and shared for review
Thank you to everyone who joined and contributed!