You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@iceberg.apache.org by Sam Redai <sa...@tabular.io> on 2022/07/28 16:30:39 UTC

Meeting Minutes from 07/27 Iceberg Sync

Hi Iceberg Community,

Below are the minutes and recording from our Iceberg Community Sync on *July
27th, 9am-10am PT*. Please remember that anyone is welcome to join the
discussion so feel free to share the Iceberg-Sync
<https://groups.google.com/g/iceberg-sync> google group with those seeking
an invite. The notes and the agenda are posted in the live doc
<https://docs.google.com/document/d/1YuGhUdukLP5gGiqCbk0A5_Wifqe2CZWgOd3TbhY3UQg/edit?usp=drive_web>
that's
also attached to the meeting invitation and it's a good place to add items
as you see fit so we can discuss them in the next community sync.

Meeting Recording
<https://drive.google.com/file/d/1G9Pp4AqmnF4tM7Pp2PJe2mOC2hQEUqvI/view?usp=sharing>
⭕

   -

   Highlights
   -

      0.14.0 is released
      -

      FLIP-27 Flink reader is in (Thanks, Steven!)
      -

      Added orphan file cleanup prefix mismatch modes (Thanks Karuppaya!)
      -

      Added Python expression classes and binding (Thanks, Nick and Sam!)
      -

      Rewrite data files procedure now supports zorder (Thanks, Ajantha!)
      -

   Releases
   -

      1.0.0
      -

         Branch from 0.14.0
         -

         Apply spotless <https://github.com/apache/iceberg/pull/5312>
         -

         Remove deprecated APIs
         -

         Update docs
         -

   Discussion
   -

      Spotless update
      -

      Views progress / status
      -

         API change PR: #4925 <https://github.com/apache/iceberg/pull/4925>
         -

      SQL Syntax for Branching and Tagging
      -

         Uncharted territory – nothing in the ANSI SQL spec
         -

         Proposal for syntax
         <https://docs.google.com/document/d/1tbATFPrKF3vNlzkgZQdaW8CAJmbjvryfrlg6C2Ci_aA/edit#heading=h.v8gsu2fe19q2>
         -

      Kafka Connect Sink Proposal
      <https://docs.google.com/document/d/1LQQXbi2gJ2CE5KltBiVj8WSDOVtO_wiVnGJeACIb5cU/edit?usp=sharing>
      -

      Roadmap <https://iceberg.apache.org/roadmap/> update
      -

         Add: FunctionCatalog and storage-partitioned joins in Spark
         -

         CDC reads
         -

         Partition stats (and indexes?)
         -

            Physical/file sequence number
            -

         REST APIs for planning scans and committing files
         -

            Partial commit to check a conflict (commit as much as possible)
            -

         Improving conflict detection – failing compaction due to position
         deletes
         -

            Concurrent MERGE and compaction
            -

            Pessimistic locking
            -

         Maintaining delete files
         -

            Rewriting to position deletes
            -

            Delete file compaction
            -

         Multi-table transactions
         -

         An updated roadmap proposal will be created and shared for review


Thank you to everyone who joined and contributed!