You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/01/04 11:48:00 UTC

[jira] [Work logged] (BEAM-11526) Add Beam schema support to ParquetIO

     [ https://issues.apache.org/jira/browse/BEAM-11526?focusedWorklogId=530617&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-530617 ]

ASF GitHub Bot logged work on BEAM-11526:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 04/Jan/21 11:47
            Start Date: 04/Jan/21 11:47
    Worklog Time Spent: 10m 
      Work Description: iemejia commented on pull request #13639:
URL: https://github.com/apache/beam/pull/13639#issuecomment-753930666


   @anantdamle other interesting contribution would be to support filter predicates on ParquetIO.
   
   I just asked in the ticket if the person who was working on this is still interested, otherwise maybe you can take it (if you feel like it of course).
   https://issues.apache.org/jira/browse/BEAM-7925
   
   I am not 100% sure on the Parquet APIs but it looks like this one is the one.
   https://www.javadoc.io/doc/org.apache.parquet/parquet-column/1.8.1/org/apache/parquet/filter/ColumnRecordFilter.html


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 530617)
    Time Spent: 1h 10m  (was: 1h)

> Add Beam schema support to ParquetIO
> ------------------------------------
>
>                 Key: BEAM-11526
>                 URL: https://issues.apache.org/jira/browse/BEAM-11526
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-java-parquet
>            Reporter: Ismaël Mejía
>            Assignee: Anant Damle
>            Priority: P2
>             Fix For: 2.28.0
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> We can add support to use Beam Schema-based collections on ParquetIO. The idea is to follow the approach done on AvroIO
> For ref.
> [https://github.com/apache/beam/commit/8bdbb3361701240a3d909c9dad24f3f31af18eac]
> [https://github.com/apache/beam/pull/9130]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)