You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@asterixdb.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2020/10/22 16:15:00 UTC

[jira] [Commented] (ASTERIXDB-1254) support Parquet HDFS format

    [ https://issues.apache.org/jira/browse/ASTERIXDB-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17219135#comment-17219135 ] 

ASF subversion and git services commented on ASTERIXDB-1254:
------------------------------------------------------------

Commit f984ced94b2aa998c6ae3a895e85bb04e20426c7 in asterixdb's branch refs/heads/master from Wail Alkowaileet
[ https://gitbox.apache.org/repos/asf?p=asterixdb.git;h=f984ced ]

[ASTERIXDB-1254][EXT] Add Apache Parquet as external format

- user model changes: yes
- storage format changes: no
- interface changes: yes

Details:
- Add the support for Apache Parquet format as an external one
- Add new compiler property compiler.expression.pushdowns
  which allows us to pushdown field-accesses down to data-scan.
  Currently, this is only supported for Parquet input-format.
- Allow single-NC configuration to utilize Hadoop formats without
  the need for HDFS.

Changes:
- Add IProjectionInfo to hold the projected field names
  information for data-scan
- Add IFieldValue to extend Parquet Converter to get values' field names.
- Add NoOpDataParserFactory and NoOpDataParser.

Change-Id: I00f3625b332c298ea93b5cab44cb648c89075079
Reviewed-on: https://asterix-gerrit.ics.uci.edu/c/asterixdb/+/6904
Integration-Tests: Jenkins <je...@fulliautomatix.ics.uci.edu>
Tested-by: Jenkins <je...@fulliautomatix.ics.uci.edu>
Reviewed-by: Dmitry Lychagin <dm...@couchbase.com>


> support Parquet HDFS format
> ---------------------------
>
>                 Key: ASTERIXDB-1254
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1254
>             Project: Apache AsterixDB
>          Issue Type: Wish
>          Components: EXT - External data
>            Reporter: Yingyi Bu
>            Assignee: Wail Y. Alkowaileet
>            Priority: Major
>
> Support Parquet as an external data source format in addition to current formats such as text and sequence file.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)