Posted to issues@flink.apache.org by "Fabian Hueske (JIRA)" <ji...@apache.org> on 2015/02/04 23:33:35 UTC

[jira] [Updated] (FLINK-1466) Add InputFormat to read HCatalog tables

     [ https://issues.apache.org/jira/browse/FLINK-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Fabian Hueske updated FLINK-1466:
---------------------------------
    Description: 
HCatalog is a metadata repository and InputFormat to make Hive tables accessible to other frameworks such as Pig.

Adding support for HCatalog would give access to Hive-managed data.

  was:
Hive features a table catalog called MetaStore to store information about tables. This information includes:

- Table schema (field names, types)
- Storage location in HDFS
- HadoopInputFormat to read the data
- SerDe information to extract fields
- Information about partitions

This JIRA proposes to add an InputFormat (IF) to read Hive tables. For a given database and table name, the IF should access the MetaStore to get all information needed to read the data. Further parameters are field names (selection) and partitions (limited predicates).

For a first version, the IF can be limited to flat tables (Hive supports nested data) with primitive field types.

        Summary: Add InputFormat to read HCatalog tables  (was: Add InputFormat to read from (Hive) tables stored in MetaStore)

> Add InputFormat to read HCatalog tables
> ---------------------------------------
>
>                 Key: FLINK-1466
>                 URL: https://issues.apache.org/jira/browse/FLINK-1466
>             Project: Flink
>          Issue Type: New Feature
>          Components: Java API, Scala API
>            Reporter: Fabian Hueske
>            Assignee: Fabian Hueske
>            Priority: Minor
>
> HCatalog is a metadata repository and InputFormat to make Hive tables accessible to other frameworks such as Pig.
> Adding support for HCatalog would give access to Hive-managed data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)