Posted to dev@falcon.apache.org by "Venkat Ramachandran (JIRA)" <ji...@apache.org> on 2015/08/04 01:24:05 UTC

[jira] [Commented] (FALCON-36) Ability to ingest data from databases

    [ https://issues.apache.org/jira/browse/FALCON-36?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14652774#comment-14652774 ] 

Venkat Ramachandran commented on FALCON-36:
-------------------------------------------

[~ajayyadava] The datasource type here is so generic that it covers various database types (MySQL, Oracle ...), broker types (Kafka, ActiveMQ ...) and filesystem types. Their functionality obviously varies (filter pushdown, driver types, APIs), so it is better to handle these datasource types separately, as there is no common denominator of capabilities.
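
To illustrate the point, here is a minimal sketch of what handling the datasource types separately could look like. It is not taken from the Falcon patch: every class, field, and function name below is hypothetical, and the only assumption is that each kind of source carries its own settings (a JDBC driver and optional pushed-down filter for databases, a topic for brokers, a path for filesystems).

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class DatabaseSource:
    """Hypothetical database datasource: needs a JDBC driver, may push a filter down."""
    jdbc_url: str
    driver_class: str
    filter: str = ""  # e.g. a WHERE condition evaluated by the database itself


@dataclass(frozen=True)
class BrokerSource:
    """Hypothetical message-broker datasource: addressed by topic, no SQL filter."""
    broker_url: str
    topic: str


@dataclass(frozen=True)
class FileSystemSource:
    """Hypothetical filesystem datasource: only a location to read from."""
    uri: str


# No shared base type with common fields: the three kinds have nothing in common
# beyond "being a source", so each one is dispatched on explicitly.
DataSource = Union[DatabaseSource, BrokerSource, FileSystemSource]


def ingest_plan(source: DataSource) -> str:
    """Return a one-line description of how this kind of source would be read."""
    if isinstance(source, DatabaseSource):
        where = f" where {source.filter}" if source.filter else ""
        return f"jdbc read via {source.driver_class}{where}"
    if isinstance(source, BrokerSource):
        return f"consume topic {source.topic}"
    if isinstance(source, FileSystemSource):
        return f"copy {source.uri}"
    raise TypeError(f"unsupported datasource type: {type(source).__name__}")
```

With a single generic type, fields such as `driver_class` or `topic` would have to be optional on every source, which is the lack of a common denominator described above.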


> Ability to ingest data from databases
> -------------------------------------
>
>                 Key: FALCON-36
>                 URL: https://issues.apache.org/jira/browse/FALCON-36
>             Project: Falcon
>          Issue Type: Improvement
>          Components: acquisition
>    Affects Versions: 0.3
>            Reporter: Venkatesh Seetharam
>            Assignee: Venkat Ramachandran
>         Attachments: FALCON-36.patch, FALCON-36.rebase.patch, FALCON-36.review.patch, Falcon Data Ingestion - Proposal.docx, falcon-36.xsd.patch.1
>
>
> Attempt to address data import from RDBMS into Hadoop and data export from Hadoop into RDBMS. The plan is to use Sqoop 1.x to move data between RDBMS and HDFS in both directions. Hive will not be integrated in the first pass, until Falcon has first-class integration with HCatalog.
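
As a rough sketch of the two directions described in the issue, the snippet below builds Sqoop 1.x command lines for an import (RDBMS to HDFS) and an export (HDFS to RDBMS). The helper names, JDBC URL, table, and paths are hypothetical placeholders; only the `sqoop import` / `sqoop export` tools and the `--connect`, `--table`, `--target-dir`, and `--export-dir` options come from Sqoop 1.x itself. How Falcon actually invokes Sqoop is not shown here.

```python
from typing import List


def sqoop_import_args(jdbc_url: str, table: str, target_dir: str) -> List[str]:
    """Build the argument list for importing one RDBMS table into an HDFS directory."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,       # JDBC connection string of the source database
        "--table", table,            # table to read
        "--target-dir", target_dir,  # HDFS directory to write into
    ]


def sqoop_export_args(jdbc_url: str, table: str, export_dir: str) -> List[str]:
    """Build the argument list for exporting an HDFS directory into one RDBMS table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,       # JDBC connection string of the target database
        "--table", table,            # table to populate
        "--export-dir", export_dir,  # HDFS directory holding the data to export
    ]


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(" ".join(sqoop_import_args(
        "jdbc:mysql://db.example.com/sales", "orders", "/data/raw/orders")))
    print(" ".join(sqoop_export_args(
        "jdbc:mysql://db.example.com/sales", "orders_summary", "/data/out/orders_summary")))
```

The argument lists can be passed to `subprocess.run` on a host where the Sqoop client is installed; credentials are left out of the sketch on purpose.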



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)