Posted to dev@falcon.apache.org by "Venkat Ramachandran (JIRA)" <ji...@apache.org> on 2015/08/04 01:35:05 UTC

[jira] [Updated] (FALCON-36) Ability to ingest data from databases

     [ https://issues.apache.org/jira/browse/FALCON-36?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Venkat Ramachandran updated FALCON-36:
--------------------------------------
    Attachment: FALCON-36.patch.2

* Attaching a scratch patch with an improved XSD for database types, along with the associated code, for early feedback and discussion.
* This does not yet include unit tests or the remaining code style changes.
* Sqoop needs to be added to the Oozie shared lib (if not already present).
* mysql-connector-java-5.1.31.jar is also needed for Sqoop to work.
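
For readers unfamiliar with Sqoop, the following is a rough illustration of the kind of Sqoop 1.x import invocation that the MySQL connector jar is needed for. It is not taken from the patch: the helper name `build_sqoop_import`, the host, database, table, and HDFS paths are all hypothetical placeholders, and how Falcon and Oozie would actually launch Sqoop is not shown here.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table,
                       target_dir, num_mappers=1):
    """Assemble the argument list for a Sqoop 1.x table import into HDFS.

    Hypothetical helper for illustration only; it is not part of Falcon.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC URL of the source database
        "--username", username,
        "--password-file", password_file,   # file holding the DB password
        "--table", table,                   # source table to import
        "--target-dir", target_dir,         # HDFS directory for the output
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


# All values below are assumed placeholders, not values from the patch.
cmd = build_sqoop_import(
    jdbc_url="jdbc:mysql://db.example.com:3306/sales",
    username="falcon",
    password_file="/user/falcon/.db-password",
    table="orders",
    target_dir="/data/ingest/orders",
)

# Print the command as a single shell-quoted line for inspection.
print(shlex.join(cmd))
```

The sketch only builds and prints the command; running it would require a Sqoop 1.x installation with the MySQL JDBC driver on its classpath.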


> Ability to ingest data from databases
> -------------------------------------
>
>                 Key: FALCON-36
>                 URL: https://issues.apache.org/jira/browse/FALCON-36
>             Project: Falcon
>          Issue Type: Improvement
>          Components: acquisition
>    Affects Versions: 0.3
>            Reporter: Venkatesh Seetharam
>            Assignee: Venkat Ramachandran
>         Attachments: FALCON-36.patch, FALCON-36.patch.2, FALCON-36.rebase.patch, FALCON-36.review.patch, Falcon Data Ingestion - Proposal.docx, falcon-36.xsd.patch.1
>
>
> Attempt to address data import from RDBMS into Hadoop and export of data from Hadoop into RDBMS. The plan is to use Sqoop 1.x to move data between RDBMS and HDFS in both directions. Hive will not be integrated in the first pass; that will wait until Falcon has first-class integration with HCatalog.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)