You are viewing a plain text version of this content. The canonical link for it is here.

Posted to common-dev@hadoop.apache.org by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2008/09/10 16:26:44 UTC

[jira] Updated: (HADOOP-2536) MapReduce for MySQL

     [ https://issues.apache.org/jira/browse/HADOOP-2536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Enis Soztutar updated HADOOP-2536:
----------------------------------

    Attachment: mapred_jdbc_v3.patch

Since Fredrik said that he cannot continue to work on the patch, I have updated it with some changes. 
The changes include :
# package and class names have DB prefix instead of database. 
# DBInputSplit is now an inner class of DBInputFormat
# instead of the type mapping to convert the data types in the library, a new DBWritable interface is introduced. The classes implement DBWritable to convert from/to db tuples. 
# DBRecordReader emits <LongWritable, T> types where record number is the key and T is of type DBWritable. 
# DBRecordWriter accepts <K, V> where K implements DBWritable(hence written to db) and V is discarded. 
# JDBC uses JDBC batch update. 
# introduced two ways of setting the input query. 
# improved documentation.
# added a sample mapred program reading data from db and writing the results back to db. The program calculates the number of pageviews in a syntactically generated access log. The example program uses HSQLDB as an embedded database. 
# added a test case running the example job in the MiniCluster. 



> MapReduce for MySQL
> -------------------
>
>                 Key: HADOOP-2536
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2536
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: Fredrik Hedberg
>            Assignee: Fredrik Hedberg
>            Priority: Minor
>         Attachments: database-2.diff, database.diff, mapred_jdbc_v3.patch
>
>
> Add support for running MapReduce jobs over data residing in a MySQL table.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.