You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-user@hadoop.apache.org by hm...@tsmc.com on 2010/10/08 02:14:43 UTC

custom Input fornat


Hi there,

My files, each about 40G, content format like:

       <start>
            datadatadatadatadata
            datadatadatadatadata
            datadatadatadatadata
       <end>
       <start>
            datadatadatadatadata
            datadatadatadatadata
            datadatadatadatadata
       <end>
       <start>
            datadatadatadatadata
            datadatadatadatadata
            datadatadatadatadata
       <end>
                       .
                       .
I think I should create a custom input format.
Any suggetion or sample would be appreciated!
Thank you


Fleming Chiu(邱宏明)
Ext: 707-2260
Be Veg, Go Green, Save the Planet!
 --------------------------------------------------------------------------- 
                                                         TSMC PROPERTY       
 This email communication (and any attachments) is proprietary information   
 for the sole use of its                                                     
 intended recipient. Any unauthorized review, use or distribution by anyone  
 other than the intended                                                     
 recipient is strictly prohibited.  If you are not the intended recipient,   
 please notify the sender by                                                 
 replying to this email, and then delete this email and any copies of it     
 immediately. Thank you.                                                     
 ---------------------------------------------------------------------------