Posted to user@storm.apache.org by sowmiya kandasamy <ch...@hotmail.com> on 2014/01/31 11:26:31 UTC

Storm using STAX

    Hi All,
    
I am very new to Storm and the StAX parser. I am using the latest version of Storm, version 0.9.
I want to process a large XML file using the StAX parser in Storm.
The XML file should be the input to the spout, which reads it and emits it to the bolt.
The bolt should then emit a separate XML file for each row of the big XML file.
This is a sample of my XML file:
    
    <row>
      <Id>342a65e5b977e03281b588638d4e4bef59922ff6</Id>
      <contentsize>11</contentsize>
      <Title>Program Leader</Title>
    </row>
    <row>
      <Id>342a65e5b977e03281b588638d4e4bef59922ff7</Id>
      <contentsize>11</contentsize>
      <Title>Program Leader</Title>
    </row>
    <row>
      <Id>342a65e5b977e03281b588638d4e4bef59922ff8</Id>
      <contentsize>11</contentsize>
      <Title>Program Leader</Title>
    </row>

 

The output emitted by the bolt should be one file per row, named after the row's Id:

    342a65e5b977e03281b588638d4e4bef59922ff6.xml

    <row>
      <Id>342a65e5b977e03281b588638d4e4bef59922ff6</Id>
      <contentsize>11</contentsize>
      <Title>Program Leader</Title>
    </row>

    342a65e5b977e03281b588638d4e4bef59922ff7.xml

    <row>
      <Id>342a65e5b977e03281b588638d4e4bef59922ff7</Id>
      <contentsize>11</contentsize>
      <Title>Program Leader</Title>
    </row>
    
I want to run this program in Storm's local mode as well as in cluster mode.
Can you please guide me?
    
    Thanks,
    Sowmiya

 		 	   		  

Re: Storm using STAX

Posted by Richards Peter <hb...@gmail.com>.
Hi,

Please check the following pages:
https://github.com/nathanmarz/storm/wiki/Local-mode
https://github.com/nathanmarz/storm/wiki/Running-topologies-on-a-production-cluster
https://github.com/nathanmarz/storm/wiki/Setting-up-a-Storm-cluster

In short: in local mode you submit the topology through a LocalCluster
object. In cluster mode you submit it with StormSubmitter.
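
For the XML splitting described in the question, the following is a minimal
sketch of how the per-row extraction might look with the JDK's StAX event API.
It is independent of Storm and is not a definitive implementation. The class
name RowSplitter and the method split are hypothetical. The sketch assumes that
the rows are wrapped in a single root element (shown here as <rows>, which is
not in the original sample) and that every element is closed by a matching tag,
because StAX rejects input that is not well-formed.

```java
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.XMLEvent;

// Hypothetical helper: splits a document of <row> elements into one XML
// fragment per row, keyed by the text of the row's <Id> element.
class RowSplitter {

    static Map<String, String> split(Reader input) throws XMLStreamException {
        Map<String, String> rowsById = new LinkedHashMap<>();
        XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(input);
        XMLOutputFactory outputFactory = XMLOutputFactory.newInstance();

        StringWriter buffer = null;   // receives the current row's XML
        XMLEventWriter writer = null; // non-null only while inside a <row>
        StringBuilder idText = null;  // non-null only while inside an <Id>
        String id = null;

        while (reader.hasNext()) {
            XMLEvent event = reader.nextEvent();

            // A new <row> starts: open a fresh buffer for it.
            if (event.isStartElement()
                    && "row".equals(event.asStartElement().getName().getLocalPart())) {
                buffer = new StringWriter();
                writer = outputFactory.createXMLEventWriter(buffer);
                id = null;
            }
            if (writer == null) {
                continue; // outside any <row> (assumed root element, whitespace)
            }

            writer.add(event); // copy the event into the current row's buffer

            if (event.isStartElement()
                    && "Id".equals(event.asStartElement().getName().getLocalPart())) {
                idText = new StringBuilder();
            } else if (event.isCharacters() && idText != null) {
                idText.append(event.asCharacters().getData());
            } else if (event.isEndElement()) {
                String name = event.asEndElement().getName().getLocalPart();
                if ("Id".equals(name)) {
                    id = idText.toString().trim();
                    idText = null;
                } else if ("row".equals(name)) {
                    writer.close(); // flushes the row into the buffer
                    if (id != null) {
                        rowsById.put(id, buffer.toString());
                    }
                    writer = null;
                }
            }
        }
        reader.close();
        return rowsById;
    }

    public static void main(String[] args) throws XMLStreamException {
        String xml = "<rows>"
                + "<row><Id>a1</Id><contentsize>11</contentsize><Title>Program Leader</Title></row>"
                + "<row><Id>b2</Id><contentsize>11</contentsize><Title>Program Leader</Title></row>"
                + "</rows>";
        // Each key is the name a bolt could use for the output file, plus ".xml".
        for (Map.Entry<String, String> row : split(new StringReader(xml)).entrySet()) {
            System.out.println(row.getKey() + ".xml");
            System.out.println(row.getValue());
        }
    }
}
```

In a topology, a bolt could run logic like this and write each map value to a
file named after its key. For a very large input it may be preferable to emit
each row as soon as its closing tag is seen rather than collecting all rows in
a map first.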

Regards,
Richards Peter.