You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Sachin Shaju <sa...@mstack.com> on 2016/11/04 09:05:38 UTC

Custom elastic indexer in nutch

Hi,

     I was running test runs on nutch elastic indexer.I would like to add
some custom fields and custom typenames(instead of "doc") that can be given
as arguments to the indexing job. I understand *NutchDocument* is the class
which is responsible for setting field names and metadata but couldn't
figure out where nutch create instance of this and sets values. Or Is there
any other way for this. Please help.

Regards,
Sachin Shaju

sachin.s@mstack.com

-- 
 

The information contained in this electronic message and any attachments to 
this message are intended for the exclusive use of the addressee(s) and may 
contain proprietary, confidential or privileged information. If you are not 
the intended recipient, you should not disseminate, distribute or copy this 
e-mail. Please notify the sender immediately and destroy all copies of this 
message and any attachments.

WARNING: Computer viruses can be transmitted via email. The recipient 
should check this email and any attachments for the presence of viruses. 
The company accepts no liability for any damage caused by any virus 
transmitted by this email.

www.mStack.com

Re: Custom elastic indexer in nutch

Posted by Sachin Shaju <sa...@mstack.com>.
One elaborated answer to the same :
http://stackoverflow.com/questions/40418712/adding-custom-fields-and-types-in-nutch-elastic-indexer/40423485#40423485

Regards,
Sachin Shaju

sachin.s@mstack.com

On Fri, Nov 4, 2016 at 2:35 PM, Sachin Shaju <sa...@mstack.com> wrote:

> Hi,
>
>      I was running test runs on nutch elastic indexer.I would like to add
> some custom fields and custom typenames(instead of "doc") that can be given
> as arguments to the indexing job. I understand *NutchDocument* is the
> class which is responsible for setting field names and metadata but
> couldn't figure out where nutch create instance of this and sets values. Or
> Is there any other way for this. Please help.
>
> Regards,
> Sachin Shaju
>
> sachin.s@mstack.com
>

-- 
 

The information contained in this electronic message and any attachments to 
this message are intended for the exclusive use of the addressee(s) and may 
contain proprietary, confidential or privileged information. If you are not 
the intended recipient, you should not disseminate, distribute or copy this 
e-mail. Please notify the sender immediately and destroy all copies of this 
message and any attachments.

WARNING: Computer viruses can be transmitted via email. The recipient 
should check this email and any attachments for the presence of viruses. 
The company accepts no liability for any damage caused by any virus 
transmitted by this email.

www.mStack.com

Re: Custom elastic indexer in nutch

Posted by Sachin Shaju <sa...@mstack.com>.
How to do the same with index.parse.md ? Any useful links or demonstration
please.

Regards,
Sachin Shaju

sachin.s@mstack.com
+919539887554

On Sat, Nov 5, 2016 at 8:49 PM, MrSrivastavaRK . <sr...@gmail.com>
wrote:

> I am facing same problem. Thought of to share some work around,  you can
> add in configuration during crawl request and retrieve same when indexer
> job start.
>
> On Nov 5, 2016 5:00 PM, "Markus Jelsma" <ma...@openindex.io>
> wrote:
>
> > Hi - If you want to index some custom fields, you can either use
> > index.parse.md or create a custom indexing filter plugin.
> > Markus
> >
> > -----Original message-----
> > > From:Sachin Shaju <sa...@mstack.com>
> > > Sent: Friday 4th November 2016 10:05
> > > To: user@nutch.apache.org
> > > Subject: Custom elastic indexer in nutch
> > >
> > > Hi,
> > >
> > >      I was running test runs on nutch elastic indexer.I would like to
> add
> > > some custom fields and custom typenames(instead of "doc") that can be
> > given
> > > as arguments to the indexing job. I understand *NutchDocument* is the
> > class
> > > which is responsible for setting field names and metadata but couldn't
> > > figure out where nutch create instance of this and sets values. Or Is
> > there
> > > any other way for this. Please help.
> > >
> > > Regards,
> > > Sachin Shaju
> > >
> > > sachin.s@mstack.com
> > >
> > > --
> > >
> > >
> > > The information contained in this electronic message and any
> attachments
> > to
> > > this message are intended for the exclusive use of the addressee(s) and
> > may
> > > contain proprietary, confidential or privileged information. If you are
> > not
> > > the intended recipient, you should not disseminate, distribute or copy
> > this
> > > e-mail. Please notify the sender immediately and destroy all copies of
> > this
> > > message and any attachments.
> > >
> > > WARNING: Computer viruses can be transmitted via email. The recipient
> > > should check this email and any attachments for the presence of
> viruses.
> > > The company accepts no liability for any damage caused by any virus
> > > transmitted by this email.
> > >
> > > www.mStack.com
> > >
> >
>

-- 
 

The information contained in this electronic message and any attachments to 
this message are intended for the exclusive use of the addressee(s) and may 
contain proprietary, confidential or privileged information. If you are not 
the intended recipient, you should not disseminate, distribute or copy this 
e-mail. Please notify the sender immediately and destroy all copies of this 
message and any attachments.

WARNING: Computer viruses can be transmitted via email. The recipient 
should check this email and any attachments for the presence of viruses. 
The company accepts no liability for any damage caused by any virus 
transmitted by this email.

www.mStack.com

RE: Custom elastic indexer in nutch

Posted by "MrSrivastavaRK ." <sr...@gmail.com>.
I am facing same problem. Thought of to share some work around,  you can
add in configuration during crawl request and retrieve same when indexer
job start.

On Nov 5, 2016 5:00 PM, "Markus Jelsma" <ma...@openindex.io> wrote:

> Hi - If you want to index some custom fields, you can either use
> index.parse.md or create a custom indexing filter plugin.
> Markus
>
> -----Original message-----
> > From:Sachin Shaju <sa...@mstack.com>
> > Sent: Friday 4th November 2016 10:05
> > To: user@nutch.apache.org
> > Subject: Custom elastic indexer in nutch
> >
> > Hi,
> >
> >      I was running test runs on nutch elastic indexer.I would like to add
> > some custom fields and custom typenames(instead of "doc") that can be
> given
> > as arguments to the indexing job. I understand *NutchDocument* is the
> class
> > which is responsible for setting field names and metadata but couldn't
> > figure out where nutch create instance of this and sets values. Or Is
> there
> > any other way for this. Please help.
> >
> > Regards,
> > Sachin Shaju
> >
> > sachin.s@mstack.com
> >
> > --
> >
> >
> > The information contained in this electronic message and any attachments
> to
> > this message are intended for the exclusive use of the addressee(s) and
> may
> > contain proprietary, confidential or privileged information. If you are
> not
> > the intended recipient, you should not disseminate, distribute or copy
> this
> > e-mail. Please notify the sender immediately and destroy all copies of
> this
> > message and any attachments.
> >
> > WARNING: Computer viruses can be transmitted via email. The recipient
> > should check this email and any attachments for the presence of viruses.
> > The company accepts no liability for any damage caused by any virus
> > transmitted by this email.
> >
> > www.mStack.com
> >
>

RE: Custom elastic indexer in nutch

Posted by Markus Jelsma <ma...@openindex.io>.
Hi - If you want to index some custom fields, you can either use index.parse.md or create a custom indexing filter plugin.
Markus
 
-----Original message-----
> From:Sachin Shaju <sa...@mstack.com>
> Sent: Friday 4th November 2016 10:05
> To: user@nutch.apache.org
> Subject: Custom elastic indexer in nutch
> 
> Hi,
> 
>      I was running test runs on nutch elastic indexer.I would like to add
> some custom fields and custom typenames(instead of "doc") that can be given
> as arguments to the indexing job. I understand *NutchDocument* is the class
> which is responsible for setting field names and metadata but couldn't
> figure out where nutch create instance of this and sets values. Or Is there
> any other way for this. Please help.
> 
> Regards,
> Sachin Shaju
> 
> sachin.s@mstack.com
> 
> -- 
>  
> 
> The information contained in this electronic message and any attachments to 
> this message are intended for the exclusive use of the addressee(s) and may 
> contain proprietary, confidential or privileged information. If you are not 
> the intended recipient, you should not disseminate, distribute or copy this 
> e-mail. Please notify the sender immediately and destroy all copies of this 
> message and any attachments.
> 
> WARNING: Computer viruses can be transmitted via email. The recipient 
> should check this email and any attachments for the presence of viruses. 
> The company accepts no liability for any damage caused by any virus 
> transmitted by this email.
> 
> www.mStack.com
>