You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@opennlp.apache.org by Damiano Porta <da...@gmail.com> on 2018/01/30 17:10:11 UTC

NER Features

Hello everybody,

how can we understand what are the most important features during the NER
process? I mean.. when the TokenNameFinder selects a label is it possible
to retrieve the most important features too ?

Thanks
Damiano

Re: NER Features

Posted by Damiano Porta <da...@gmail.com>.

Thank you Daniel! Hmm ok... I will try to guess what features are the most
important :)

2018-01-30 18:19 GMT+01:00 Dan Russ <da...@gmail.com>:

> Damiano,
>    You are treading in some dangerous waters.  You need to open up the
> black box of the model and peek inside.  I open up the NER model and get
> the maxent model inside.  I make the context using the feature generator,
> then I get the context map inside the maxent model to lookup the weights.
> I am grossly oversimplifying the process because otherwise I would be
> writing a treatise on OpenNLP.  I would suggest looking at the find()
> method and align what that method does with my comments on the steps you
> need to take.
>
> Hope it helps...
> Daniel
>
> > On Jan 30, 2018, at 12:10 PM, Damiano Porta <da...@gmail.com>
> wrote:
> >
> > Hello everybody,
> >
> > how can we understand what are the most important features during the NER
> > process? I mean.. when the TokenNameFinder selects a label is it possible
> > to retrieve the most important features too ?
> >
> > Thanks
> > Damiano
>
>

Re: NER Features

Posted by Dan Russ <da...@gmail.com>.

Damiano,
   You are treading in some dangerous waters.  You need to open up the black box of the model and peek inside.  I open up the NER model and get the maxent model inside.  I make the context using the feature generator, then I get the context map inside the maxent model to lookup the weights.  I am grossly oversimplifying the process because otherwise I would be writing a treatise on OpenNLP.  I would suggest looking at the find() method and align what that method does with my comments on the steps you need to take.

Hope it helps...
Daniel

> On Jan 30, 2018, at 12:10 PM, Damiano Porta <da...@gmail.com> wrote:
> 
> Hello everybody,
> 
> how can we understand what are the most important features during the NER
> process? I mean.. when the TokenNameFinder selects a label is it possible
> to retrieve the most important features too ?
> 
> Thanks
> Damiano