You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Girivaraprasad Nambari <gi...@gmail.com> on 2013/08/06 23:12:50 UTC
Text annotation tool used by open-nlp team
Hi Team,
I ran pre-trained English NER models on text I have and results are not as
I expected. So, I thought to train NER on the "trainining" text to generate
new models, but I am stuck on *Annotation tools*.
Could someone help me on which tools are being used by Open-NLP team for
text annotation? As well as any references around text annotation topics
would be really appreciated.
Thanks for your time and help.
Thank you,
Giri
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Thanks Jim!
I will check this tool coming weekend.
Thank you,
Giri
On Tue, Aug 6, 2013 at 7:13 PM, Jim - FooBar(); <ji...@gmail.com>wrote:
> *not* speaking on behalf of the team but I wrote my own little tool to do
> the annotation whenever there is a dictionary or dictionaries available. It
> is open-source, it exposes a Java and a clojure API or you can even run it
> from the command line. You can find it here:
>
> https://github.com/jimpil/**annotator-clj<https://github.com/jimpil/annotator-clj>
>
> proper annotation in a specialised domain however is usually done by
> domain experts. Don't expect to get gold-data using annotator-clj or in
> fact any dictionary based annotator...the good with annotator-clj is that
> it can produce annotations compatible with openNLP, stanfordNLP or NLTK
> which is great sometimes. It is also very fast as it leverages the
> fork-join framework available in java7.
>
> hope that helps :)
>
> Jim
>
> ps: now that you've made me think about it again, instead of only allowing
> raw dictionaries I could add support for the user to provide his own
> specialised regex Pattern objects...hmmm...I'll investigate over the
> weekend.
>
>
>
> On 06/08/13 22:23, Girivaraprasad Nambari wrote:
>
>> Hi Team,
>>
>> I ran pre-trained English NER models on text I have and results are not as
>> I expected. So, I thought to train NER on the "trainining" text to
>> generate
>> new models, but I am stuck on *Annotation tools*.
>>
>>
>> Could someone help me on which tools are being used by Open-NLP team for
>> text annotation? As well as any references around text annotation topics
>> would be really appreciated.
>>
>> Thanks for your time and help.
>>
>> Thank you,
>> Giri
>>
>>
>> On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
>> girinambari@gmail.com> wrote:
>>
>> Hi Team,
>>>
>>> I ran pre-trained English NER models on text I have and results are not
>>> as
>>> I expected. So, I thought to train NER on the "trainining" text to
>>> generate
>>> new models, but I am stuck on *Annotation tools*.
>>>
>>>
>>> Could someone help me on which tools are being used by Open-NLP team for
>>> text annotation? As well as any references around text annotation topics
>>> would be really appreciated.
>>>
>>> Thanks for your time and help.
>>>
>>> Thank you,
>>> Giri
>>>
>>>
>
Re: Text annotation tool used by open-nlp team
Posted by "Jim - FooBar();" <ji...@gmail.com>.
*not* speaking on behalf of the team but I wrote my own little tool to
do the annotation whenever there is a dictionary or dictionaries
available. It is open-source, it exposes a Java and a clojure API or you
can even run it from the command line. You can find it here:
https://github.com/jimpil/annotator-clj
proper annotation in a specialised domain however is usually done by
domain experts. Don't expect to get gold-data using annotator-clj or in
fact any dictionary based annotator...the good with annotator-clj is
that it can produce annotations compatible with openNLP, stanfordNLP or
NLTK which is great sometimes. It is also very fast as it leverages the
fork-join framework available in java7.
hope that helps :)
Jim
ps: now that you've made me think about it again, instead of only
allowing raw dictionaries I could add support for the user to provide
his own specialised regex Pattern objects...hmmm...I'll investigate over
the weekend.
On 06/08/13 22:23, Girivaraprasad Nambari wrote:
> Hi Team,
>
> I ran pre-trained English NER models on text I have and results are not as
> I expected. So, I thought to train NER on the "trainining" text to generate
> new models, but I am stuck on *Annotation tools*.
>
> Could someone help me on which tools are being used by Open-NLP team for
> text annotation? As well as any references around text annotation topics
> would be really appreciated.
>
> Thanks for your time and help.
>
> Thank you,
> Giri
>
>
> On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
> girinambari@gmail.com> wrote:
>
>> Hi Team,
>>
>> I ran pre-trained English NER models on text I have and results are not as
>> I expected. So, I thought to train NER on the "trainining" text to generate
>> new models, but I am stuck on *Annotation tools*.
>>
>> Could someone help me on which tools are being used by Open-NLP team for
>> text annotation? As well as any references around text annotation topics
>> would be really appreciated.
>>
>> Thanks for your time and help.
>>
>> Thank you,
>> Giri
>>
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Lance,
Thanks for the input.
Thank you,
Giri
On Wed, Aug 14, 2013 at 1:15 PM, Lance Norskog <go...@gmail.com> wrote:
> Check out XAMPP. It is a web server package with Apache, MySql, an ftp
> server, PHP, and other standard web server doodads. This a great package
> for hosting PHP apps.
>
> http://www.apachefriends.org/**en/xampp.html<http://www.apachefriends.org/en/xampp.html>
>
>
> On 08/13/2013 04:33 AM, Jörn Kottmann wrote:
>
>> On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
>>
>>> Brat was the one I explored first, it is little bit complicated to
>>> install
>>> on Windows. That is the reason why I am looking for other tools.
>>>
>>
>>
>> Brat should be installed on a server and can be accessed via a browser
>> interface.
>> We now have support to train OpenNLP on the brat data files directly, and
>> soon
>> there will be support in the Tagging Server to integrate the OpenNLP Name
>> Finder
>> directly into to brat to pre-label documents.
>>
>> Jörn
>>
>
>
Re: Text annotation tool used by open-nlp team
Posted by Lance Norskog <go...@gmail.com>.
Check out XAMPP. It is a web server package with Apache, MySql, an ftp
server, PHP, and other standard web server doodads. This a great package
for hosting PHP apps.
http://www.apachefriends.org/en/xampp.html
On 08/13/2013 04:33 AM, Jörn Kottmann wrote:
> On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
>> Brat was the one I explored first, it is little bit complicated to
>> install
>> on Windows. That is the reason why I am looking for other tools.
>
>
> Brat should be installed on a server and can be accessed via a browser
> interface.
> We now have support to train OpenNLP on the brat data files directly,
> and soon
> there will be support in the Tagging Server to integrate the OpenNLP
> Name Finder
> directly into to brat to pre-label documents.
>
> Jörn
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
*Brat should be installed on a server *
*
*
Yes Jorn, my understanding is it should be PHP server. I may end up doing
this because there are no other easy alternatives comparing with Brat.
Thank you,
Giri
On Tue, Aug 13, 2013 at 7:33 AM, Jörn Kottmann <ko...@gmail.com> wrote:
> On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
>
>> Brat was the one I explored first, it is little bit complicated to install
>> on Windows. That is the reason why I am looking for other tools.
>>
>
>
> Brat should be installed on a server and can be accessed via a browser
> interface.
> We now have support to train OpenNLP on the brat data files directly, and
> soon
> there will be support in the Tagging Server to integrate the OpenNLP Name
> Finder
> directly into to brat to pre-label documents.
>
> Jörn
>
Re: Text annotation tool used by open-nlp team
Posted by Jörn Kottmann <ko...@gmail.com>.
On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
> Brat was the one I explored first, it is little bit complicated to install
> on Windows. That is the reason why I am looking for other tools.
Brat should be installed on a server and can be accessed via a browser
interface.
We now have support to train OpenNLP on the brat data files directly,
and soon
there will be support in the Tagging Server to integrate the OpenNLP
Name Finder
directly into to brat to pre-label documents.
Jörn
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Lance,
Thank you!
Brat was the one I explored first, it is little bit complicated to install
on Windows. That is the reason why I am looking for other tools.
Thank you,
Giri
On Tue, Aug 6, 2013 at 9:00 PM, Lance Norskog <go...@gmail.com> wrote:
> Here's one: http://brat.nlplab.org/
>
>
>
> On 08/06/2013 02:23 PM, Girivaraprasad Nambari wrote:
>
>> Hi Team,
>>
>> I ran pre-trained English NER models on text I have and results are not as
>> I expected. So, I thought to train NER on the "trainining" text to
>> generate
>> new models, but I am stuck on *Annotation tools*.
>>
>>
>> Could someone help me on which tools are being used by Open-NLP team for
>> text annotation? As well as any references around text annotation topics
>> would be really appreciated.
>>
>> Thanks for your time and help.
>>
>> Thank you,
>> Giri
>>
>>
>> On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
>> girinambari@gmail.com> wrote:
>>
>> Hi Team,
>>>
>>> I ran pre-trained English NER models on text I have and results are not
>>> as
>>> I expected. So, I thought to train NER on the "trainining" text to
>>> generate
>>> new models, but I am stuck on *Annotation tools*.
>>>
>>>
>>> Could someone help me on which tools are being used by Open-NLP team for
>>> text annotation? As well as any references around text annotation topics
>>> would be really appreciated.
>>>
>>> Thanks for your time and help.
>>>
>>> Thank you,
>>> Giri
>>>
>>>
>
Re: Text annotation tool used by open-nlp team
Posted by Jim <ji...@gmail.com>.
I think the main focus of 'brat' is to assist domain-experts to the
annotation. It is not a tool that automatically generates annotations
and probably not in the format openNLP understands. It is more a tool to
abstract away marking-up of the annotations because domain experts are
not programmers and thus they cannot write their own annotation routine.
Jim
On 07/08/13 02:00, Lance Norskog wrote:
> Here's one: http://brat.nlplab.org/
>
>
> On 08/06/2013 02:23 PM, Girivaraprasad Nambari wrote:
>> Hi Team,
>>
>> I ran pre-trained English NER models on text I have and results are
>> not as
>> I expected. So, I thought to train NER on the "trainining" text to
>> generate
>> new models, but I am stuck on *Annotation tools*.
>>
>> Could someone help me on which tools are being used by Open-NLP team for
>> text annotation? As well as any references around text annotation topics
>> would be really appreciated.
>>
>> Thanks for your time and help.
>>
>> Thank you,
>> Giri
>>
>>
>> On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
>> girinambari@gmail.com> wrote:
>>
>>> Hi Team,
>>>
>>> I ran pre-trained English NER models on text I have and results are
>>> not as
>>> I expected. So, I thought to train NER on the "trainining" text to
>>> generate
>>> new models, but I am stuck on *Annotation tools*.
>>>
>>> Could someone help me on which tools are being used by Open-NLP team
>>> for
>>> text annotation? As well as any references around text annotation
>>> topics
>>> would be really appreciated.
>>>
>>> Thanks for your time and help.
>>>
>>> Thank you,
>>> Giri
>>>
>
>
Re: Text annotation tool used by open-nlp team
Posted by Lance Norskog <go...@gmail.com>.
Here's one: http://brat.nlplab.org/
On 08/06/2013 02:23 PM, Girivaraprasad Nambari wrote:
> Hi Team,
>
> I ran pre-trained English NER models on text I have and results are not as
> I expected. So, I thought to train NER on the "trainining" text to generate
> new models, but I am stuck on *Annotation tools*.
>
> Could someone help me on which tools are being used by Open-NLP team for
> text annotation? As well as any references around text annotation topics
> would be really appreciated.
>
> Thanks for your time and help.
>
> Thank you,
> Giri
>
>
> On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
> girinambari@gmail.com> wrote:
>
>> Hi Team,
>>
>> I ran pre-trained English NER models on text I have and results are not as
>> I expected. So, I thought to train NER on the "trainining" text to generate
>> new models, but I am stuck on *Annotation tools*.
>>
>> Could someone help me on which tools are being used by Open-NLP team for
>> text annotation? As well as any references around text annotation topics
>> would be really appreciated.
>>
>> Thanks for your time and help.
>>
>> Thank you,
>> Giri
>>
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Team,
I ran pre-trained English NER models on text I have and results are not as
I expected. So, I thought to train NER on the "trainining" text to generate
new models, but I am stuck on *Annotation tools*.
Could someone help me on which tools are being used by Open-NLP team for
text annotation? As well as any references around text annotation topics
would be really appreciated.
Thanks for your time and help.
Thank you,
Giri
On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
girinambari@gmail.com> wrote:
> Hi Team,
>
> I ran pre-trained English NER models on text I have and results are not as
> I expected. So, I thought to train NER on the "trainining" text to generate
> new models, but I am stuck on *Annotation tools*.
>
> Could someone help me on which tools are being used by Open-NLP team for
> text annotation? As well as any references around text annotation topics
> would be really appreciated.
>
> Thanks for your time and help.
>
> Thank you,
> Giri
>
Re: Text annotation tool used by open-nlp team
Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Team,
I ran pre-trained English NER models on text I have and results are not as
I expected. So, I thought to train NER on the "trainining" text to generate
new models, but I am stuck on *Annotation tools*.
Could someone help me on which tools are being used by Open-NLP team for
text annotation? As well as any references around text annotation topics
would be really appreciated.
Thanks for your time and help.
Thank you,
Giri
On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
girinambari@gmail.com> wrote:
> Hi Team,
>
> I ran pre-trained English NER models on text I have and results are not as
> I expected. So, I thought to train NER on the "trainining" text to generate
> new models, but I am stuck on *Annotation tools*.
>
> Could someone help me on which tools are being used by Open-NLP team for
> text annotation? As well as any references around text annotation topics
> would be really appreciated.
>
> Thanks for your time and help.
>
> Thank you,
> Giri
>