Posted to users@opennlp.apache.org by Girivaraprasad Nambari <gi...@gmail.com> on 2013/08/06 23:12:50 UTC

Text annotation tool used by open-nlp team

Hi Team,

I ran the pre-trained English NER models on my own text, and the results were
not what I expected. So I decided to train NER on my own "training" text to
generate new models, but I am stuck on *annotation tools*.

Could someone tell me which tools the OpenNLP team uses for text annotation?
Any references on text annotation would also be really appreciated.

Thanks for your time and help.

Thank you,
Giri

Re: Text annotation tool used by open-nlp team

Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Thanks Jim!

I will check out this tool this coming weekend.

Thank you,
Giri


On Tue, Aug 6, 2013 at 7:13 PM, Jim - FooBar(); <ji...@gmail.com> wrote:

> *Not* speaking on behalf of the team, but I wrote my own little tool to do
> the annotation whenever a dictionary (or several) is available. It is
> open source, exposes both a Java and a Clojure API, and can also be run
> from the command line. You can find it here:
>
> https://github.com/jimpil/annotator-clj
>
> Proper annotation in a specialised domain, however, is usually done by
> domain experts. Don't expect to get gold-standard data from annotator-clj
> or, in fact, from any dictionary-based annotator. The good thing about
> annotator-clj is that it can produce annotations compatible with OpenNLP,
> Stanford NLP, or NLTK, which is sometimes very useful. It is also very
> fast, as it leverages the fork-join framework available in Java 7.
>
> hope that helps :)
>
> Jim
>
> PS: Now that you've made me think about it again, instead of only allowing
> raw dictionaries I could add support for users to provide their own
> specialised regex Pattern objects... hmm... I'll investigate over the
> weekend.

Re: Text annotation tool used by open-nlp team

Posted by "Jim - FooBar();" <ji...@gmail.com>.
*Not* speaking on behalf of the team, but I wrote my own little tool to
do the annotation whenever a dictionary (or several) is available. It is
open source, exposes both a Java and a Clojure API, and can also be run
from the command line. You can find it here:

https://github.com/jimpil/annotator-clj

Proper annotation in a specialised domain, however, is usually done by
domain experts. Don't expect to get gold-standard data from annotator-clj
or, in fact, from any dictionary-based annotator. The good thing about
annotator-clj is that it can produce annotations compatible with OpenNLP,
Stanford NLP, or NLTK, which is sometimes very useful. It is also very
fast, as it leverages the fork-join framework available in Java 7.
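
For illustration only, here is a minimal Python sketch of what dictionary-based
annotation can look like. It is not the annotator-clj implementation; the
function name `annotate` and its parameters are hypothetical. It assumes the
input is already tokenised with spaces, and it emits the
`<START:type> ... <END>` markup that the OpenNLP name finder uses for training
data (one sentence per line).

```python
def annotate(sentence, dictionary, entity_type="person"):
    """Wrap every dictionary phrase found in `sentence` in OpenNLP-style tags.

    `sentence` is assumed to be whitespace-tokenised; `dictionary` is any
    iterable of phrases. Both names are hypothetical, chosen for this sketch.
    """
    tokens = sentence.split()
    # Try longer entries first so "New York City" wins over "New York".
    entries = sorted(
        (entry.split() for entry in dictionary if entry.split()),
        key=len,
        reverse=True,
    )
    out, i = [], 0
    while i < len(tokens):
        for entry in entries:
            if tokens[i:i + len(entry)] == entry:
                out.append(f"<START:{entity_type}> {' '.join(entry)} <END>")
                i += len(entry)
                break
        else:
            # No dictionary entry starts at this token; copy it unchanged.
            out.append(tokens[i])
            i += 1
    return " ".join(out)


print(annotate("Pierre Vinken , 61 years old , will join the board .",
               {"Pierre Vinken"}))
```

Exact token matching like this is what makes dictionary annotation fast but
brittle: spelling variants and ambiguous names are missed or mislabelled, which
is why the result should not be treated as gold-standard data.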

hope that helps :)

Jim

PS: Now that you've made me think about it again, instead of only
allowing raw dictionaries I could add support for users to provide
their own specialised regex Pattern objects... hmm... I'll investigate over
the weekend.
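
As a rough sketch of the regex idea (again hypothetical, not a feature of
annotator-clj as described here), the Python snippet below tags every match of
a user-supplied compiled pattern. The function name `annotate_with_patterns`
and the date pattern are assumptions made for the example, and it assumes
matches fall on whitespace token boundaries.

```python
import re


def annotate_with_patterns(sentence, patterns):
    """Tag regex matches in `sentence` with OpenNLP-style markup.

    `patterns` is a list of (entity_type, compiled_regex) pairs; the name
    and shape of this argument are hypothetical.
    """
    spans = []
    for entity_type, pattern in patterns:
        for match in pattern.finditer(sentence):
            spans.append((match.start(), match.end(), entity_type))
    # Leftmost match first; among matches starting together, longest first.
    spans.sort(key=lambda s: (s[0], -(s[1] - s[0])))
    out, pos = [], 0
    for start, end, entity_type in spans:
        if start < pos or start == end:
            continue  # skip overlapping or empty matches
        out.append(sentence[pos:start])
        out.append(f"<START:{entity_type}> {sentence[start:end]} <END>")
        pos = end
    out.append(sentence[pos:])
    return "".join(out)


date_pattern = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
print(annotate_with_patterns("The trial started on 2013-08-06 in Boston .",
                             [("date", date_pattern)]))
```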



Re: Text annotation tool used by open-nlp team

Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Lance,

Thanks for the input.


Thank you,
Giri


On Wed, Aug 14, 2013 at 1:15 PM, Lance Norskog <go...@gmail.com> wrote:

> Check out XAMPP. It is a web server package with Apache, MySQL, an FTP
> server, PHP, and other standard web server doodads. This is a great package
> for hosting PHP apps.
>
> http://www.apachefriends.org/en/xampp.html

Re: Text annotation tool used by open-nlp team

Posted by Lance Norskog <go...@gmail.com>.
Check out XAMPP. It is a web server package with Apache, MySQL, an FTP
server, PHP, and other standard web server doodads. This is a great package
for hosting PHP apps.

http://www.apachefriends.org/en/xampp.html

On 08/13/2013 04:33 AM, Jörn Kottmann wrote:
> On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
>> Brat was the one I explored first; it is a little bit complicated to
>> install on Windows. That is the reason why I am looking for other tools.
>
>
> Brat should be installed on a server and can be accessed via a browser
> interface. We now have support for training OpenNLP directly on the brat
> data files, and soon there will be support in the Tagging Server for
> integrating the OpenNLP Name Finder directly into brat to pre-label
> documents.
>
> Jörn


Re: Text annotation tool used by open-nlp team

Posted by Girivaraprasad Nambari <gi...@gmail.com>.
*Brat should be installed on a server*

Yes Jörn, my understanding is that it should be a PHP server. I may end up
doing this because there are no other easy alternatives to Brat.

Thank you,
Giri


On Tue, Aug 13, 2013 at 7:33 AM, Jörn Kottmann <ko...@gmail.com> wrote:

> On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
>
>> Brat was the one I explored first; it is a little bit complicated to
>> install on Windows. That is the reason why I am looking for other tools.
>>
>
>
> Brat should be installed on a server and can be accessed via a browser
> interface. We now have support for training OpenNLP directly on the brat
> data files, and soon there will be support in the Tagging Server for
> integrating the OpenNLP Name Finder directly into brat to pre-label
> documents.
>
> Jörn
>

Re: Text annotation tool used by open-nlp team

Posted by Jörn Kottmann <ko...@gmail.com>.
On 08/07/2013 05:09 AM, Girivaraprasad Nambari wrote:
> Brat was the one I explored first; it is a little bit complicated to install
> on Windows. That is the reason why I am looking for other tools.


Brat should be installed on a server and can be accessed via a browser
interface. We now have support for training OpenNLP directly on the brat
data files, and soon there will be support in the Tagging Server for
integrating the OpenNLP Name Finder directly into brat to pre-label
documents.
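
To make the relationship between the two formats concrete, here is a small
Python sketch that reads brat standoff entity lines (`T1<TAB>Type start
end<TAB>text`) and rewrites a sentence in OpenNLP name finder markup. It does
not show OpenNLP's own brat support mentioned above; the function names are
hypothetical, discontinuous spans are skipped, and the text is assumed to be a
single pre-tokenised sentence.

```python
def parse_brat_entities(ann_text):
    """Return sorted (start, end, type) tuples for contiguous entity lines."""
    spans = []
    for line in ann_text.splitlines():
        if not line.startswith("T"):
            continue  # relations, events, attributes, and notes are ignored
        fields = line.split("\t")
        if len(fields) < 2:
            continue  # malformed line
        info = fields[1]
        if ";" in info:
            continue  # discontinuous span; out of scope for this sketch
        entity_type, start, end = info.split()
        spans.append((int(start), int(end), entity_type))
    return sorted(spans)


def to_opennlp_line(text, spans):
    """Insert <START:type> ... <END> markers around each character span."""
    out, pos = [], 0
    for start, end, entity_type in spans:
        if start < pos:
            continue  # skip overlapping annotations
        out.append(text[pos:start])
        out.append(f"<START:{entity_type.lower()}> {text[start:end]} <END>")
        pos = end
    out.append(text[pos:])
    return "".join(out)


text = "Sony opened an office in Tokyo ."
ann = "T1\tOrganization 0 4\tSony\nT2\tLocation 25 30\tTokyo\n"
print(to_opennlp_line(text, parse_brat_entities(ann)))
```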

Jörn

Re: Text annotation tool used by open-nlp team

Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Lance,

Thank you!

Brat was the one I explored first; it is a little bit complicated to install
on Windows. That is the reason why I am looking for other tools.

Thank you,
Giri


On Tue, Aug 6, 2013 at 9:00 PM, Lance Norskog <go...@gmail.com> wrote:

> Here's one: http://brat.nlplab.org/

Re: Text annotation tool used by open-nlp team

Posted by Jim <ji...@gmail.com>.
I think the main focus of 'brat' is to assist domain experts with the
annotation. It is not a tool that automatically generates annotations,
and its output is probably not in a format OpenNLP understands. It is more
a tool that abstracts away the marking-up of annotations, because domain
experts are not programmers and thus cannot write their own annotation
routines.

Jim


On 07/08/13 02:00, Lance Norskog wrote:
> Here's one: http://brat.nlplab.org/


Re: Text annotation tool used by open-nlp team

Posted by Lance Norskog <go...@gmail.com>.
Here's one: http://brat.nlplab.org/



Re: Text annotation tool used by open-nlp team

Posted by Girivaraprasad Nambari <gi...@gmail.com>.
Hi Team,

I ran the pre-trained English NER models on my own text, and the results were
not what I expected. So I decided to train NER on my own "training" text to
generate new models, but I am stuck on *annotation tools*.

Could someone tell me which tools the OpenNLP team uses for text annotation?
Any references on text annotation would also be really appreciated.

Thanks for your time and help.

Thank you,
Giri


