Posted to dev@opennlp.apache.org by Rodrigo Agerri <ra...@apache.org> on 2015/09/29 14:34:38 UTC

mallet addon

Hello,

I have seen that there is a mallet addon here

https://github.com/kottmann/opennlp-mallet-addon

Is this currently being used or integrated in OpenNLP? I have not seen it
with the rest of the addons.

Cheers,

Rodrigo

Re: mallet addon

Posted by "Russ, Daniel (NIH/CIT) [E]" <dr...@mail.nih.gov>.
From https://opennlp.apache.org/mail-lists.html:
To unsubscribe, send an e-mail to dev-unsubscribe@opennlp.apache.org


Dan


On Oct 20, 2015, at 10:43 AM, Eldad Yamin <el...@gmail.com> wrote:

How can I unsubscribe?

Re: mallet addon

Posted by Eldad Yamin <el...@gmail.com>.
How can I unsubscribe?
On Sep 29, 2015 15:35, "Rodrigo Agerri" <ra...@apache.org> wrote:

> Hello,
>
> I have seen that there is a mallet addon here
>
> https://github.com/kottmann/opennlp-mallet-addon
>
> Is this currently being used or integrated in OpenNLP? I have not seen it
> with the rest of the addons.
>
> Cheers,
>
> Rodrigo
>

Re: mallet addon

Posted by Rodrigo Agerri <ra...@apache.org>.
I think moving it to the sandbox would be very nice. I can also help if
needed. I will check the mallet license.

Thanks

Rodrigo
On 29 Sep 2015 15:41, "Joern Kottmann" <ko...@gmail.com> wrote:

> Hello,
>
> This doesn't work with the 1.6.0 release. I built it to test one of
> the first drafts of the machine learning rewrite we did for 1.6.0.
> There have been a few changes since then.
> Anyway, if you need it, I am happy to fix it up. We can also move
> it to the sandbox; releasing it at Apache might be more difficult, since
> mallet pulls in dependencies with incompatible licenses. But maybe that
> has changed; we can check.
>
> What do you think?
>
> Jörn

Re: mallet addon

Posted by Joern Kottmann <ko...@gmail.com>.
Hello,

I updated the code and afterwards spent some time evaluating it again. The
maxent training is very close to our maxent classifier. I also checked the
training code again and it looks good to me, but it would be nice if you
can review it.

There are a couple of other classifiers in mallet; it should be trivial to
expose them all to OpenNLP.

Jörn

On Tue, Oct 20, 2015 at 9:12 AM, Rodrigo Agerri <ro...@ehu.eus>
wrote:

> Hello,
>
> Thanks. I thought I had an idea why CRF would not obtain good results
> with the OpenNLP default features, e.g.,
>
> http://lingpipe-blog.com/2006/11/22/why-do-you-hate-crfs/
>
> but if the results are also worse with Maxent, that is intriguing. I will
> look at the Mallet implementation to see if I can find something.
>
> R

Re: mallet addon

Posted by Rodrigo Agerri <ro...@ehu.eus>.
Hello,

Thanks. I thought I had an idea why CRF would not obtain good results
with the OpenNLP default features, e.g.,

http://lingpipe-blog.com/2006/11/22/why-do-you-hate-crfs/

but if the results are also worse with Maxent, that is intriguing. I will
look at the Mallet implementation to see if I can find something.

R



On Mon, Oct 12, 2015 at 4:07 PM, Joern Kottmann <ko...@gmail.com> wrote:
> Hello,
>
> I fixed up the code a bit. The performance is not really good. Do you have
> any idea why that could be?
>
> Neither the maxent nor the CRF classifier gets good evaluation numbers on NER.
>
> I will push the changes and then you can experiment with it too.
>
> Jörn

Re: mallet addon

Posted by Joern Kottmann <ko...@gmail.com>.
Hello,

I fixed up the code a bit. The performance is not really good. Do you have
any idea why that could be?

Neither the maxent nor the CRF classifier gets good evaluation numbers on NER.

I will push the changes and then you can experiment with it too.

Jörn



Re: mallet addon

Posted by Rodrigo Agerri <ra...@apache.org>.
Hi,

On Tue, Sep 29, 2015 at 3:41 PM, Joern Kottmann <ko...@gmail.com> wrote:
> We can also move
> it to the sandbox; releasing it at Apache might be more difficult, since
> mallet pulls in dependencies with incompatible licenses. But maybe that
> has changed; we can check.

Mallet is released under the Common Public License 1.0

http://opensource.org/licenses/cpl1.0.php

but, as you have mentioned, it pulls in several dependencies that are
LGPL. These are the dependencies:

  <dependency>
    <groupId>org.beanshell</groupId>
    <artifactId>bsh</artifactId>
    <version>2.0b4</version>
  </dependency>

This version is LGPL; however, later versions are under the Apache License 2.0

https://github.com/beanshell/beanshell

  <dependency>
    <groupId>jgrapht</groupId>
    <artifactId>jgrapht</artifactId>
    <version>0.6.0</version>
  </dependency>

That version was also LGPL, but the project has since been dual-licensed
under LGPL and EPL 1.0

https://github.com/jgrapht/jgrapht/wiki/Relicensing

which could also be included in Apache License 2.0 projects

http://www.apache.org/legal/resolved.html

  <dependency>
    <groupId>net.sf.jwordnet</groupId>
    <artifactId>jwnl</artifactId>
    <version>1.4_rc3</version>
  </dependency>

BSD license, but this library has already been discussed here.

  <dependency>
    <groupId>net.sf.trove4j</groupId>
    <artifactId>trove4j</artifactId>
    <version>2.0.2</version>
  </dependency>

LGPL-ed.

  <dependency>
    <groupId>com.googlecode.matrix-toolkits-java</groupId>
    <artifactId>mtj</artifactId>
    <version>0.9.14</version>
  </dependency>

Also LGPL.

Rodrigo
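
The license survey above can be summarized as a small check. This is only a
sketch under stated assumptions: the class name, the license labels
("LGPL", "BSD", "CPL-1.0", "EPL-1.0", "Apache-2.0") and the helper methods are
made up for illustration, and the A/B/X grouping reflects my reading of the
ASF third-party license policy linked in the message, so please verify it
against that page before relying on it.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class MalletLicenseCheck {

    // Assumed ASF third-party categories:
    // A = may be included, B = may be included in binary form with
    // conditions, X = may not be included in an Apache release.
    enum Category { A, B, X }

    // Maps an illustrative license label to its assumed ASF category.
    static Category categorize(String license) {
        switch (license) {
            case "BSD":
            case "Apache-2.0":
                return Category.A;
            case "CPL-1.0":
            case "EPL-1.0":
                return Category.B;
            case "LGPL":
                return Category.X;
            default:
                throw new IllegalArgumentException("Unknown license label: " + license);
        }
    }

    // The dependency versions listed in the message, with the license
    // reported there for each pinned version.
    static Map<String, String> pinnedDependencies() {
        Map<String, String> deps = new LinkedHashMap<>();
        deps.put("org.beanshell:bsh:2.0b4", "LGPL");
        deps.put("jgrapht:jgrapht:0.6.0", "LGPL");
        deps.put("net.sf.jwordnet:jwnl:1.4_rc3", "BSD");
        deps.put("net.sf.trove4j:trove4j:2.0.2", "LGPL");
        deps.put("com.googlecode.matrix-toolkits-java:mtj:0.9.14", "LGPL");
        return deps;
    }

    // Counts dependencies whose license falls into category X.
    static long countBlocked(Map<String, String> deps) {
        return deps.values().stream()
                .filter(license -> categorize(license) == Category.X)
                .count();
    }

    public static void main(String[] args) {
        Map<String, String> deps = pinnedDependencies();
        deps.forEach((coords, license) ->
                System.out.println(coords + " -> " + license
                        + " (category " + categorize(license) + ")"));
        System.out.println("Blocked by license: " + countBlocked(deps));
    }
}
```

With the pinned versions, four of the five dependencies land in category X.
Moving bsh and jgrapht to their relicensed versions would leave trove4j and
mtj as the remaining blockers.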

Re: mallet addon

Posted by Joern Kottmann <ko...@gmail.com>.
Hello,

This doesn't work with the 1.6.0 release. I built it to test one of
the first drafts of the machine learning rewrite we did for 1.6.0.
There have been a few changes since then.
Anyway, if you need it, I am happy to fix it up. We can also move
it to the sandbox; releasing it at Apache might be more difficult, since
mallet pulls in dependencies with incompatible licenses. But maybe that
has changed; we can check.

What do you think?

Jörn



On Tue, Sep 29, 2015 at 2:34 PM, Rodrigo Agerri <ra...@apache.org> wrote:

> Hello,
>
> I have seen that there is a mallet addon here
>
> https://github.com/kottmann/opennlp-mallet-addon
>
> Is this currently being used or integrated in OpenNLP? I have not seen it
> with the rest of the addons.
>
> Cheers,
>
> Rodrigo
>