You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Bertrand Rigaldies <br...@gmail.com> on 2022/07/24 19:28:04 UTC

"Reuters" data for the CONLL 2003 task

Hi folks, I’m working on https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the “Reuters” data. For that, it appears that Reuters require this form to be filled out to request authorization: https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html

The form requires an “organization” information and signature:

Organization ____________________________________________________
Corporation/Partnership/Legal Entity ____________________________
Official mail address __________________________________________
_________________________________________________________________
_________________________________________________________________
Telephone _____________________________________
Facsimile _____________________________________
Electronic mail ________________________________


Accepted by the organization:

Signature_________________________________

Date ________________________________

Name (please print) ______________________________

Title ___________________________

Institution/Agency ______________________________________________________________

What do I fill out for OpenNLP-associated work? Also, how do I get an “organization’s signature”?

Thanks.
Bertrand Rigaldies

Re: "Reuters" data for the CONLL 2003 task

Posted by Bertrand Rigaldies <br...@gmail.com>.
Thank you Jeff for responding quickly. I’ll submit the form self-approved and see what happens. If it works, I’ll update our doc accordingly.

I’ll also stop email you directly unless I get no answer on dev.

Have a great week.

Bertrand

> On Jul 24, 2022, at 6:49 PM, Suneel Marthi <su...@gmail.com> wrote:
> 
> Did we want to discontinue Reuters and go with a dataset like IMDB reviews etc… ?
> 
> Sent from my iPhone
> 
>> On Jul 24, 2022, at 6:43 PM, Jeff Zemerick <jz...@apache.org> wrote:
>> 
>> HI Bertrand,
>> 
>> This probably shouldn't be considered factual advice, but I think as an
>> individual you can "accept" it yourself. The folks at NIST (
>> reuters-request@nist.gov) can likely give a definitive answer to that.
>> 
>> Thanks,
>> Jeff
>> 
>> 
>>> On Sun, Jul 24, 2022 at 3:28 PM Bertrand Rigaldies <br...@gmail.com>
>>> wrote:
>>> 
>>> Hi folks, I’m working on
>>> https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the
>>> “Reuters” data. For that, it appears that Reuters require this form to be
>>> filled out to request authorization:
>>> https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html
>>> 
>>> The form requires an “organization” information and signature:
>>> 
>>> Organization ____________________________________________________
>>> Corporation/Partnership/Legal Entity ____________________________
>>> Official mail address __________________________________________
>>> _________________________________________________________________
>>> _________________________________________________________________
>>> Telephone _____________________________________
>>> Facsimile _____________________________________
>>> Electronic mail ________________________________
>>> 
>>> 
>>> Accepted by the organization:
>>> 
>>> Signature_________________________________
>>> 
>>> Date ________________________________
>>> 
>>> Name (please print) ______________________________
>>> 
>>> Title ___________________________
>>> 
>>> Institution/Agency
>>> ______________________________________________________________
>>> 
>>> What do I fill out for OpenNLP-associated work? Also, how do I get an
>>> “organization’s signature”?
>>> 
>>> Thanks.
>>> Bertrand Rigaldies


Re: "Reuters" data for the CONLL 2003 task

Posted by Suneel Marthi <su...@gmail.com>.
Did we want to discontinue Reuters and go with a dataset like IMDB reviews etc… ?

Sent from my iPhone

> On Jul 24, 2022, at 6:43 PM, Jeff Zemerick <jz...@apache.org> wrote:
> 
> HI Bertrand,
> 
> This probably shouldn't be considered factual advice, but I think as an
> individual you can "accept" it yourself. The folks at NIST (
> reuters-request@nist.gov) can likely give a definitive answer to that.
> 
> Thanks,
> Jeff
> 
> 
>> On Sun, Jul 24, 2022 at 3:28 PM Bertrand Rigaldies <br...@gmail.com>
>> wrote:
>> 
>> Hi folks, I’m working on
>> https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the
>> “Reuters” data. For that, it appears that Reuters require this form to be
>> filled out to request authorization:
>> https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html
>> 
>> The form requires an “organization” information and signature:
>> 
>> Organization ____________________________________________________
>> Corporation/Partnership/Legal Entity ____________________________
>> Official mail address __________________________________________
>> _________________________________________________________________
>> _________________________________________________________________
>> Telephone _____________________________________
>> Facsimile _____________________________________
>> Electronic mail ________________________________
>> 
>> 
>> Accepted by the organization:
>> 
>> Signature_________________________________
>> 
>> Date ________________________________
>> 
>> Name (please print) ______________________________
>> 
>> Title ___________________________
>> 
>> Institution/Agency
>> ______________________________________________________________
>> 
>> What do I fill out for OpenNLP-associated work? Also, how do I get an
>> “organization’s signature”?
>> 
>> Thanks.
>> Bertrand Rigaldies

Re: "Reuters" data for the CONLL 2003 task

Posted by Jeff Zemerick <jz...@apache.org>.
HI Bertrand,

This probably shouldn't be considered factual advice, but I think as an
individual you can "accept" it yourself. The folks at NIST (
reuters-request@nist.gov) can likely give a definitive answer to that.

Thanks,
Jeff


On Sun, Jul 24, 2022 at 3:28 PM Bertrand Rigaldies <br...@gmail.com>
wrote:

> Hi folks, I’m working on
> https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the
> “Reuters” data. For that, it appears that Reuters require this form to be
> filled out to request authorization:
> https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html
>
> The form requires an “organization” information and signature:
>
> Organization ____________________________________________________
> Corporation/Partnership/Legal Entity ____________________________
> Official mail address __________________________________________
> _________________________________________________________________
> _________________________________________________________________
> Telephone _____________________________________
> Facsimile _____________________________________
> Electronic mail ________________________________
>
>
> Accepted by the organization:
>
> Signature_________________________________
>
> Date ________________________________
>
> Name (please print) ______________________________
>
> Title ___________________________
>
> Institution/Agency
> ______________________________________________________________
>
> What do I fill out for OpenNLP-associated work? Also, how do I get an
> “organization’s signature”?
>
> Thanks.
> Bertrand Rigaldies