Posted to dev@opennlp.apache.org by Bertrand Rigaldies <br...@gmail.com> on 2022/07/24 19:28:04 UTC
"Reuters" data for the CONLL 2003 task
Hi folks, I’m working on https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the “Reuters” data. It appears that Reuters requires this form to be filled out to request authorization: https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html
The form requires “organization” information and a signature:
Organization ____________________________________________________
Corporation/Partnership/Legal Entity ____________________________
Official mail address __________________________________________
_________________________________________________________________
_________________________________________________________________
Telephone _____________________________________
Facsimile _____________________________________
Electronic mail ________________________________
Accepted by the organization:
Signature_________________________________
Date ________________________________
Name (please print) ______________________________
Title ___________________________
Institution/Agency ______________________________________________________________
What do I fill out for OpenNLP-associated work? Also, how do I get an “organization’s signature”?
Thanks.
Bertrand Rigaldies
Re: "Reuters" data for the CONLL 2003 task
Posted by Bertrand Rigaldies <br...@gmail.com>.
Thank you Jeff for responding quickly. I’ll submit the form self-approved and see what happens. If it works, I’ll update our doc accordingly.
I’ll also stop emailing you directly unless I get no answer on dev.
Have a great week.
Bertrand
> On Jul 24, 2022, at 6:49 PM, Suneel Marthi <su...@gmail.com> wrote:
>
> Did we want to discontinue Reuters and go with a dataset like IMDB reviews etc… ?
>
>> On Jul 24, 2022, at 6:43 PM, Jeff Zemerick <jz...@apache.org> wrote:
>>
>> Hi Bertrand,
>>
>> This probably shouldn't be considered factual advice, but I think as an
>> individual you can "accept" it yourself. The folks at NIST (
>> reuters-request@nist.gov) can likely give a definitive answer to that.
>>
>> Thanks,
>> Jeff
Re: "Reuters" data for the CONLL 2003 task
Posted by Suneel Marthi <su...@gmail.com>.
Did we want to discontinue Reuters and go with a dataset like IMDB reviews etc… ?
> On Jul 24, 2022, at 6:43 PM, Jeff Zemerick <jz...@apache.org> wrote:
>
> Hi Bertrand,
>
> This probably shouldn't be considered factual advice, but I think as an
> individual you can "accept" it yourself. The folks at NIST (
> reuters-request@nist.gov) can likely give a definitive answer to that.
>
> Thanks,
> Jeff
Re: "Reuters" data for the CONLL 2003 task
Posted by Jeff Zemerick <jz...@apache.org>.
Hi Bertrand,
This probably shouldn't be considered factual advice, but I think as an
individual you can "accept" it yourself. The folks at NIST (
reuters-request@nist.gov) can likely give a definitive answer to that.
Thanks,
Jeff