Posted to issues@opennlp.apache.org by "Bruno P. Kinoshita (JIRA)" <ji...@apache.org> on 2017/07/07 12:41:00 UTC

[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

    [ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16078014#comment-16078014 ] 

Bruno P. Kinoshita commented on OPENNLP-1013:
---------------------------------------------

I believe this issue could be in the R module, and not in OpenNLP.

Looking at the code around [this part|https://github.com/cran/openNLP/blob/a1709dea5f8a92757fcfa5bf672aa922041dc119/R/pos.R#L54], it appears that for English (the default value of language) the meta variable has the right fields.

But when you pass a different language, the package tries to load the models in a different way. I am not sure what exactly the problem is, but it looks to be in the R code of that package, not in the OpenNLP code.
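
To illustrate the kind of failure I suspect, here is a minimal sketch in plain R. It does not use the openNLP package at all. The names meta_en, meta_fr, lookup and safe_lookup, and the values stored in them, are made up for illustration, and I am only assuming that the package's meta variable behaves like a named character vector. Under that assumption, asking for a field that is missing produces the same message as in the report:

```r
# Hypothetical stand-ins for the metadata the annotator consults.
# The real structure inside the openNLP R package may differ.
meta_en <- c(POS_tagset = "en-ptb",
             POS_tagset_URL = "https://example.org/tagset")
meta_fr <- c(language = "fr")  # assumed: no tagset fields for this language

# Look up one field and return the error message instead of stopping.
lookup <- function(meta, tag) {
    tryCatch(meta[[tag]], error = function(e) conditionMessage(e))
}

en_result <- lookup(meta_en, "POS_tagset")  # field exists, value is returned
fr_result <- lookup(meta_fr, "POS_tagset")  # field missing, message is returned
print(en_result)
print(fr_result)

# A defensive variant: check the name first and fall back to NA.
safe_lookup <- function(meta, tag) {
    if (tag %in% names(meta)) meta[[tag]] else NA_character_
}
print(safe_lookup(meta_fr, "POS_tagset"))
```

If the package does something similar, a name check before the [[ lookup would avoid the error, but that would need to be confirmed against the actual package code.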

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> -----------------------------------------------------------
>
>                 Key: OPENNLP-1013
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1013
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: POS Tagger
>    Affects Versions: tools-1.5.3
>         Environment: R Language, RStudio
>            Reporter: Iuri Deolindo Nogueira
>
> When using French models in R, I'm receiving a "subscript out of bounds" error. Details below:
> -------------------------
> I'm using French models for NLP in an R environment. To get the French models, I'm using binaries compiled and developed by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -------------------------
> The problem happens only with the POS function. This is how I call the function, and the resulting error:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = paste0(<path_folder_with_bins>, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -------------------------
> However, if I remove the language parameter, the error no longer occurs:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = paste0(<path_folder_with_bins>, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)