You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Luca (JIRA)" <ji...@apache.org> on 2018/03/26 10:01:00 UTC

[jira] [Commented] (OPENNLP-402) CLI tools and formats refactored

    [ https://issues.apache.org/jira/browse/OPENNLP-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16413635#comment-16413635 ] 

Luca commented on OPENNLP-402:
------------------------------

support for CONL02 format is currently broken since it expects 3 columns, however the conl002 format has only 2 columns ([https://www.clips.uantwerpen.be/conll2002/ner/data/000README)]

 

> CLI tools and formats refactored
> --------------------------------
>
>                 Key: OPENNLP-402
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-402
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Formats
>    Affects Versions: tools-1.5.3
>            Reporter: Aliaksandr Autayeu
>            Assignee: Joern Kottmann
>            Priority: Major
>              Labels: patch
>             Fix For: tools-1.5.3
>
>         Attachments: 0016-CLI-tools-and-formats-refactored.patch, 0017-added-direct-format-support-to-CLI-tools-and-formats.patch, open-nlp-cli-package.png, opennlp-cmdline-package-class-structure.png
>
>
> Proposed patch refactors CLI tools and simplifies the code by introducing hierarchy and removing a lot of code duplication. It also introduces better error and help messages, including help for formats and listing available formats in various tools, which are now able to work with formats directly. This, in turn, eliminates the need to keep converted files on disk.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)