You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Philipp Daumke (JIRA)" <de...@uima.apache.org> on 2012/08/01 23:58:02 UTC

[jira] [Created] (UIMA-2452) Complex Types

Philipp Daumke created UIMA-2452:
------------------------------------

             Summary: Complex Types
                 Key: UIMA-2452
                 URL: https://issues.apache.org/jira/browse/UIMA-2452
             Project: UIMA
          Issue Type: New Feature
          Components: TextMarker
            Reporter: Philipp Daumke


Hi Peter,

I think, some more complex types would really be a useful add-on. Currently I have in mind:

SENTENCE
LINE
DATE
TIME

Best regards
Philipp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (UIMA-2452) Complex Types

Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427208#comment-13427208 ] 

Peter Klügl commented on UIMA-2452:
-----------------------------------

The problem here is that there are different interpretations what a sentence, date and time is or should look like. All but line are somewhat domain or language dependent. A new seeder only for a line annotation...

Date and time are yet more complex to define and rather a use case for TextMarker rules that are used as a library.

I'd rather pursue different approaches:
- adding those annotation to the cas/xmi before TextMarker rules are applied, e.g., by a sentence splitter analysis engine
- adding an analysis engine, e.g., to the descriptor/utils folder and calling it within a TextMarker script in order to add those annotation.

You could of course also write our own seeder and configure the TextMarker analysis engine template in a project so that more annotations are automatically created but that new feature isn't well tested yet.

Anyways, that is just my opinion and I am open to discussion.

                
> Complex Types
> -------------
>
>                 Key: UIMA-2452
>                 URL: https://issues.apache.org/jira/browse/UIMA-2452
>             Project: UIMA
>          Issue Type: New Feature
>          Components: TextMarker
>            Reporter: Philipp Daumke
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Assigned] (UIMA-2452) Complex Types

Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Peter Klügl reassigned UIMA-2452:
---------------------------------

    Assignee: Peter Klügl
    
> Complex Types
> -------------
>
>                 Key: UIMA-2452
>                 URL: https://issues.apache.org/jira/browse/UIMA-2452
>             Project: UIMA
>          Issue Type: New Feature
>          Components: TextMarker
>            Reporter: Philipp Daumke
>            Assignee: Peter Klügl
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Commented] (UIMA-2452) Complex Types

Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13431171#comment-13431171 ] 

Peter Klügl commented on UIMA-2452:
-----------------------------------

I added a new analysis engine in the utils folder of TextMarker projects. Here is how to apply it in TextMarker: 

PACKAGE org.apache.uima;

TYPESYSTEM utils.PlainTextTypeSystem;
ENGINE utils.PlainTextAnnotator;

Document{ -> EXEC(PlainTextAnnotator, {Line, Paragraph})};
Line;
Paragraph;
                
> Complex Types
> -------------
>
>                 Key: UIMA-2452
>                 URL: https://issues.apache.org/jira/browse/UIMA-2452
>             Project: UIMA
>          Issue Type: New Feature
>          Components: TextMarker
>            Reporter: Philipp Daumke
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira