You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Philipp Daumke (JIRA)" <de...@uima.apache.org> on 2012/08/01 23:58:02 UTC
[jira] [Created] (UIMA-2452) Complex Types
Philipp Daumke created UIMA-2452:
------------------------------------
Summary: Complex Types
Key: UIMA-2452
URL: https://issues.apache.org/jira/browse/UIMA-2452
Project: UIMA
Issue Type: New Feature
Components: TextMarker
Reporter: Philipp Daumke
Hi Peter,
I think, some more complex types would really be a useful add-on. Currently I have in mind:
SENTENCE
LINE
DATE
TIME
Best regards
Philipp
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (UIMA-2452) Complex Types
Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427208#comment-13427208 ]
Peter Klügl commented on UIMA-2452:
-----------------------------------
The problem here is that there are different interpretations what a sentence, date and time is or should look like. All but line are somewhat domain or language dependent. A new seeder only for a line annotation...
Date and time are yet more complex to define and rather a use case for TextMarker rules that are used as a library.
I'd rather pursue different approaches:
- adding those annotation to the cas/xmi before TextMarker rules are applied, e.g., by a sentence splitter analysis engine
- adding an analysis engine, e.g., to the descriptor/utils folder and calling it within a TextMarker script in order to add those annotation.
You could of course also write our own seeder and configure the TextMarker analysis engine template in a project so that more annotations are automatically created but that new feature isn't well tested yet.
Anyways, that is just my opinion and I am open to discussion.
> Complex Types
> -------------
>
> Key: UIMA-2452
> URL: https://issues.apache.org/jira/browse/UIMA-2452
> Project: UIMA
> Issue Type: New Feature
> Components: TextMarker
> Reporter: Philipp Daumke
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (UIMA-2452) Complex Types
Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Peter Klügl reassigned UIMA-2452:
---------------------------------
Assignee: Peter Klügl
> Complex Types
> -------------
>
> Key: UIMA-2452
> URL: https://issues.apache.org/jira/browse/UIMA-2452
> Project: UIMA
> Issue Type: New Feature
> Components: TextMarker
> Reporter: Philipp Daumke
> Assignee: Peter Klügl
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (UIMA-2452) Complex Types
Posted by "Peter Klügl (JIRA)" <de...@uima.apache.org>.
[ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13431171#comment-13431171 ]
Peter Klügl commented on UIMA-2452:
-----------------------------------
I added a new analysis engine in the utils folder of TextMarker projects. Here is how to apply it in TextMarker:
PACKAGE org.apache.uima;
TYPESYSTEM utils.PlainTextTypeSystem;
ENGINE utils.PlainTextAnnotator;
Document{ -> EXEC(PlainTextAnnotator, {Line, Paragraph})};
Line;
Paragraph;
> Complex Types
> -------------
>
> Key: UIMA-2452
> URL: https://issues.apache.org/jira/browse/UIMA-2452
> Project: UIMA
> Issue Type: New Feature
> Components: TextMarker
> Reporter: Philipp Daumke
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira