Posted to dev@uima.apache.org by "Rico Landefeld (JIRA)" <ui...@incubator.apache.org> on 2009/03/02 12:58:12 UTC

[jira] Created: (UIMA-1299) Contribution of Lucene CAS Indexer

Contribution of Lucene CAS Indexer
----------------------------------

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld


Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is driven by an XML-based "mapping configuration file" in which the user determines which UIMA annotations are put into which Lucene field, and how each field is set up (e.g. indexed and/or stored). In addition, some basic functionality for (ontological) hypernym indexing is provided.

Additionally, Lucas is able to perform offset-based token stream alignment and merging of UIMA annotations (via token position increment) in the same Lucene field (e.g. "documenttext" or "title").
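
To illustrate the offset-based merging idea described above, here is a minimal sketch in plain Python. It is not Lucas code and does not use the Lucene API: the `Token` class, the `merge_streams` function, and the alignment rule (a token that starts at the same character offset as the previous one gets a position increment of 0, so it is stacked on the same position) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """Hypothetical token: covered text plus begin/end character offsets."""
    text: str
    begin: int
    end: int


def merge_streams(*streams):
    """Merge several token streams into one, ordered by character offsets.

    Returns a list of (token, position_increment) pairs. A token that begins
    at the same offset as the previously emitted token gets increment 0
    (same position); every other token advances the position by 1.
    This alignment rule is an assumption of the sketch.
    """
    # sorted() is stable, so on ties tokens from earlier streams come first.
    merged = sorted(
        (tok for stream in streams for tok in stream),
        key=lambda tok: (tok.begin, tok.end),
    )
    result = []
    previous_begin = None
    for tok in merged:
        increment = 0 if tok.begin == previous_begin else 1
        result.append((tok, increment))
        previous_begin = tok.begin
    return result
```

For example, merging a word stream with an entity-annotation stream over the same text would place an entity label at the same position as the word it covers, which is what lets a phrase or span query match either of them in one field.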


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by Rico Landefeld <ri...@googlemail.com>.
Yes, it should.
2009/5/27 Jörn Kottmann (JIRA) <ui...@incubator.apache.org>

>
>    [
> https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12713534#action_12713534]
>
> Jörn Kottmann commented on UIMA-1299:
> -------------------------------------
>
> Is the software grant now listed for this contribution ?
>
> > Contribution of Lucene CAS Indexer
> > ----------------------------------
> >
> >                 Key: UIMA-1299
> >                 URL: https://issues.apache.org/jira/browse/UIMA-1299
> >             Project: UIMA
> >          Issue Type: New Feature
> >          Components: Sandbox
> >            Reporter: Rico Landefeld
> >         Attachments: lucene-indexer.tar.gz, pom.xml
> >
> >
>
>

[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor updated UIMA-1299:
---------------------------------

    Fix Version/s: 2.3S

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>             Fix For: 2.3S
>
>         Attachments: lucas.tar.gz, lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jukka Zitting (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12714678#action_12714678 ] 

Jukka Zitting commented on UIMA-1299:
-------------------------------------

Actually, the grant Marshall asked me to look for was another one (I should read the bug description more closely before commenting...). Anyway, both grants are in.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jukka Zitting (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12714677#action_12714677 ] 

Jukka Zitting commented on UIMA-1299:
-------------------------------------

Marshall asked me to check the status of the software grant.

Yes, it has been received and recorded by the ASF secretary.

Feel free to move forward with this.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by Thilo Goetz <tw...@gmx.de>.
Jörn Kottmann wrote:
> Thilo Goetz wrote:
>> Rico,
>>
>> have you sent in a contributor's license agreement (CLA) yet?
>> If not, please do. See
>> http://www.apache.org/licenses/#clas
>>
>> If you have done so already, we need to find out why you're
>> not listed here:
>> http://people.apache.org/~jim/committers.html
>>   
> Can we check in the update anyway ?
> 
> Jörn

Yes, we can.  Rico checked the "Grant license to ASF for
inclusion in ASF works (as per the Apache License §5)"
checkbox for the attachment, so we're good.  However, as
Rico will (hopefully) continue to make contributions, it
would be better to have an ICLA on file.

--Thilo


Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by Jörn Kottmann <ko...@gmail.com>.
Thilo Goetz wrote:
> Rico,
>
> have you sent in a contributor's license agreement (CLA) yet?
> If not, please do. See
> http://www.apache.org/licenses/#clas
>
> If you have done so already, we need to find out why you're
> not listed here:
> http://people.apache.org/~jim/committers.html
>   
Can we check in the update anyway ?

Jörn

Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by Thilo Goetz <tw...@gmx.de>.
Rico,

have you sent in a contributor's license agreement (CLA) yet?
If not, please do. See
http://www.apache.org/licenses/#clas

If you have done so already, we need to find out why you're
not listed here:
http://people.apache.org/~jim/committers.html

--Thilo

Rico Landefeld (JIRA) wrote:
>      [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
> 
>     Attachment: lucas.tar.gz
> 
> The version in the sandbox is not the latest one because of the software grants delay.
> 
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>
>>                 Key: UIMA-1299
>>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>>             Project: UIMA
>>          Issue Type: New Feature
>>          Components: Sandbox-Lucas
>>            Reporter: Rico Landefeld
>>            Assignee: Jörn Kottmann
>>         Attachments: lucas.tar.gz, lucene-indexer.tar.gz, pom.xml
>>
>>
> 


[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Rico Landefeld (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rico Landefeld updated UIMA-1299:
---------------------------------

    Attachment: lucas.tar.gz

The version in the sandbox is not the latest one because of the software grants delay.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucas.tar.gz, lucene-indexer.tar.gz, pom.xml
>
>


[jira] Assigned: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann reassigned UIMA-1299:
-----------------------------------

    Assignee: Jörn Kottmann

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann updated UIMA-1299:
--------------------------------

    Component/s:     (was: Sandbox)
                 Sandbox-Lucas

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Closed: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann closed UIMA-1299.
-------------------------------

    Resolution: Fixed

Rico's update was applied to the project.

The Java version dependency was changed from 6 to 5, since the project
did not use any Java 6 features.

The UAE Stemmer was removed pending a discussion about license
handling on the mailing list. It seems to be licensed under the ASL, but
what is the process for including third-party source files?

Many test cases are still failing; that will be handled in
issue UIMA-1403.

The Maven site was removed, since the documentation is now in DocBook format
and we do not use Maven sites within UIMA projects. The corresponding issue, UIMA-1361,
is now fixed.

Most files now have a license header.

All author tags were removed to comply with Apache rules.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucas.tar.gz, lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Rico Landefeld (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12682020#action_12682020 ] 

Rico Landefeld commented on UIMA-1299:
--------------------------------------

Software grants were already faxed to the ASF office.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Closed: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann closed UIMA-1299.
-------------------------------

    Resolution: Fixed

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by Thilo Goetz <tw...@gmx.de>.
In order to move this along, I'll call for a vote.
No use prevaricating about the bush ;-)

--Thilo

Rico Landefeld (JIRA) wrote:
>      [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
> 
>     Attachment: lucene-indexer.tar.gz
> 
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>
>>                 Key: UIMA-1299
>>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>>             Project: UIMA
>>          Issue Type: New Feature
>>          Components: Sandbox
>>            Reporter: Rico Landefeld
>>         Attachments: lucene-indexer.tar.gz
>>
>>
> 

[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Rico Landefeld (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rico Landefeld updated UIMA-1299:
---------------------------------

    Attachment: lucene-indexer.tar.gz

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz
>
>


[jira] Reopened: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann reopened UIMA-1299:
---------------------------------


Project must be updated with the current version from Rico Landefeld.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox-Lucas
>            Reporter: Rico Landefeld
>            Assignee: Jörn Kottmann
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12681901#action_12681901 ] 

Marshall Schor commented on UIMA-1299:
--------------------------------------

Status update:  Vote for this was done, result: voted to accept, see: http://markmail.org/message/4v4lpggkccmeiczv

Waiting for software grant to be sent in and acknowledged.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Jörn Kottmann (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12713534#action_12713534 ] 

Jörn Kottmann commented on UIMA-1299:
-------------------------------------

Is the software grant now listed for this contribution ?

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Rico Landefeld (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rico Landefeld updated UIMA-1299:
---------------------------------

    Attachment: pom.xml

Corrected POM without parent POM.

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz, pom.xml
>
>


[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer

Posted by "Thilo Goetz (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678678#action_12678678 ] 

Thilo Goetz commented on UIMA-1299:
-----------------------------------

The md5 signature of lucene-indexer.tar.gz is 48 15 80 58 4D D0 96 3A  4E 90 74 E3 45 97 51 12
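
A downloaded copy of the attachment can be checked against the signature above with a few lines of Python. This is only a sketch using the standard `hashlib` module; the helper names `md5_hex`, `normalize` and `matches` are hypothetical, and the local file path is whatever the attachment was saved as.

```python
import hashlib


def md5_hex(path, chunk_size=65536):
    """Return the MD5 digest of a file as a lowercase hex string."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def normalize(signature):
    """Turn a spaced, upper-case signature ('48 15 80 ...') into plain hex."""
    return "".join(signature.split()).lower()


def matches(path, posted_signature):
    """True if the file's MD5 equals the posted signature."""
    return md5_hex(path) == normalize(posted_signature)
```

Calling `matches("lucene-indexer.tar.gz", "48 15 80 58 4D D0 96 3A  4E 90 74 E3 45 97 51 12")` on the saved attachment would then report whether the file is the one the comment refers to.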

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz
>
>