You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@uima.apache.org by Marshall Schor <ms...@schor.com> on 2011/12/09 16:51:39 UTC
Re: uima matched pattern
Hi,
There may be some confusion about roles, here. UIMA itself doesn't have a
concept of "matched - from - pattern".
It does have a general concept of "annotations", which have the begin and end
points. It is up to particular Annotators to specify these. Most annotators
(that process "text") set these values to correspond to the part of the text the
annotation is associated with.
What Annotator are you running that you expect to produce matched patterns?
-Marshall
On 12/10/2011 7:31 AM, francesco.tangari.inf@gmail.com wrote:
> how do i get the matched text from a jcas object ??
> looking at the begin and end feature it is wrong becuz it sometime say : begin : 0 . end: a number. and at 0 there isn't a match.
> i m talking about matched patterns.
>
>
Re: uima matched pattern
Posted by Francesco Tangari <fr...@gmail.com>.
Wait i give you the full project ;)
2011/12/12 Peter Klügl <pk...@uni-wuerzburg.de>
> Hi,
>
> your problem is not really reproducible with the given information.
>
> Does line 133 "PrintAnnotations.**printAnnotations(aCAS, System.out);"
> print all annotations of the CAS? Then, there is probably only the
> DocumentAnnotation that is the line of your BufferedReader in the code.
>
> Have you tried to debug your Annotator and checked if your regexp pattern
> matches at all on the input text?
>
> Peter
>
> Am 11.12.2011 11:04, schrieb francesco.tangari.inf@gmail.**com<fr...@gmail.com>
> :
>
>> first of all thx for the reply. the Annotator is this one
>> http://pastebin.com/44EcubqC very simple regular expression
>> Annotator. And this is called by this http://pastebin.com/6vwuRvvt. But
>> it dose not print the Matched "Word" , "text" , instead , it show me the
>> full Line that i read from file ( i don't want that).
>>
>> --
>> francesco.tangari.inf@gmail.**com <fr...@gmail.com>
>> Inviato con Sparrow (http://www.sparrowmailapp.**com/?sig<http://www.sparrowmailapp.com/?sig>
>> )
>>
>>
>>
>> Il giorno venerdì 9 dicembre 2011, alle ore 16.51, Marshall Schor ha
>> scritto:
>>
>> Hi,
>>>
>>> There may be some confusion about roles, here. UIMA itself doesn't have a
>>> concept of "matched - from - pattern".
>>>
>>> It does have a general concept of "annotations", which have the begin
>>> and end
>>> points. It is up to particular Annotators to specify these. Most
>>> annotators
>>> (that process "text") set these values to correspond to the part of the
>>> text the
>>> annotation is associated with.
>>>
>>> What Annotator are you running that you expect to produce matched
>>> patterns?
>>>
>>> -Marshall
>>>
>>> On 12/10/2011 7:31 AM, francesco.tangari.inf@gmail.**com<fr...@gmail.com>(mailto:
>>> francesco.tangari.inf@**gmail.com <fr...@gmail.com>)
>>> wrote:
>>>
>>>> how do i get the matched text from a jcas object ??
>>>> looking at the begin and end feature it is wrong becuz it sometime say
>>>> : begin : 0 . end: a number. and at 0 there isn't a match.
>>>> i m talking about matched patterns.
>>>>
>>>>
>>>
>>>
>>>
>>
>>
>
> --
> ------------------------------**------------------------------**---------
> Dipl.-Inf. Peter Klügl
> Universität Würzburg Tel.: +49-(0)931-31-86741
> Am Hubland Fax.: +49-(0)931-31-86732
> 97074 Würzburg mail: pkluegl@informatik.uni-**wuerzburg.de<pk...@informatik.uni-wuerzburg.de>
> http://www.is.informatik.uni-**wuerzburg.de/en/staff/kluegl_**peter/<http://www.is.informatik.uni-wuerzburg.de/en/staff/kluegl_peter/>
> ------------------------------**------------------------------**---------
>
>
Re: uima matched pattern
Posted by Peter Klügl <pk...@uni-wuerzburg.de>.
Hi,
your problem is not really reproducible with the given information.
Does line 133 "PrintAnnotations.printAnnotations(aCAS, System.out);"
print all annotations of the CAS? Then, there is probably only the
DocumentAnnotation that is the line of your BufferedReader in the code.
Have you tried to debug your Annotator and checked if your regexp
pattern matches at all on the input text?
Peter
Am 11.12.2011 11:04, schrieb francesco.tangari.inf@gmail.com:
> first of all thx for the reply. the Annotator is this one http://pastebin.com/44EcubqC very simple regular expression Annotator. And this is called by this http://pastebin.com/6vwuRvvt. But it dose not print the Matched "Word" , "text" , instead , it show me the full Line that i read from file ( i don't want that).
>
> --
> francesco.tangari.inf@gmail.com
> Inviato con Sparrow (http://www.sparrowmailapp.com/?sig)
>
>
> Il giorno venerdì 9 dicembre 2011, alle ore 16.51, Marshall Schor ha scritto:
>
>> Hi,
>>
>> There may be some confusion about roles, here. UIMA itself doesn't have a
>> concept of "matched - from - pattern".
>>
>> It does have a general concept of "annotations", which have the begin and end
>> points. It is up to particular Annotators to specify these. Most annotators
>> (that process "text") set these values to correspond to the part of the text the
>> annotation is associated with.
>>
>> What Annotator are you running that you expect to produce matched patterns?
>>
>> -Marshall
>>
>> On 12/10/2011 7:31 AM, francesco.tangari.inf@gmail.com (mailto:francesco.tangari.inf@gmail.com) wrote:
>>> how do i get the matched text from a jcas object ??
>>> looking at the begin and end feature it is wrong becuz it sometime say : begin : 0 . end: a number. and at 0 there isn't a match.
>>> i m talking about matched patterns.
>>>
>>
>>
>>
>
>
--
---------------------------------------------------------------------
Dipl.-Inf. Peter Klügl
Universität Würzburg Tel.: +49-(0)931-31-86741
Am Hubland Fax.: +49-(0)931-31-86732
97074 Würzburg mail: pkluegl@informatik.uni-wuerzburg.de
http://www.is.informatik.uni-wuerzburg.de/en/staff/kluegl_peter/
---------------------------------------------------------------------
Re: uima matched pattern
Posted by fr...@gmail.com.
first of all thx for the reply. the Annotator is this one http://pastebin.com/44EcubqC very simple regular expression Annotator. And this is called by this http://pastebin.com/6vwuRvvt. But it dose not print the Matched "Word" , "text" , instead , it show me the full Line that i read from file ( i don't want that).
--
francesco.tangari.inf@gmail.com
Inviato con Sparrow (http://www.sparrowmailapp.com/?sig)
Il giorno venerdì 9 dicembre 2011, alle ore 16.51, Marshall Schor ha scritto:
> Hi,
>
> There may be some confusion about roles, here. UIMA itself doesn't have a
> concept of "matched - from - pattern".
>
> It does have a general concept of "annotations", which have the begin and end
> points. It is up to particular Annotators to specify these. Most annotators
> (that process "text") set these values to correspond to the part of the text the
> annotation is associated with.
>
> What Annotator are you running that you expect to produce matched patterns?
>
> -Marshall
>
> On 12/10/2011 7:31 AM, francesco.tangari.inf@gmail.com (mailto:francesco.tangari.inf@gmail.com) wrote:
> > how do i get the matched text from a jcas object ??
> > looking at the begin and end feature it is wrong becuz it sometime say : begin : 0 . end: a number. and at 0 there isn't a match.
> > i m talking about matched patterns.
> >
>
>
>