You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Suresh Chandran (JIRA)" <ji...@apache.org> on 2007/01/19 15:47:29 UTC

[jira] Created: (LUCENE-779) behaviour of Lucene in tokenizing - causes query problems

behaviour of Lucene in tokenizing - causes query problems
---------------------------------------------------------

                 Key: LUCENE-779
                 URL: https://issues.apache.org/jira/browse/LUCENE-779
             Project: Lucene - Java
          Issue Type: Improvement
          Components: Index
         Environment: SLES9.3, JIRA 3.6.3 (Standalone)
            Reporter: Suresh Chandran
         Attachments: query error.doc

I recently submitted a problem with queries to Atlassian Support team and they have pointed out that it is an issue with Lucene. I'm copying the contents from my support request to Atlassian. Please help.

Question to Atlassian:

When I use "text search" to query an item from summary it gives me zero match. But when I use query through custom field, I get results displayed properly.

In the attachment, I did a query for no: 10012363 using text search and I know this entry is part of the summary for several issues. But the result says "no matching issues found".

But when I selected, Vendor name and number, I get proper results. I tried re-indexing, still no help

Response from Atlassian:

The reason why you couldn't receive any results from the query is due to the behaviour of Lucene (an Apache-developed search and index library being used in JIRA) in tokenizing.

For example, *Invoice Number-123456-API ASSOCIATES-10012363* is tokenized to:

{noformat}Invoice
Number-123456-API
ASSOCIATES-10012363{noformat}

In this case, {{Number-123456-API}} and {{ASSOCIATES-10012363}} are recognized as {{NUM}} (e.g. floating point, serial, model numbers, ip addresses, etc.).

~(Hope this doesn't sound too technical to you.)~

You can somehow call this a limitation of the third party library. If you are keen to have it improved, feel free to raise a support request at [our issue tracker|http://jira.atlassian.com] or perhaps try asking it at [Lucene's issue tracker|http://issues.apache.org/jira/browse/LUCENE].

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Updated: (LUCENE-779) behaviour of Lucene in tokenizing - causes query problems

Posted by "Suresh Chandran (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suresh Chandran updated LUCENE-779:
-----------------------------------

    Attachment: query error.doc

> behaviour of Lucene in tokenizing - causes query problems
> ---------------------------------------------------------
>
>                 Key: LUCENE-779
>                 URL: https://issues.apache.org/jira/browse/LUCENE-779
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>         Environment: SLES9.3, JIRA 3.6.3 (Standalone)
>            Reporter: Suresh Chandran
>         Attachments: query error.doc
>
>
> I recently submitted a problem with queries to Atlassian Support team and they have pointed out that it is an issue with Lucene. I'm copying the contents from my support request to Atlassian. Please help.
> Question to Atlassian:
> When I use "text search" to query an item from summary it gives me zero match. But when I use query through custom field, I get results displayed properly.
> In the attachment, I did a query for no: 10012363 using text search and I know this entry is part of the summary for several issues. But the result says "no matching issues found".
> But when I selected, Vendor name and number, I get proper results. I tried re-indexing, still no help
> Response from Atlassian:
> The reason why you couldn't receive any results from the query is due to the behaviour of Lucene (an Apache-developed search and index library being used in JIRA) in tokenizing.
> For example, *Invoice Number-123456-API ASSOCIATES-10012363* is tokenized to:
> {noformat}Invoice
> Number-123456-API
> ASSOCIATES-10012363{noformat}
> In this case, {{Number-123456-API}} and {{ASSOCIATES-10012363}} are recognized as {{NUM}} (e.g. floating point, serial, model numbers, ip addresses, etc.).
> ~(Hope this doesn't sound too technical to you.)~
> You can somehow call this a limitation of the third party library. If you are keen to have it improved, feel free to raise a support request at [our issue tracker|http://jira.atlassian.com] or perhaps try asking it at [Lucene's issue tracker|http://issues.apache.org/jira/browse/LUCENE].

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Resolved: (LUCENE-779) behaviour of Lucene in tokenizing - causes query problems

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Otis Gospodnetic resolved LUCENE-779.
-------------------------------------

    Resolution: Invalid

This is not a Lucene limitation/bug.  Atlassian guys could/should change how they handle numbers and solve the problem.


> behaviour of Lucene in tokenizing - causes query problems
> ---------------------------------------------------------
>
>                 Key: LUCENE-779
>                 URL: https://issues.apache.org/jira/browse/LUCENE-779
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>         Environment: SLES9.3, JIRA 3.6.3 (Standalone)
>            Reporter: Suresh Chandran
>         Attachments: query error.doc
>
>
> I recently submitted a problem with queries to Atlassian Support team and they have pointed out that it is an issue with Lucene. I'm copying the contents from my support request to Atlassian. Please help.
> Question to Atlassian:
> When I use "text search" to query an item from summary it gives me zero match. But when I use query through custom field, I get results displayed properly.
> In the attachment, I did a query for no: 10012363 using text search and I know this entry is part of the summary for several issues. But the result says "no matching issues found".
> But when I selected, Vendor name and number, I get proper results. I tried re-indexing, still no help
> Response from Atlassian:
> The reason why you couldn't receive any results from the query is due to the behaviour of Lucene (an Apache-developed search and index library being used in JIRA) in tokenizing.
> For example, *Invoice Number-123456-API ASSOCIATES-10012363* is tokenized to:
> {noformat}Invoice
> Number-123456-API
> ASSOCIATES-10012363{noformat}
> In this case, {{Number-123456-API}} and {{ASSOCIATES-10012363}} are recognized as {{NUM}} (e.g. floating point, serial, model numbers, ip addresses, etc.).
> ~(Hope this doesn't sound too technical to you.)~
> You can somehow call this a limitation of the third party library. If you are keen to have it improved, feel free to raise a support request at [our issue tracker|http://jira.atlassian.com] or perhaps try asking it at [Lucene's issue tracker|http://issues.apache.org/jira/browse/LUCENE].

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org