Posted to dev@jspwiki.apache.org by "Bruno Peeters (JIRA)" <ji...@apache.org> on 2009/02/02 16:55:59 UTC

[jira] Created: (JSPWIKI-489) Searching with wildcards

Searching with wildcards 
-------------------------

                 Key: JSPWIKI-489
                 URL: https://issues.apache.org/jira/browse/JSPWIKI-489
             Project: JSPWiki
          Issue Type: Bug
    Affects Versions: 2.8.1
            Reporter: Bruno Peeters


We have noticed that searching for two-letter words is not possible, e.g. searching for "AS IS". It would sometimes be nice if this were possible, but that is not the point of this issue.

A search for "%%sub" (or for "%%sup") returns all pages where these formatting instructions are used.

A search for "su*" returns many pages in which words starting with "su" appear.

A search for "%%su*", however, is unsuccessful: no pages are found, although there are indeed pages containing %%sup and %%sub.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Bruno Peeters (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669908#action_12669908 ] 

Bruno Peeters commented on JSPWIKI-489:
---------------------------------------

The behaviour I observed seems strange to me. Searching for "su*" returns a long list of pages, but searching for "%%su*" returns nothing. Is there an explanation for this inconsistent behaviour?



[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Janne Jalkanen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669916#action_12669916 ] 

Janne Jalkanen commented on JSPWIKI-489:
----------------------------------------

You would have to ask that on the Apache Lucene mailing list, I'm afraid...



[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Janne Jalkanen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669682#action_12669682 ] 

Janne Jalkanen commented on JSPWIKI-489:
----------------------------------------

Of course, I meant to say "two letter", not "two word"...



[jira] Resolved: (JSPWIKI-489) Searching with wildcards

Posted by "Janne Jalkanen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Janne Jalkanen resolved JSPWIKI-489.
------------------------------------

    Resolution: Won't Fix

If you need this feature, please use BasicSearchProvider.  Lucene does not support two-word searches.



[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Murray Altheim (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669788#action_12669788 ] 

Murray Altheim commented on JSPWIKI-489:
----------------------------------------

Actually, Lucene does support two-letter searches, just not for any of the words found in the
English stop list (e.g., "to"). I often search on 'en' or 'ko' or any of the two-letter ISO language
codes with no problem. This does, however, mean that I'll never find Tonga ("to"). We'd need
to modify our Lucene implementation either to stop ignoring the stop words or to modify the
list, which might be problematic. If I lived in Tonga I'd probably do the latter.
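
The stop-list effect described above can be illustrated with a toy model. This is only a sketch in Python, not Lucene's code: the function name index_terms and the short stop list are assumptions made up for the example, and the real English stop list may differ. If "as" and "is" are on the real list, that would also account for the "AS IS" case in the original report.

```python
# Toy model of stop-word filtering; this is NOT Lucene's implementation.
# STOP_WORDS is a small illustrative subset chosen for this example.
STOP_WORDS = {"a", "an", "and", "as", "is", "it", "of", "the", "to"}


def index_terms(text, stop_words=STOP_WORDS):
    """Lowercase the text, split on whitespace, and drop stop words."""
    return [term for term in text.lower().split() if term not in stop_words]


# Both words are on the toy stop list, so nothing is left to search for.
print(index_terms("AS IS"))

# "to" is dropped, while the other two-letter codes survive.
print(index_terms("en ko to"))

# With an empty stop list, every term is kept.
print(index_terms("en ko to", stop_words=set()))
```

The last call corresponds to the "stop ignoring the stop words" option; passing a trimmed set instead would correspond to modifying the list.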





[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Siegfried Goeschl (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669969#action_12669969 ] 

Siegfried Goeschl commented on JSPWIKI-489:
-------------------------------------------

You cannot use a * or ? symbol as the first character of a search (see http://lucene.apache.org/java/2_3_2/queryparsersyntax.html). A wildcard query with the wildcard at the beginning of the search string causes performance problems. Usually you get a Lucene exception when doing that, but an implementation can choose to override that behaviour and silently ignore the exception.
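
As a rough illustration of the rule described above, here is a toy check in Python. It is a sketch only: the names LeadingWildcardError, check_term and lenient_search_terms are hypothetical and do not correspond to any Lucene or JSPWiki API. It rejects terms that begin with * or ?, and shows how a caller could choose to swallow the error instead of reporting it.

```python
class LeadingWildcardError(ValueError):
    """Raised when a search term starts with a wildcard (hypothetical name)."""


def check_term(term, allow_leading_wildcard=False):
    """Return the term unchanged, or raise if it starts with * or ?."""
    if term[:1] in ("*", "?") and not allow_leading_wildcard:
        raise LeadingWildcardError("leading wildcard not allowed: " + term)
    return term


def lenient_search_terms(query):
    """Split a query on whitespace and silently drop rejected terms."""
    kept = []
    for term in query.split():
        try:
            kept.append(check_term(term))
        except LeadingWildcardError:
            # Swallow the error, as a lenient implementation might.
            pass
    return kept


# Only the term with a trailing wildcard survives.
print(lenient_search_terms("*sub su* ?up"))
```

Silently dropping terms keeps the search page free of error messages, at the cost of returning results that may not match what the user typed.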



[jira] Commented: (JSPWIKI-489) Searching with wildcards

Posted by "Bruno Peeters (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669984#action_12669984 ] 

Bruno Peeters commented on JSPWIKI-489:
---------------------------------------

Siegfried, the wildcard is at the end of the string, not at the beginning. The percentage sign (%) is not mentioned as a special character in the Apache Lucene document you refer to. Even with the escape character (\%\%su*), no results are found.
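
One possible explanation, which nobody in this thread confirms, is that plain terms are run through the analyzer (which would strip the % signs) while wildcard terms are matched against the index as typed. The Python sketch below is a toy model of that assumption only. The functions analyze and search and the sample pages are invented for the example and are not Lucene or JSPWiki code.

```python
import re


def analyze(text):
    """Toy analyzer: lowercase and keep only runs of letters and digits."""
    return re.findall(r"[a-z0-9]+", text.lower())


def search(query, pages):
    """Return the names of pages matching a single-term query.

    Assumption: a query ending in * is matched as a raw prefix against the
    indexed terms, while a plain query is passed through the analyzer first.
    """
    hits = []
    for name, text in pages.items():
        terms = analyze(text)
        if query.endswith("*"):
            prefix = query[:-1].lower()  # not analyzed, so "%%" is kept
            matched = any(term.startswith(prefix) for term in terms)
        else:
            wanted = analyze(query)  # "%%sub" becomes ["sub"]
            matched = bool(wanted) and all(w in terms for w in wanted)
        if matched:
            hits.append(name)
    return hits


# Hypothetical sample pages, one of which uses the %%sub formatting.
pages = {"Chemistry": "H%%sub 2/%O", "Summary": "A short summary"}

print(search("%%sub", pages))  # plain term: analyzed before matching
print(search("su*", pages))    # wildcard: prefix "su" matches indexed terms
print(search("%%su*", pages))  # wildcard: no indexed term starts with "%%su"
```

Under this assumption the three searches in the report behave differently because no indexed term ever contains a % sign, so escaping the % signs would not change the outcome either.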
