Posted to dev@jspwiki.apache.org by "Kurt Stein (JIRA)" <ji...@apache.org> on 2008/11/12 15:15:44 UTC

[jira] Created: (JSPWIKI-427) Keywords for Lucene Index

Keywords for Lucene Index
-------------------------

                 Key: JSPWIKI-427
                 URL: https://issues.apache.org/jira/browse/JSPWIKI-427
             Project: JSPWiki
          Issue Type: New Feature
          Components: Core & storage
    Affects Versions: 2.6.4
         Environment: LuceneSearchProvider
            Reporter: Kurt Stein


It would be a nice feature if you could define keywords for wiki pages so that the search results can be made more accurate.

Cause:
Sometimes you have a big page that explains something which is only mentioned in the headline. The page contains very good content, but Lucene does not rank it high enough because its relevance score for that term is very low.
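
To illustrate the ranking problem described above, here is a toy sketch in Python. It is not Lucene's scoring formula and not JSPWiki code: the function `score`, the `keyword_boost` parameter, and the sample pages are all made up for illustration. It only shows why a long page that names its topic once can lose to a short page, and how a declared keyword could compensate.

```python
from collections import Counter


def score(query, body, keywords=(), keyword_boost=5.0):
    """Toy relevance for a single-term query (hypothetical, not Lucene's formula).

    Base score is the term's frequency relative to the page length;
    a fixed boost is added when the term is one of the page's declared keywords.
    """
    term = query.lower()
    words = body.lower().split()
    tf = Counter(words)[term] / len(words) if words else 0.0
    declared = {k.lower() for k in keywords}
    boost = keyword_boost if term in declared else 0.0
    return tf + boost


# A long page that mentions "backup" once, and a short page that repeats it.
long_page = "backup " + "detail " * 500
short_page = "backup tips backup notes"

# Without keywords the short page scores higher for "backup".
print(score("backup", long_page) < score("backup", short_page))

# With "backup" declared as a keyword, the long page scores higher.
print(score("backup", long_page, keywords=["backup"]) > score("backup", short_page))
```

In a real index the same effect would more likely come from a separate, more heavily weighted field than from a fixed additive boost; the constant here is only a stand-in.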

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JSPWIKI-427) Keywords for Lucene Index

Posted by "Murray Altheim (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646997#action_12646997 ] 

Murray Altheim commented on JSPWIKI-427:
----------------------------------------

Given that 'keywords' and 'tags' might be considered similar, the approach that we've taken is to use the TagManager to provide Lucene with information about the declared tags for a given wiki page. This requires a slight modification to the Lucene indexer since the change is to the Lucene index. 

Another difference between tags and keywords is that in our implementation tags are by definition a 'controlled vocabulary': specifically, each tag must reference a wiki page. This ensures that each tag has a definition, which is more restrictive but reduces spelling errors and ambiguity of meaning. 

If this proposal moves forward I would hope we store the keywords in a metadata field labeled 'DC.subject' since that's the appropriate field in Dublin Core for keywords, as used internationally.
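
As a rough illustration of the two ideas above (a controlled vocabulary where every tag must name an existing wiki page, and storing the result under 'DC.subject'), here is a minimal Python sketch. It is not the TagManager API or any JSPWiki code; `accept_keywords`, `page_metadata`, and the page names are hypothetical.

```python
def accept_keywords(candidates, existing_pages):
    """Split candidate keywords into accepted and rejected lists.

    A keyword is accepted only if a wiki page with exactly that name exists
    (the 'controlled vocabulary' rule). Hypothetical helper, for illustration.
    """
    accepted = [k for k in candidates if k in existing_pages]
    rejected = [k for k in candidates if k not in existing_pages]
    return accepted, rejected


def page_metadata(candidates, existing_pages):
    """Build a metadata mapping that stores accepted keywords under 'DC.subject'."""
    accepted, _ = accept_keywords(candidates, existing_pages)
    return {"DC.subject": accepted}


existing_pages = {"Lucene", "Search"}

# "Lucen" is a misspelling with no defining page, so it is rejected.
accepted, rejected = accept_keywords(["Lucene", "Lucen"], existing_pages)
print(accepted, rejected)
print(page_metadata(["Lucene", "Lucen"], existing_pages))
```

Requiring a defining page is what catches the misspelled entry; a free-form keyword list would have accepted both.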



[jira] Commented: (JSPWIKI-427) Keywords for Lucene Index

Posted by "Florian Holeczek (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12654750#action_12654750 ] 

Florian Holeczek commented on JSPWIKI-427:
------------------------------------------

Definitely! As Janne mentioned, JSPWiki will get better metadata management in 3.0. It should then be quite easy to implement this feature on top of it.


[jira] Commented: (JSPWIKI-427) Keywords for Lucene Index

Posted by "Janne Jalkanen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646975#action_12646975 ] 

Janne Jalkanen commented on JSPWIKI-427:
----------------------------------------

Very good idea!  Do you have an idea for the syntax - or should we have a separate editor for it?

In 3.0, we can add arbitrary metadata to all pages, with or without separate editors.

Also, how would keywords relate to tags, and should they have some sort of RDF-like relations?  I would like to hear Murray's insights on how this sort of thing could be handled.


[jira] Commented: (JSPWIKI-427) Keywords for Lucene Index

Posted by "Kurt Stein (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JSPWIKI-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12654741#action_12654741 ] 

Kurt Stein commented on JSPWIKI-427:
------------------------------------

Is there any further interest in this issue?
