You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@forrest.apache.org by "David Crossley (JIRA)" <ji...@apache.org> on 2005/10/05 09:14:49 UTC

[jira] Created: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml

Our local search with Lucene gets failures at index time when remote files are declared in site.xml
---------------------------------------------------------------------------------------------------

         Key: FOR-703
         URL: http://issues.apache.org/jira/browse/FOR-703
     Project: Forrest
        Type: Bug
    Versions: 0.8-dev    
    Reporter: David Crossley


Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.

There are some examples in our site-author/content/site.xml for example ...
----------
<SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
... causes error because it tries to index this URI ...
/svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
----------

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Updated: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml

Posted by "Tim Williams (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/FOR-703?page=all ]

Tim Williams updated FOR-703:
-----------------------------

    Component: Core operations
               Other

> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
>          Key: FOR-703
>          URL: http://issues.apache.org/jira/browse/FOR-703
>      Project: Forrest
>         Type: Bug
>   Components: Other, Core operations
>     Versions: 0.8-dev
>     Reporter: David Crossley

>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Closed: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml

Posted by "Tim Williams (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Williams closed FOR-703.
----------------------------

       Resolution: Fixed
    Fix Version/s: 0.9-dev
         Assignee: Tim Williams

Fixed in r1032832 by not indexing URIs with ext, site protocols and lowering the priority of the long template match.

> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
>                 Key: FOR-703
>                 URL: https://issues.apache.org/jira/browse/FOR-703
>             Project: Forrest
>          Issue Type: Bug
>          Components: Core operations, Other
>    Affects Versions: 0.8
>            Reporter: David Crossley
>            Assignee: Tim Williams
>             Fix For: 0.9-dev
>
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml

Posted by "Dmitriy Kargapolov (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632462#action_12632462 ] 

Dmitriy Kargapolov commented on FOR-703:
----------------------------------------

I got similar issue which was fixed by patching book-to-cinclude-lucene.xsl:

--- book-to-cinclude-lucene.xsl.orig    2007-04-14 22:42:01.000000000 -0400
+++ book-to-cinclude-lucene.xsl 2008-09-18 19:37:51.000000000 -0400
@@ -59,6 +59,10 @@
 <!-- Ignore absolute http urls -->
   <xsl:template match="menu-item[starts-with(@href, 'https:')]"/>
 <!-- Ignore absolute https urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'site:')]"/>
+<!-- Ignore internal site: urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'ext:')]"/>
+<!-- Ignore external ext: urls -->
 <!-- For entries whose @href ends in "/", refer to @href/index.lucene -->
   <xsl:template match="menu-item[substring(@href, string-length(@href) - string-length('/') + 1) = '/']">
     <cinclude:include>

There is no much sense to try to index site: references (just duplicating info) or external links...

Regards.


> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
>                 Key: FOR-703
>                 URL: https://issues.apache.org/jira/browse/FOR-703
>             Project: Forrest
>          Issue Type: Bug
>          Components: Core operations, Other
>    Affects Versions: 0.8
>            Reporter: David Crossley
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.