Posted to dev@forrest.apache.org by "Dmitriy Kargapolov (JIRA)" <ji...@apache.org> on 2008/09/19 02:00:44 UTC

[jira] Commented: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml

    [ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632462#action_12632462 ] 

Dmitriy Kargapolov commented on FOR-703:
----------------------------------------

I ran into a similar issue, which I fixed by patching book-to-cinclude-lucene.xsl:

--- book-to-cinclude-lucene.xsl.orig    2007-04-14 22:42:01.000000000 -0400
+++ book-to-cinclude-lucene.xsl 2008-09-18 19:37:51.000000000 -0400
@@ -59,6 +59,10 @@
 <!-- Ignore absolute http urls -->
   <xsl:template match="menu-item[starts-with(@href, 'https:')]"/>
 <!-- Ignore absolute https urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'site:')]"/>
+<!-- Ignore internal site: urls -->
+  <xsl:template match="menu-item[starts-with(@href, 'ext:')]"/>
+<!-- Ignore external ext: urls -->
 <!-- For entries whose @href ends in "/", refer to @href/index.lucene -->
   <xsl:template match="menu-item[substring(@href, string-length(@href) - string-length('/') + 1) = '/']">
     <cinclude:include>

There is not much sense in trying to index site: references (that would just duplicate information already indexed) or external links.
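
To illustrate the effect of the patch outside of XSLT, here is a minimal Python sketch of the same filtering rule: a menu-item is skipped when its href starts with one of the ignored prefixes. This is not Forrest code. The function names and the sample book fragment are made up for the illustration, and the http: prefix is assumed from the comment in the diff context rather than shown in the patch itself.

```python
import xml.etree.ElementTree as ET
from typing import List

# Prefixes skipped by the patched stylesheet: https: appears in the diff
# context, http: is assumed from the context comment, and site: and ext:
# are the two prefixes added by the patch.
SKIPPED_PREFIXES = ("http:", "https:", "site:", "ext:")


def is_indexable(href: str) -> bool:
    """Return True if a menu-item href should be handed to the indexer."""
    return not href.startswith(SKIPPED_PREFIXES)


def indexable_hrefs(book_xml: str) -> List[str]:
    """Collect the hrefs of all menu-item elements that are not skipped."""
    root = ET.fromstring(book_xml)
    return [
        item.get("href", "")
        for item in root.iter("menu-item")
        if is_indexable(item.get("href", ""))
    ]


# Hypothetical book fragment, invented for this illustration only.
SAMPLE_BOOK = """
<book>
  <menu label="About">
    <menu-item label="Index" href="index.html"/>
    <menu-item label="Docs" href="docs/"/>
    <menu-item label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
    <menu-item label="FAQ" href="site:faq"/>
    <menu-item label="Cocoon" href="ext:cocoon"/>
  </menu>
</book>
"""

if __name__ == "__main__":
    # Only the two local entries remain.
    print(indexable_hrefs(SAMPLE_BOOK))  # ['index.html', 'docs/']
```

In the stylesheet the same result comes from empty templates: a matching menu-item produces no output, so no cinclude is generated for it.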

Regards.


> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
>                 Key: FOR-703
>                 URL: https://issues.apache.org/jira/browse/FOR-703
>             Project: Forrest
>          Issue Type: Bug
>          Components: Core operations, Other
>    Affects Versions: 0.8
>            Reporter: David Crossley
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene builds a wonky reference.
> There are some examples in our site-author/content/site.xml, for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------
