You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@forrest.apache.org by "David Crossley (JIRA)" <ji...@apache.org> on 2005/10/05 09:14:49 UTC
[jira] Created: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml
Our local search with Lucene gets failures at index time when remote files are declared in site.xml
---------------------------------------------------------------------------------------------------
Key: FOR-703
URL: http://issues.apache.org/jira/browse/FOR-703
Project: Forrest
Type: Bug
Versions: 0.8-dev
Reporter: David Crossley
Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
There are some examples in our site-author/content/site.xml for example ...
----------
<SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
... causes error because it tries to index this URI ...
/svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
----------
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
[jira] Updated: (FOR-703) Our local search with Lucene gets failures at index time when remote files are declared in site.xml
Posted by "Tim Williams (JIRA)" <ji...@apache.org>.
[ http://issues.apache.org/jira/browse/FOR-703?page=all ]
Tim Williams updated FOR-703:
-----------------------------
Component: Core operations
Other
> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
> Key: FOR-703
> URL: http://issues.apache.org/jira/browse/FOR-703
> Project: Forrest
> Type: Bug
> Components: Other, Core operations
> Versions: 0.8-dev
> Reporter: David Crossley
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
[jira] Closed: (FOR-703) Our local search with Lucene gets failures
at index time when remote files are declared in site.xml
Posted by "Tim Williams (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Williams closed FOR-703.
----------------------------
Resolution: Fixed
Fix Version/s: 0.9-dev
Assignee: Tim Williams
Fixed in r1032832 by not indexing URIs with ext, site protocols and lowering the priority of the long template match.
> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
> Key: FOR-703
> URL: https://issues.apache.org/jira/browse/FOR-703
> Project: Forrest
> Issue Type: Bug
> Components: Core operations, Other
> Affects Versions: 0.8
> Reporter: David Crossley
> Assignee: Tim Williams
> Fix For: 0.9-dev
>
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (FOR-703) Our local search with Lucene gets
failures at index time when remote files are declared in site.xml
Posted by "Dmitriy Kargapolov (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632462#action_12632462 ]
Dmitriy Kargapolov commented on FOR-703:
----------------------------------------
I got similar issue which was fixed by patching book-to-cinclude-lucene.xsl:
--- book-to-cinclude-lucene.xsl.orig 2007-04-14 22:42:01.000000000 -0400
+++ book-to-cinclude-lucene.xsl 2008-09-18 19:37:51.000000000 -0400
@@ -59,6 +59,10 @@
<!-- Ignore absolute http urls -->
<xsl:template match="menu-item[starts-with(@href, 'https:')]"/>
<!-- Ignore absolute https urls -->
+ <xsl:template match="menu-item[starts-with(@href, 'site:')]"/>
+<!-- Ignore internal site: urls -->
+ <xsl:template match="menu-item[starts-with(@href, 'ext:')]"/>
+<!-- Ignore external ext: urls -->
<!-- For entries whose @href ends in "/", refer to @href/index.lucene -->
<xsl:template match="menu-item[substring(@href, string-length(@href) - string-length('/') + 1) = '/']">
<cinclude:include>
There is no much sense to try to index site: references (just duplicating info) or external links...
Regards.
> Our local search with Lucene gets failures at index time when remote files are declared in site.xml
> ---------------------------------------------------------------------------------------------------
>
> Key: FOR-703
> URL: https://issues.apache.org/jira/browse/FOR-703
> Project: Forrest
> Issue Type: Bug
> Components: Core operations, Other
> Affects Versions: 0.8
> Reporter: David Crossley
>
> Our Lucene search, when building its index, follows our site.xml (linkmap) to gather the files to be indexed. If the entries include a link to a remote site, then Lucene build a wonky reference.
> There are some examples in our site-author/content/site.xml for example ...
> ----------
> <SVN label="Browse SVN" href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/>
> ... causes error because it tries to index this URI ...
> /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml
> ----------
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.