You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@lenya.apache.org by gr...@apache.org on 2004/12/05 04:50:07 UTC
svn commit: r109849 - /lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml
Author: gregor
Date: Sat Dec 4 19:50:06 2004
New Revision: 109849
URL: http://svn.apache.org/viewcvs?view=rev&rev=109849
Log:
Expanded Lucene documentation.
Modified:
lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml
Modified: lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml
Url: http://svn.apache.org/viewcvs/lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml?view=diff&rev=109849&p1=lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml&r1=109848&p2=lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml&r2=109849
==============================================================================
--- lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml (original)
+++ lenya/docu/src/documentation/content/xdocs/docs/1_2_x/components/search/lucene.xml Sat Dec 4 19:50:06 2004
@@ -32,49 +32,19 @@
<body>
<section>
-<title>Generic Search</title>
-<p>URL:</p>
-<source>
-<![CDATA[
-/lenya/$PUB_ID/search-$AREA/lucene
-]]>
-</source>
-
-<p>Indices and Excerpts:</p>
-<source>
-<![CDATA[
-src/webapp/lenya/pubs/$PUB_ID/work/search/index/$AREA/index
-src/webapp/lenya/pubs/$PUB_ID/work/search/htdocs_dump/$AREA
-]]>
-</source>
-
-<p>Configuration:</p>
-<source>
-<![CDATA[
-src/webapp/global-sitemap.xmap
-src/webapp/lenya/lucene.xmap
-]]>
-</source>
-</section>
+<title>Overview</title>
+<p>There are two URL for the search screen relative to your publication:
+ <code>search-live/lucene</code> to search the live area, <code>search-authoring/lucene</code> to
+ search the authoring area of your publication.</p>
+<p>If you want to customize the layout of the search screen for your publication,
+ place a stylesheet at <code>lenya/xslt/search/search-and-results.xsl</code>
+ relative to your publication root.</p>
+<p>Lucene indices are stored within the <code>work/search/index/$AREA/index</code> directory of your
+ publication. The <code>work/search/htdocs_dump/$AREA</code> directory holds content from crawling (see below).</p>
-<section>
-<title>Customizing/Overwriting Generic Search Interface</title>
-<p>XSLT:</p>
-<source>
-<![CDATA[
-src/webapp/lenya/pubs/$PUB_ID/lenya/xslt/search/search-and-results.xsl
-]]>
-</source>
-
-<p>URL:</p>
-<source>
-<![CDATA[
-/lenya/$PUB_ID/search-$AREA/lucene
-]]>
-</source>
+<p>The search pipelines are defined within <code>global-sitemap.xmap</code> and <code>lucene.xmap</code></p>
</section>
-
<section>
<title>Crawling a website</title>
<p>
@@ -82,11 +52,12 @@
</p>
<source>
<![CDATA[
-ant -f src/webapp/lenya/bin/crawl_and_index.xml crawl -Dcrawler.xconf=/home/username/src/lenya/src/webapp/lenya/pubs/default/config/search/crawler-live.xconf
+ant -f build/lenya/webapp/lenya/bin/crawl_and_index.xml -Dcrawler.xconf=build/lenya/webapp/lenya/pubs/default/config/search/crawler-live.xconf crawl
]]>
</source>
<p>
-whereas the crawler.xconf has the following elements
+Note that there is a search.properties file in build/lenya/webapp/lenya/bin that you may have to change.
+crawler.xconf needs to have the following elements:
</p>
<source>
<![CDATA[
@@ -104,7 +75,7 @@
]]>
</source>
<p>
-where the element robots is optional.
+the robots element is optional.
</p>
<p>
In case you don't have access to the server and want to disallow certain URLs from being crawled, then
@@ -128,11 +99,12 @@
<title>Creating an index from the command line</title>
<source>
<![CDATA[
-ant -f src/webapp/lenya/bin/crawl_and_index.xml -Dlucene.xconf=/home/username/src/lenya/src/webapp/lenya/pubs/default/config/search/lucene-live.xconf index
+ant -f build/lenya/webapp/lenya/bin/crawl_and_index.xml -Dlucene.xconf=build/lenya/webapp/lenya/pubs/default/config/search/lucene-live.xconf index
]]>
</source>
<p>
-whereas the lucene.xconf has the following elements
+Note that there is a search.properties file in build/lenya/webapp/lenya/bin that you may have to change.
+lucene-live.xconf has the following elements
</p>
<source>
<![CDATA[
@@ -146,18 +118,6 @@
<htdocs-dump-dir src="../../work/search/lucene/htdocs_dump"/>
<indexer class="org.apache.lenya.lucene.index.DefaultIndexer"/>
-<!--
- <indexer class="org.apache.lenya.lucene.index.ConfigurableIndexer">
- <configuration src="cmfs-luceneDoc.xconf"/>
- <extensions src="xml"/>
- </indexer>
--->
-<!--
- <indexer class="org.apache.lenya.lucene.index.ConfigurableIndexer">
- <configuration src="cmfs-luceneDoc.xconf"/>
- <filter class="foo.bar.FileFilter"/>
- </indexer>
--->
</lucene>
]]>
</source>
@@ -215,7 +175,7 @@
<title>Extract text from a PDF document</title>
<source>
<![CDATA[
-ant -f src/webapp/lenya/bin/crawl_and_index.xml -Dhtdocs.dump.dir=/home/username/src/lenya/src/webapp/lenya/pubs/default/work/search/lucene/htdocs_dump xpdf
+ant -f build/lenya/webapp/lenya/bin/crawl_and_index.xml -Dhtdocs.dump.dir=build/lenya/webapp/lenya/pubs/default/work/search/lucene/htdocs_dump xpdf
]]>
</source>
<p>
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@lenya.apache.org
For additional commands, e-mail: commits-help@lenya.apache.org