You are viewing a plain text version of this content. The canonical link for it is here.

Posted to solr-commits@lucene.apache.org by yo...@apache.org on 2009/10/31 02:37:35 UTC

svn commit: r831504 - in /lucene/solr/trunk: site/features.html site/features.pdf src/site/src/documentation/content/xdocs/features.xml

Author: yonik
Date: Sat Oct 31 01:37:34 2009
New Revision: 831504

URL: http://svn.apache.org/viewvc?rev=831504&view=rev
Log:
doc: website features update

Modified:
    lucene/solr/trunk/site/features.html
    lucene/solr/trunk/site/features.pdf
    lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml

Modified: lucene/solr/trunk/site/features.html
URL: http://svn.apache.org/viewvc/lucene/solr/trunk/site/features.html?rev=831504&r1=831503&r2=831504&view=diff
==============================================================================
--- lucene/solr/trunk/site/features.html (original)
+++ lucene/solr/trunk/site/features.html Sat Oct 31 01:37:34 2009
@@ -224,7 +224,7 @@
     
 <li> Optimized for High Volume Web Traffic </li>
     
-<li> Standards Based Open Interfaces - XML and HTTP </li>
+<li> Standards Based Open Interfaces - XML,JSON and HTTP </li>
     
 <li> Comprehensive HTML Administration Interfaces </li>
     
@@ -249,7 +249,7 @@
     
 <li> Powerful Extensions to the Lucene Query Language </li>
     
-<li> Support for Dynamic Faceted Browsing and Filtering </li>
+<li> Faceted Search and Filtering </li>
     
 <li> Advanced, Configurable Text Analysis </li>
     
@@ -263,24 +263,26 @@
     
 <li> Monitorable Logging </li>
     
-<li> Fast Incremental Updates and Snapshot Distribution </li>
+<li> Fast Incremental Updates and Index Replication </li>
     
-<li> Distributed search with sharded index on multiple hosts </li>
+<li> Highly Scalable Distributed search with sharded index across multiple hosts </li>
     
-<li> XML and CSV/delimited-text update formats </li>
+<li> XML, CSV/delimited-text, and binary update formats </li>
     
 <li> Easy ways to pull in data from databases and XML files from local disk and HTTP sources </li>
     
+<li> Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika </li>
+    
 <li> Multiple search indices </li>
   
 </ul>
 </div>
 
 
-<a name="N10066"></a><a name="Detailed+Features"></a>
+<a name="N10069"></a><a name="Detailed+Features"></a>
 <h2 class="boxed">Detailed Features</h2>
 <div class="section">
-<a name="N1006C"></a><a name="Schema"></a>
+<a name="N1006F"></a><a name="Schema"></a>
 <h3 class="boxed">Schema</h3>
 <ul>
       
@@ -301,11 +303,11 @@
 <li>Many additional text analysis components including word splitting, regex and sounds-like filters</li>
     
 </ul>
-<a name="N1008D"></a><a name="Query"></a>
+<a name="N10090"></a><a name="Query"></a>
 <h3 class="boxed">Query</h3>
 <ul>
       
-<li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby)</li>
+<li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, binary)</li>
       
 <li>Sort by any number of fields</li>
       
@@ -313,49 +315,66 @@
       
 <li>Highlighted context snippets</li>
       
-<li>Faceted Searching based on unique field values and explicit queries</li>
+<li>Faceted Searching based on unique field values, explicit queries, or date ranges</li>
+      
+<li>Multi-Select Faceting by tagging and selectively excluding filters</li>
       
 <li>Spelling suggestions for user queries</li>
       
 <li>More Like This suggestions for given document</li>
       
-<li>Constant scoring range and prefix queries - no idf, coord, or lengthNorm factors, and no restriction on the number of terms the query matches.</li>
+<li>Function Query - influence the score by user specified complex functions of
+	     numeric fields or query relevancy scores.</li>
       
-<li>Function Query - influence the score by a function of a field's numeric value or ordinal</li>
+<li>Range filter over Function Query results</li>
       
 <li>Date Math - specify dates relative to "NOW" in queries and updates</li>
       
+<li>Dynamic search results clustering using Carrot2</li>
+      
+<li>Numeric field statistics such as min, max, average, standard deviation </li>
+      
+<li>Combine queries derived from different syntaxes</li>
+      
+<li>Auto-suggest functionality</li>
+      
+<li>Allow configuration of top results for a query, overriding normal scoring and sorting</li>
+      
 <li>Performance Optimizations</li>
     
 </ul>
-<a name="N100B7"></a><a name="Core"></a>
+<a name="N100CC"></a><a name="Core"></a>
 <h3 class="boxed">Core</h3>
 <ul>
       
+<li>Dynamically create and delete document collections without restarting</li>
+      
 <li>Pluggable query handlers and extensible XML data format</li>
       
-<li>Document uniqueness enforcement based on unique key field</li>
+<li>Pluggable user functions for Function Query</li>
       
-<li>Batches updates and deletes for high performance</li>
+<li>Customizable component based request handler with distributed search support</li>
       
-<li>User configurable commands triggered on index changes</li>
+<li>Document uniqueness enforcement based on unique key field</li>
       
-<li>Searcher concurrency control</li>
+<li>Duplicate document detection, including fuzzy near duplicates</li>
       
-<li>Correct handling of numeric types for both sorting and range queries</li>
+<li>Custom index processing chains, allowing document manipulation before indexing</li>
+      
+<li>User configurable commands triggered on index changes</li>
       
 <li>Ability to control where docs with the sort field missing will be placed</li>
       
 <li>"Luke" request handler for corpus information</li>
     
 </ul>
-<a name="N100D8"></a><a name="Caching"></a>
+<a name="N100F3"></a><a name="Caching"></a>
 <h3 class="boxed">Caching</h3>
 <ul>
       
 <li>Configurable Query Result, Filter, and Document cache instances</li>
       
-<li>Pluggable Cache implementations</li>
+<li>Pluggable Cache implementations, including a lock free, high concurrency implementation</li>
       
 <li>Cache warming in background
         <ul>
@@ -371,7 +390,7 @@
         <ul>
           
 <li>The most recently accessed items in the caches of the current
-            searcher are re-populated in the new searcher, enabing high cache hit
+            searcher are re-populated in the new searcher, enabling high cache hit
             rates across index/searcher changes.</li>
         
 </ul>
@@ -383,23 +402,31 @@
 <li>User level caching with autowarming support</li>
     
 </ul>
-<a name="N100FD"></a><a name="Replication"></a>
+<a name="N10118"></a><a name="Replication"></a>
 <h3 class="boxed">Replication</h3>
 <ul>
       
-<li>Efficient distribution of index parts that have changed via rsync transport</li>
+<li>Efficient distribution of index parts that have changed</li>
       
 <li>Pull strategy allows for easy addition of searchers</li>
       
 <li>Configurable distribution interval allows tradeoff between timeliness and cache utilization</li>
+      
+<li>Replication and automatic reloading of configuration files</li>
     
 </ul>
-<a name="N1010F"></a><a name="Admin+Interface"></a>
+<a name="N1012D"></a><a name="Admin+Interface"></a>
 <h3 class="boxed">Admin Interface</h3>
 <ul>
       
 <li>Comprehensive statistics on cache utilization, updates, and queries</li>
       
+<li>Interactive schema browser that includes index statistics</li>
+      
+<li>Replication monitoring</li>
+      
+<li>Full logging control</li>
+      
 <li>Text analysis debugger, showing result of every stage in an analyzer</li>
       
 <li>Web Query Interface w/ debugging output

Modified: lucene/solr/trunk/site/features.pdf
URL: http://svn.apache.org/viewvc/lucene/solr/trunk/site/features.pdf?rev=831504&r1=831503&r2=831504&view=diff
==============================================================================
Binary files - no diff available.

Modified: lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml
URL: http://svn.apache.org/viewvc/lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml?rev=831504&r1=831503&r2=831504&view=diff
==============================================================================
--- lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml (original)
+++ lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml Sat Oct 31 01:37:34 2009
@@ -33,7 +33,7 @@
   <ul>
     <li> Advanced Full-Text Search Capabilities </li>
     <li> Optimized for High Volume Web Traffic </li>
-    <li> Standards Based Open Interfaces - XML and HTTP </li>
+    <li> Standards Based Open Interfaces - XML,JSON and HTTP </li>
     <li> Comprehensive HTML Administration Interfaces </li>
     <li> Server statistics exposed over JMX for monitoring </li>
     <li> Scalability - Efficient Replication to other Solr Search Servers </li>
@@ -47,17 +47,18 @@
   <ul>
     <li> A Real Data Schema, with Numeric Types, Dynamic Fields, Unique Keys </li>
     <li> Powerful Extensions to the Lucene Query Language </li>
-    <li> Support for Dynamic Faceted Browsing and Filtering </li>
+    <li> Faceted Search and Filtering </li>
     <li> Advanced, Configurable Text Analysis </li>
     <li> Highly Configurable and User Extensible Caching </li>
     <li> Performance Optimizations </li>
     <li> External Configuration via XML </li>
     <li> An Administration Interface </li>
     <li> Monitorable Logging </li>
-    <li> Fast Incremental Updates and Snapshot Distribution </li>
-    <li> Distributed search with sharded index on multiple hosts </li>
-    <li> XML and CSV/delimited-text update formats </li>
+    <li> Fast Incremental Updates and Index Replication </li>
+    <li> Highly Scalable Distributed search with sharded index across multiple hosts </li>
+    <li> XML, CSV/delimited-text, and binary update formats </li>
     <li> Easy ways to pull in data from databases and XML files from local disk and HTTP sources </li>
+    <li> Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika </li>
     <li> Multiple search indices </li>
   </ul>
 </section>
@@ -80,28 +81,37 @@
 
   <section><title>Query</title>
     <ul>
-      <li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby)</li>
+      <li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, binary)</li>
       <li>Sort by any number of fields</li>
       <li>Advanced DisMax query parser for high relevancy results from user-entered queries</li> 
       <li>Highlighted context snippets</li>
-      <li>Faceted Searching based on unique field values and explicit queries</li>
+      <li>Faceted Searching based on unique field values, explicit queries, or date ranges</li>
+      <li>Multi-Select Faceting by tagging and selectively excluding filters</li>
       <li>Spelling suggestions for user queries</li>
       <li>More Like This suggestions for given document</li>
-      <li>Constant scoring range and prefix queries - no idf, coord, or lengthNorm factors, and no restriction on the number of terms the query matches.</li>
-      <li>Function Query - influence the score by a function of a field's numeric value or ordinal</li>
+      <li>Function Query - influence the score by user specified complex functions of
+	     numeric fields or query relevancy scores.</li>
+      <li>Range filter over Function Query results</li>
       <li>Date Math - specify dates relative to "NOW" in queries and updates</li>
+      <li>Dynamic search results clustering using Carrot2</li>
+      <li>Numeric field statistics such as min, max, average, standard deviation </li>
+      <li>Combine queries derived from different syntaxes</li>
+      <li>Auto-suggest functionality</li>
+      <li>Allow configuration of top results for a query, overriding normal scoring and sorting</li>
       <li>Performance Optimizations</li>
     </ul>
   </section>
 
   <section><title>Core</title>
     <ul>
+      <li>Dynamically create and delete document collections without restarting</li>
       <li>Pluggable query handlers and extensible XML data format</li>
+      <li>Pluggable user functions for Function Query</li>
+      <li>Customizable component based request handler with distributed search support</li>
       <li>Document uniqueness enforcement based on unique key field</li>
-      <li>Batches updates and deletes for high performance</li>
+      <li>Duplicate document detection, including fuzzy near duplicates</li>
+      <li>Custom index processing chains, allowing document manipulation before indexing</li>
       <li>User configurable commands triggered on index changes</li>
-      <li>Searcher concurrency control</li>
-      <li>Correct handling of numeric types for both sorting and range queries</li>
       <li>Ability to control where docs with the sort field missing will be placed</li>
       <li>"Luke" request handler for corpus information</li>
     </ul>
@@ -110,7 +120,7 @@
   <section><title>Caching</title>
     <ul>
       <li>Configurable Query Result, Filter, and Document cache instances</li>
-      <li>Pluggable Cache implementations</li>
+      <li>Pluggable Cache implementations, including a lock free, high concurrency implementation</li>
       <li>Cache warming in background
         <ul><li> When a new searcher is opened, configurable searches are run against
             it in order to warm it up to avoid
@@ -120,7 +130,7 @@
       <li>Autowarming in background
         <ul>
           <li>The most recently accessed items in the caches of the current
-            searcher are re-populated in the new searcher, enabing high cache hit
+            searcher are re-populated in the new searcher, enabling high cache hit
             rates across index/searcher changes.</li>
         </ul>
       </li>
@@ -131,15 +141,19 @@
 
   <section><title>Replication</title>
     <ul>
-      <li>Efficient distribution of index parts that have changed via rsync transport</li>
+      <li>Efficient distribution of index parts that have changed</li>
       <li>Pull strategy allows for easy addition of searchers</li>
       <li>Configurable distribution interval allows tradeoff between timeliness and cache utilization</li>
+      <li>Replication and automatic reloading of configuration files</li>
     </ul>
   </section>
 
   <section><title>Admin Interface</title>
     <ul>
       <li>Comprehensive statistics on cache utilization, updates, and queries</li>
+      <li>Interactive schema browser that includes index statistics</li>
+      <li>Replication monitoring</li>
+      <li>Full logging control</li>
       <li>Text analysis debugger, showing result of every stage in an analyzer</li>
       <li>Web Query Interface w/ debugging output
         <ul>

Re: svn commit: r831504 - in /lucene/solr/trunk: site/features.html site/features.pdf src/site/src/documentation/content/xdocs/features.xml

Posted by Yonik Seeley <yo...@lucidimagination.com>.

I took a shot at updating the feature list.
The framework way back from Solr 1.0, and it's getting pretty crusty.
We could really use a new feature matrix or something like that.
Anyway, any suggested additions/changes?  What about DIH? (I don't pay
close attention to that area).

-Yonik
http://www.lucidimagination.com


On Fri, Oct 30, 2009 at 9:37 PM,  <yo...@apache.org> wrote:
> Author: yonik
> Date: Sat Oct 31 01:37:34 2009
> New Revision: 831504
>
> URL: http://svn.apache.org/viewvc?rev=831504&view=rev
> Log:
> doc: website features update
>
> Modified:
>    lucene/solr/trunk/site/features.html
>    lucene/solr/trunk/site/features.pdf
>    lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml
>
> Modified: lucene/solr/trunk/site/features.html
> URL: http://svn.apache.org/viewvc/lucene/solr/trunk/site/features.html?rev=831504&r1=831503&r2=831504&view=diff
> ==============================================================================
> --- lucene/solr/trunk/site/features.html (original)
> +++ lucene/solr/trunk/site/features.html Sat Oct 31 01:37:34 2009
> @@ -224,7 +224,7 @@
>
>  <li> Optimized for High Volume Web Traffic </li>
>
> -<li> Standards Based Open Interfaces - XML and HTTP </li>
> +<li> Standards Based Open Interfaces - XML,JSON and HTTP </li>
>
>  <li> Comprehensive HTML Administration Interfaces </li>
>
> @@ -249,7 +249,7 @@
>
>  <li> Powerful Extensions to the Lucene Query Language </li>
>
> -<li> Support for Dynamic Faceted Browsing and Filtering </li>
> +<li> Faceted Search and Filtering </li>
>
>  <li> Advanced, Configurable Text Analysis </li>
>
> @@ -263,24 +263,26 @@
>
>  <li> Monitorable Logging </li>
>
> -<li> Fast Incremental Updates and Snapshot Distribution </li>
> +<li> Fast Incremental Updates and Index Replication </li>
>
> -<li> Distributed search with sharded index on multiple hosts </li>
> +<li> Highly Scalable Distributed search with sharded index across multiple hosts </li>
>
> -<li> XML and CSV/delimited-text update formats </li>
> +<li> XML, CSV/delimited-text, and binary update formats </li>
>
>  <li> Easy ways to pull in data from databases and XML files from local disk and HTTP sources </li>
>
> +<li> Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika </li>
> +
>  <li> Multiple search indices </li>
>
>  </ul>
>  </div>
>
>
> -<a name="N10066"></a><a name="Detailed+Features"></a>
> +<a name="N10069"></a><a name="Detailed+Features"></a>
>  <h2 class="boxed">Detailed Features</h2>
>  <div class="section">
> -<a name="N1006C"></a><a name="Schema"></a>
> +<a name="N1006F"></a><a name="Schema"></a>
>  <h3 class="boxed">Schema</h3>
>  <ul>
>
> @@ -301,11 +303,11 @@
>  <li>Many additional text analysis components including word splitting, regex and sounds-like filters</li>
>
>  </ul>
> -<a name="N1008D"></a><a name="Query"></a>
> +<a name="N10090"></a><a name="Query"></a>
>  <h3 class="boxed">Query</h3>
>  <ul>
>
> -<li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby)</li>
> +<li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, binary)</li>
>
>  <li>Sort by any number of fields</li>
>
> @@ -313,49 +315,66 @@
>
>  <li>Highlighted context snippets</li>
>
> -<li>Faceted Searching based on unique field values and explicit queries</li>
> +<li>Faceted Searching based on unique field values, explicit queries, or date ranges</li>
> +
> +<li>Multi-Select Faceting by tagging and selectively excluding filters</li>
>
>  <li>Spelling suggestions for user queries</li>
>
>  <li>More Like This suggestions for given document</li>
>
> -<li>Constant scoring range and prefix queries - no idf, coord, or lengthNorm factors, and no restriction on the number of terms the query matches.</li>
> +<li>Function Query - influence the score by user specified complex functions of
> +            numeric fields or query relevancy scores.</li>
>
> -<li>Function Query - influence the score by a function of a field's numeric value or ordinal</li>
> +<li>Range filter over Function Query results</li>
>
>  <li>Date Math - specify dates relative to "NOW" in queries and updates</li>
>
> +<li>Dynamic search results clustering using Carrot2</li>
> +
> +<li>Numeric field statistics such as min, max, average, standard deviation </li>
> +
> +<li>Combine queries derived from different syntaxes</li>
> +
> +<li>Auto-suggest functionality</li>
> +
> +<li>Allow configuration of top results for a query, overriding normal scoring and sorting</li>
> +
>  <li>Performance Optimizations</li>
>
>  </ul>
> -<a name="N100B7"></a><a name="Core"></a>
> +<a name="N100CC"></a><a name="Core"></a>
>  <h3 class="boxed">Core</h3>
>  <ul>
>
> +<li>Dynamically create and delete document collections without restarting</li>
> +
>  <li>Pluggable query handlers and extensible XML data format</li>
>
> -<li>Document uniqueness enforcement based on unique key field</li>
> +<li>Pluggable user functions for Function Query</li>
>
> -<li>Batches updates and deletes for high performance</li>
> +<li>Customizable component based request handler with distributed search support</li>
>
> -<li>User configurable commands triggered on index changes</li>
> +<li>Document uniqueness enforcement based on unique key field</li>
>
> -<li>Searcher concurrency control</li>
> +<li>Duplicate document detection, including fuzzy near duplicates</li>
>
> -<li>Correct handling of numeric types for both sorting and range queries</li>
> +<li>Custom index processing chains, allowing document manipulation before indexing</li>
> +
> +<li>User configurable commands triggered on index changes</li>
>
>  <li>Ability to control where docs with the sort field missing will be placed</li>
>
>  <li>"Luke" request handler for corpus information</li>
>
>  </ul>
> -<a name="N100D8"></a><a name="Caching"></a>
> +<a name="N100F3"></a><a name="Caching"></a>
>  <h3 class="boxed">Caching</h3>
>  <ul>
>
>  <li>Configurable Query Result, Filter, and Document cache instances</li>
>
> -<li>Pluggable Cache implementations</li>
> +<li>Pluggable Cache implementations, including a lock free, high concurrency implementation</li>
>
>  <li>Cache warming in background
>         <ul>
> @@ -371,7 +390,7 @@
>         <ul>
>
>  <li>The most recently accessed items in the caches of the current
> -            searcher are re-populated in the new searcher, enabing high cache hit
> +            searcher are re-populated in the new searcher, enabling high cache hit
>             rates across index/searcher changes.</li>
>
>  </ul>
> @@ -383,23 +402,31 @@
>  <li>User level caching with autowarming support</li>
>
>  </ul>
> -<a name="N100FD"></a><a name="Replication"></a>
> +<a name="N10118"></a><a name="Replication"></a>
>  <h3 class="boxed">Replication</h3>
>  <ul>
>
> -<li>Efficient distribution of index parts that have changed via rsync transport</li>
> +<li>Efficient distribution of index parts that have changed</li>
>
>  <li>Pull strategy allows for easy addition of searchers</li>
>
>  <li>Configurable distribution interval allows tradeoff between timeliness and cache utilization</li>
> +
> +<li>Replication and automatic reloading of configuration files</li>
>
>  </ul>
> -<a name="N1010F"></a><a name="Admin+Interface"></a>
> +<a name="N1012D"></a><a name="Admin+Interface"></a>
>  <h3 class="boxed">Admin Interface</h3>
>  <ul>
>
>  <li>Comprehensive statistics on cache utilization, updates, and queries</li>
>
> +<li>Interactive schema browser that includes index statistics</li>
> +
> +<li>Replication monitoring</li>
> +
> +<li>Full logging control</li>
> +
>  <li>Text analysis debugger, showing result of every stage in an analyzer</li>
>
>  <li>Web Query Interface w/ debugging output
>
> Modified: lucene/solr/trunk/site/features.pdf
> URL: http://svn.apache.org/viewvc/lucene/solr/trunk/site/features.pdf?rev=831504&r1=831503&r2=831504&view=diff
> ==============================================================================
> Binary files - no diff available.
>
> Modified: lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml
> URL: http://svn.apache.org/viewvc/lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml?rev=831504&r1=831503&r2=831504&view=diff
> ==============================================================================
> --- lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml (original)
> +++ lucene/solr/trunk/src/site/src/documentation/content/xdocs/features.xml Sat Oct 31 01:37:34 2009
> @@ -33,7 +33,7 @@
>   <ul>
>     <li> Advanced Full-Text Search Capabilities </li>
>     <li> Optimized for High Volume Web Traffic </li>
> -    <li> Standards Based Open Interfaces - XML and HTTP </li>
> +    <li> Standards Based Open Interfaces - XML,JSON and HTTP </li>
>     <li> Comprehensive HTML Administration Interfaces </li>
>     <li> Server statistics exposed over JMX for monitoring </li>
>     <li> Scalability - Efficient Replication to other Solr Search Servers </li>
> @@ -47,17 +47,18 @@
>   <ul>
>     <li> A Real Data Schema, with Numeric Types, Dynamic Fields, Unique Keys </li>
>     <li> Powerful Extensions to the Lucene Query Language </li>
> -    <li> Support for Dynamic Faceted Browsing and Filtering </li>
> +    <li> Faceted Search and Filtering </li>
>     <li> Advanced, Configurable Text Analysis </li>
>     <li> Highly Configurable and User Extensible Caching </li>
>     <li> Performance Optimizations </li>
>     <li> External Configuration via XML </li>
>     <li> An Administration Interface </li>
>     <li> Monitorable Logging </li>
> -    <li> Fast Incremental Updates and Snapshot Distribution </li>
> -    <li> Distributed search with sharded index on multiple hosts </li>
> -    <li> XML and CSV/delimited-text update formats </li>
> +    <li> Fast Incremental Updates and Index Replication </li>
> +    <li> Highly Scalable Distributed search with sharded index across multiple hosts </li>
> +    <li> XML, CSV/delimited-text, and binary update formats </li>
>     <li> Easy ways to pull in data from databases and XML files from local disk and HTTP sources </li>
> +    <li> Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika </li>
>     <li> Multiple search indices </li>
>   </ul>
>  </section>
> @@ -80,28 +81,37 @@
>
>   <section><title>Query</title>
>     <ul>
> -      <li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby)</li>
> +      <li>HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, binary)</li>
>       <li>Sort by any number of fields</li>
>       <li>Advanced DisMax query parser for high relevancy results from user-entered queries</li>
>       <li>Highlighted context snippets</li>
> -      <li>Faceted Searching based on unique field values and explicit queries</li>
> +      <li>Faceted Searching based on unique field values, explicit queries, or date ranges</li>
> +      <li>Multi-Select Faceting by tagging and selectively excluding filters</li>
>       <li>Spelling suggestions for user queries</li>
>       <li>More Like This suggestions for given document</li>
> -      <li>Constant scoring range and prefix queries - no idf, coord, or lengthNorm factors, and no restriction on the number of terms the query matches.</li>
> -      <li>Function Query - influence the score by a function of a field's numeric value or ordinal</li>
> +      <li>Function Query - influence the score by user specified complex functions of
> +            numeric fields or query relevancy scores.</li>
> +      <li>Range filter over Function Query results</li>
>       <li>Date Math - specify dates relative to "NOW" in queries and updates</li>
> +      <li>Dynamic search results clustering using Carrot2</li>
> +      <li>Numeric field statistics such as min, max, average, standard deviation </li>
> +      <li>Combine queries derived from different syntaxes</li>
> +      <li>Auto-suggest functionality</li>
> +      <li>Allow configuration of top results for a query, overriding normal scoring and sorting</li>
>       <li>Performance Optimizations</li>
>     </ul>
>   </section>
>
>   <section><title>Core</title>
>     <ul>
> +      <li>Dynamically create and delete document collections without restarting</li>
>       <li>Pluggable query handlers and extensible XML data format</li>
> +      <li>Pluggable user functions for Function Query</li>
> +      <li>Customizable component based request handler with distributed search support</li>
>       <li>Document uniqueness enforcement based on unique key field</li>
> -      <li>Batches updates and deletes for high performance</li>
> +      <li>Duplicate document detection, including fuzzy near duplicates</li>
> +      <li>Custom index processing chains, allowing document manipulation before indexing</li>
>       <li>User configurable commands triggered on index changes</li>
> -      <li>Searcher concurrency control</li>
> -      <li>Correct handling of numeric types for both sorting and range queries</li>
>       <li>Ability to control where docs with the sort field missing will be placed</li>
>       <li>"Luke" request handler for corpus information</li>
>     </ul>
> @@ -110,7 +120,7 @@
>   <section><title>Caching</title>
>     <ul>
>       <li>Configurable Query Result, Filter, and Document cache instances</li>
> -      <li>Pluggable Cache implementations</li>
> +      <li>Pluggable Cache implementations, including a lock free, high concurrency implementation</li>
>       <li>Cache warming in background
>         <ul><li> When a new searcher is opened, configurable searches are run against
>             it in order to warm it up to avoid
> @@ -120,7 +130,7 @@
>       <li>Autowarming in background
>         <ul>
>           <li>The most recently accessed items in the caches of the current
> -            searcher are re-populated in the new searcher, enabing high cache hit
> +            searcher are re-populated in the new searcher, enabling high cache hit
>             rates across index/searcher changes.</li>
>         </ul>
>       </li>
> @@ -131,15 +141,19 @@
>
>   <section><title>Replication</title>
>     <ul>
> -      <li>Efficient distribution of index parts that have changed via rsync transport</li>
> +      <li>Efficient distribution of index parts that have changed</li>
>       <li>Pull strategy allows for easy addition of searchers</li>
>       <li>Configurable distribution interval allows tradeoff between timeliness and cache utilization</li>
> +      <li>Replication and automatic reloading of configuration files</li>
>     </ul>
>   </section>
>
>   <section><title>Admin Interface</title>
>     <ul>
>       <li>Comprehensive statistics on cache utilization, updates, and queries</li>
> +      <li>Interactive schema browser that includes index statistics</li>
> +      <li>Replication monitoring</li>
> +      <li>Full logging control</li>
>       <li>Text analysis debugger, showing result of every stage in an analyzer</li>
>       <li>Web Query Interface w/ debugging output
>         <ul>
>
>
>