You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2012/06/15 10:41:39 UTC

[Solr Wiki] Update of "FrontPage" by LanceXNorskog

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "FrontPage" page has been changed by LanceXNorskog:
http://wiki.apache.org/solr/FrontPage?action=diff&rev1=288&rev2=289

Comment:
Create 'Advanced Tools' section describing dope shit: Clustering, SolrUIMA and Business Rules were isolated pages. Moved language detection to it.

    * AnalysisRequestHandler - Analyzing documents without indexing
    * [[ExtractingRequestHandler|Solr Content Extraction Library (Solr Cell)]] - Covers how to index MS Word, PDF, etc. using Solr Cell (a.k.a. !ExtractingRequestHandler).  Also see the older version at UpdateRichDocuments
    * [[UpdateRequestProcessor|Update Processors]] - Update Processors define how an update request is processed.
-    * [[Deduplication]] - prevent or tag duplicate documents
+   * [[Deduplication]] - Prevent or tag duplicate documents
-   * LanguageDetection - Identify the language of documents and mark them.
   * Searching Solr
    * [[SolrRequestHandler|Request Handlers]] - Control the logic used to process requests.  Several different Request Handlers are included with Solr, or you can write your own custom implementation.
-   * [[QueryResponseWriter|Response Writers]] - Control the formating of the responses generated by Request Handlers.  Several different Response Writers are included with Solr, or you can write your own custom implementation.
+   * [[QueryResponseWriter|Response Writers]] - Control the formatting of the responses generated by Request Handlers.  Several different Response Writers are included with Solr, or you can write your own custom implementation.
    * Input Parameters
     * QueryParametersIndex - index of query parameters in following wiki pages
     * CoreQueryParameters
@@ -76, +75 @@

     * HighlightingParameters
     * MoreLikeThis
    * [[SearchComponent|Search Components]] - Search Components provide core functionality to a Request Handler.
-   * [[SolrQuerySyntax|Query Syntax]] - syntax for default query parsing, and how to specify a Query Parser.
+   * [[SolrQuerySyntax|Query Syntax]] - Syntax for default query parsing, and how to specify a Query Parser.
    * [[FunctionQuery|Function Queries]] - Using the values in fields in functions and as factors in scoring
    * [[SolrFacetingOverview|Faceted search]] - Category counts for search results
    * (Geo)[[SpatialSearch|Spatial Search]] - Find results near a point
    * [[FieldCollapsing|Field Collapsing]] / Result Grouping - documents with a common field value are grouped
    * [[Join]] - Do joins (similar to database) on documents
  
+ == Advanced Tools ==
+   * [[ClusteringComponent|Carrot2-based Document Clustering]] - Summarize/compare all documents returned by a query
+   * [[LanguageDetection|Language Detection]] - Deduce the language of a document
+   * [[SolrUIMA|UIMA Natural Language Processing]] - Sophisticated NLP suite, originally from IBM Research
+   * [[OpenNLP|OpenNLP Natural Language Processing]] - Simple NLP suite
+   * [[Business Rules|Business Rules]] - Alter stored documents and query results with flexible dynamic rules engine 
+ 
  == Tips, Tricks and Use Cases ==
   * FilterQueryGuidance - How do I partition my query in main query and filter query in order to have Solr's caching work efficiently?
   * SolrPlugins
   * SolrRelevancyCookbook
   * NearRealtimeSearch
-  * Auto-complete - use either [[SimpleFacetParameters#facet.prefix|Faceting with facet.prefix]] or [[Suggester]] or TermsComponent
+  * Auto-complete - Use either [[SimpleFacetParameters#facet.prefix|Faceting with facet.prefix]] or [[Suggester]] or TermsComponent
   * UniqueKey - Covers tips about unique keys in the schema
   * [[JapaneseLanguageSupport|Japanese Language Support]] - How to search Japanese text, best practices and various considerations
   * Using PreAnalyzedField type for integration with external document processing pipelines