You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@solr.apache.org by gu...@apache.org on 2022/10/29 01:39:35 UTC

[solr] branch branch_9x updated: SOLR-16495 - new text for introduction to solr page. (#1130) (#1148)

This is an automated email from the ASF dual-hosted git repository.

gus pushed a commit to branch branch_9x
in repository https://gitbox.apache.org/repos/asf/solr.git


The following commit(s) were added to refs/heads/branch_9x by this push:
     new 716b268872c SOLR-16495 - new text for introduction to solr page. (#1130) (#1148)
716b268872c is described below

commit 716b268872ccbc0755026935af38842549d11bd1
Author: Gus Heck <46...@users.noreply.github.com>
AuthorDate: Fri Oct 28 21:39:30 2022 -0400

    SOLR-16495 - new text for introduction to solr page. (#1130) (#1148)
---
 .../getting-started/pages/introduction.adoc        | 42 ++++++++++++----------
 1 file changed, 23 insertions(+), 19 deletions(-)

diff --git a/solr/solr-ref-guide/modules/getting-started/pages/introduction.adoc b/solr/solr-ref-guide/modules/getting-started/pages/introduction.adoc
index 98ab8b8e80e..0a1dbe58285 100644
--- a/solr/solr-ref-guide/modules/getting-started/pages/introduction.adoc
+++ b/solr/solr-ref-guide/modules/getting-started/pages/introduction.adoc
@@ -16,28 +16,32 @@
 // specific language governing permissions and limitations
 // under the License.
 
-Solr is a search server built on top of https://lucene.apache.org[Apache Lucene], an open source, Java-based, information retrieval library.
-It is designed to drive powerful document retrieval applications - wherever you need to serve data to users based on their queries, Solr can work for you.
+Apache^TM^ Solr is a search server built on top of https://lucene.apache.org[Apache Lucene^TM^], an open source, Java-based, information retrieval library.
+Solr is designed to drive powerful document retrieval or analytical applications involving unstructured data, semi-structured data or a mix of unstructured and structured data.
+It also has secondary support for limited relational, graph, statistical, data analysis or storage related use cases.
+Since Solr is Apache 2.0 licensed open source software designed for extensibility, it gives you the freedom to adapt or optimize it for almost any commercial or non-commercial use case.
 
-Solr is based on open standards and it is highly extensible.
-Solr queries are simple HTTP request URLs and the response is a structured document: mainly JSON, but it could also be XML, CSV, or other formats.
-This means that a wide variety of clients will be able to use Solr, from other web applications to browser clients, rich client applications, and mobile devices.
-Any platform capable of HTTP can talk to Solr.
-See xref:deployment-guide:client-apis.adoc[] for details on client APIs.
+Solr's xref:query-guide:query-syntax-and-parsers.adoc[query syntax and parsers] offer support for everything from the simplest keyword searching through to complex queries on multiple fields and xref:query-guide:faceting.adoc[faceted] search results.
+xref:query-guide:collapse-and-expand-results.adoc[Collapsing] and xref:query-guide:result-clustering.adoc[clustering] results offer compelling features for e-commerce and storefronts.
+xref:query-guide:streaming-expressions.adoc[Streaming expressions] allow you to conduct analytics on an entire corpus, a subset matching a query, or a random sample from a set of documents.
+Powerful xref:query-guide:math-expressions.adoc[math expressions] build on streaming expressions to provide the backbone for advanced analysis and predictive analytics use cases.
 
-Flexible schema configurations allow nearly any type of data to be stored in Solr.
-The xref:indexing-guide:schema-elements.adoc[] has more details on these options.
+Advanced relevancy tuning is also supported;
+Solr provides access to almost all of Lucene's text analysis features including tokenization, stemming, synonyms and much more, allowing you to tune relevancy based on knowledge of your users and your domain.
+Solr even allows for customization of relevancy via machine learning using the xref:query-guide:learning-to-rank.adoc[] feature.
 
-Solr offers support for the simplest keyword searching through to complex queries on multiple fields and faceted search results.
-Collapsing and clustering results offer compelling features for e-commerce and storefronts.
-Powerful math expressions provide the backbone for advanced analytics use cases.
-The xref:query-guide:query-syntax-and-parsers.adoc[] has more information about searching and queries.
+Queries are transmitted to Solr via HTTP 1.1 or 2.0 requests and the response is typically a list of structured document descriptors.
+In the classic example, 10 descriptors are returned, each including a URL to locate the document (often rendered as "10 blue links"). However, Solr can go far beyond document locators and many other types of document metadata might also be included.  Flexible schema configurations allow nearly any type of metadata to be associated with a document indexed in Solr.
+The xref:indexing-guide:schema-elements.adoc[schema elements] page of the indexing guide has more details on these options.
 
-If Solr's capabilities are not impressive enough, its ability to handle very high-volume applications should do the trick.
+JSON is the default response format, but it could also be XML, CSV, optimized binary, or (with customization) any format you desire.
+This means that a wide variety of clients will be able to use Solr. Such clients might be web applications, browsers, rich client applications, or mobile devices.
+Any platform capable of HTTP can talk to Solr.
+Several xref:deployment-guide:client-apis.adoc[] are provided for use in common programming languages.
 
-A relatively common scenario is that you have so much data, or so many queries, that a single Solr server is unable to handle your entire workload.
-In this case, you can scale up the capabilities of your application using xref:deployment-guide:cluster-types.adoc[] to better distribute the data, and the processing of requests, across many servers.
-Multiple options can be mixed and matched depending on the scalability you need.
+In addition to providing a network accessible engine for Lucene based document retrieval, Solr provides the ability to scale beyond the limitations of a single machine.
+Indexes can be sharded and replicated for performance and reliability, using either one of two xref:deployment-guide:cluster-types.adoc[].
+One type of cluster requires no supporting infrastructure, and instances are managed directly by administrators. The second type uses https://zookeeper.apache.org/[Apache Zookeeper^TM^] to coordinate management activities across the cluster.
 
-Best of all, this talk about high-volume applications is not just hypothetical: some of the famous Internet sites that use Solr today are Macy's, EBay, and Zappo's.
-For more examples, take a look at https://cwiki.apache.org/confluence/display/solr/PublicServers.
+Solr scaling and high availability features are so effective that some of the largest and most famous internet sites use Solr.
+A partial, typically self nominated, list of sites using Solr can be found at https://cwiki.apache.org/confluence/display/solr/PublicServers.