Posted to commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2015/07/23 21:25:13 UTC
[Solr Wiki] Trivial Update of "SolrEcosystem" by PascalEssiembre
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The "SolrEcosystem" page has been changed by PascalEssiembre:
https://wiki.apache.org/solr/SolrEcosystem?action=diff&rev1=20&rev2=21
Comment:
Removed the recently added "Apache" prefix from Nutch in Crawlers And Connectors, since the other Apache products listed do not have it (consistency).
== Crawlers And Connectors ==
Web, email, and file crawlers (alphabetically).
- * [[http://lucene.apache.org/nutch/|Apache Nutch]] (web) [[http://wiki.apache.org/nutch/NutchTutorial|Solr Info (included as part of the Nutch Tutorial)]]
* [[http://aperture.sourceforge.net/|Aperture]] (web, email, file)
* [[http://www.crawl-anywhere.com/|Crawl-Anywhere]] (web) [[http://www.crawl-anywhere.com/solr-indexer/|Solr Info]]
* DataImportHandler (email, file)
@@ -25, +24 @@
* [[http://en.wikipedia.org/wiki/Heritrix|Heritrix]] (web)
* [[http://incubator.apache.org/connectors/|ManifoldCF]] (web, file) [[http://incubator.apache.org/connectors/end-user-documentation.html#solroutputconnector|Solr Info]]
* [[http://www.norconex.com/collectors/|Norconex Collectors]] (web, file) [[http://www.norconex.com/collectors/committer-solr/|Solr Info]]
+ * [[http://lucene.apache.org/nutch/|Nutch]] (web) [[http://wiki.apache.org/nutch/NutchTutorial|Solr Info (included as part of the Nutch Tutorial)]]
== Pipelines / Document Processing ==
Frameworks for flexible document processing. See DocumentProcessing for more background and criteria for a proposal. Some crawlers/connectors have their own pipeline capability and they are not repeated here.