You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2011/08/22 20:00:11 UTC
[Solr Wiki] Update of "SolrEcosystem" by DavidSmiley
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The "SolrEcosystem" page has been changed by DavidSmiley:
http://wiki.apache.org/solr/SolrEcosystem?action=diff&rev1=7&rev2=8
Comment:
Add Talend Open Studio, and comment on ETLs.
* [[https://github.com/kolstae/openpipe|OpenPipe]] ([[http://openpipe.berlios.de/|alt]]) [[http://openpipe.berlios.de/|Solr Info]]
* [[http://www.openpipeline.org|OpenPipeline]]
* ETL (Extract Transform Load) -- many are applicable; these are a couple notable ones:
+ * [[http://www.talend.com/products-data-integration/talend-open-studio.php|Talend Open Studio (TOS)]]
+ * [[http://kettle.pentaho.com/|Kettle (Pentaho)]]
* [[http://sourceforge.net/projects/cloveretl/|CloverETL]]
- * [[http://kettle.pentaho.com/|Kettle (Pentaho)]]
+
+ A common problem amongst the ETLs is that each step in the pipeline accepts and emits records in a fixed flat schema, they don't support dynamic name-value pairs. And these are not document oriented; if you want to pass a DOM of some kind then you serialize it into a field. However, the ETLs are all far more mature than nascent document or XML oriented pipelines.
+
* ESBs (Enterprise Service Buses) -- not listed; various
* One of the [[http://xproc.org/implementations/|XProc implementations]] (an XML pipeline spec) such as [[http://xmlcalabash.com/|Calabash]]
@@ -45, +49 @@
Hadoop:
* [[http://www.cascading.org/|Cascading]] - [[https://github.com/bixolabs/cascading.solr|Solr "Tap"]]
* [[http://katta.sourceforge.net/|Katta]] - [[KattaIntegration]]
-
= Misc =
* [[https://github.com/tjake/Solandra|Solandra]] - A tight integration of Solr and Cassandra. The result is Solr with the awesome scalability properties of Cassandra.