Posted to solr-user@lucene.apache.org by Shravan Kumar Bolla <sh...@radiant.digital> on 2020/04/18 09:16:17 UTC

Indexing data from multiple data sources (CSV, RDBMS)

Hi,

I am trying to index data from multiple data sources into a single collection. I declared the data sources in the data-config file and updated the managed schema to include the fields from all of the sources, with a unique key that is common to all of them.
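
As a rough illustration of the schema side, the additions would look something like the sketch below. The field names id and num come from the config in this post; the field types (string, pint) and the indexed/stored attributes are assumptions for illustration only, not the actual schema.

```xml
<!-- Sketch only: field types and attributes are assumed, not copied from the real schema. -->
<field name="id"  type="string" indexed="true" stored="true" required="true"/>
<field name="num" type="pint"   indexed="true" stored="true"/>

<!-- The key shared by every data source. -->
<uniqueKey>id</uniqueKey>
```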

Here is a sample config file.

<dataConfig>
  <dataSource type="Jdbc-1" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost/aaa" user="***" password="***" batchSize="1" />
  <dataSource type="Jdbc-2" driver="com.microsoft.sqlserver.jdbc.SQLServerDriver" url="jdbc:sqlserver://localhost;databasename=aaa" user="***" password="**"/>
  <document name="abc">
    <entity name="aaaa" dataSource="Jdbc-1" query="SELECT id,number FROM abc">
      <field column="id" name="id"/>
      <field column="num" name="num"/>
      <entity name="bbb" dataSource="Jdbc-2" query="SELECT id,num FROM abc">
        <field column="id" name="id"/>
        <field column="num" name="num"/>
      </entity>
    </entity>
  </document>
</dataConfig>

Error Details:
Full Import failed:java.lang.RuntimeException:java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException: Invalid type for data source: Jdbc-2
Processing Document #1
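
For comparison, here is a minimal sketch of how the same two sources might be declared if the type attribute is meant to name the DataSource implementation (JdbcDataSource) and a separate name attribute carries the label that each entity's dataSource attribute refers to. That reading would be consistent with the "Invalid type for data source" message, but it is untested against this setup. The labels ds-mysql and ds-mssql are placeholders, the queries are assumed, and the two entities are shown as siblings on the assumption that the sources are independent of each other.

```xml
<dataConfig>
  <!-- type = implementation class; name = label used by entities (placeholder names) -->
  <dataSource name="ds-mysql" type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost/aaa" user="***" password="***"/>
  <dataSource name="ds-mssql" type="JdbcDataSource" driver="com.microsoft.sqlserver.jdbc.SQLServerDriver" url="jdbc:sqlserver://localhost;databasename=aaa" user="***" password="***"/>
  <document name="abc">
    <!-- Sibling root entities: each source is imported on its own (assumed intent) -->
    <entity name="aaaa" dataSource="ds-mysql" query="SELECT id, num FROM abc">
      <field column="id" name="id"/>
      <field column="num" name="num"/>
    </entity>
    <entity name="bbb" dataSource="ds-mssql" query="SELECT id, num FROM abc">
      <field column="id" name="id"/>
      <field column="num" name="num"/>
    </entity>
  </document>
</dataConfig>
```

Keeping bbb nested inside aaaa, as in the original config, would instead run the inner query once for every row returned by the outer one, which only makes sense if the second source is meant to enrich rows from the first.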

Thanks,
Shravan