You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Aniket Khare (JIRA)" <ji...@apache.org> on 2016/11/16 13:56:58 UTC

[jira] [Updated] (SOLR-9774) Delta indexing with child documents with help of cacheImpl="SortedMapBackedCache"

     [ https://issues.apache.org/jira/browse/SOLR-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aniket Khare updated SOLR-9774:
-------------------------------
    Description: 
Hi,

I am using solr DIH for indexing the Parent-Child relation data and using cacheImpl="SortedMapBackedCache".
For Full data indexinf I am using command clean="true" and for delta I am using command full-import and clean="false".
So the same queries are being executed for fulland delta and indexing working properly.
The issue which we are facing is where for a perticuler parent document, there not a single child document and we are adding new child document.
Following are the steps to reproduce the issue.

1. Add Child document to an existing parent document which is not having empty child document.
2. Once the child document is added with delta indexing, try to modify the parent document and run delta indexing again
3. After the delta indexing is completed, I can see the modified child documents showing in Solr DIH page in debug mode. But the it is not getting updated in Solr collection.

I am using data config as below as below.

  <document name="User">
    <entity name="Parent" dataSource="DS1" pk="id" query="[SP_GetData] '${dih.last_index_time}','${dataimporter.request.clean}'">
      <field column="id" name="id"/>
      <field column="FirstName" name="FirstName"/>
      <field column="LastName" name="LastName"/>
      <entity name="R" dataSource="DS1" query="[SP_GetDataRTest] '${dih.last_index_time}','${dataimporter.request.clean}'"
      cacheKey="id" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache">
        <field column="RID" name="RID"/>
        <field column="RNAME" name="RID"/>
      </entity>
     <entity name="P" dataSource="DS2" query="[SP_GetDataTest] '${dih.last_index_time}','${dataimporter.request.clean}'"
      cacheKey="PID" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache" child="true">
        <field column="id" name="id"/>        
	<field column="PNAME" name="PNAME"/>
      </entity>
    </entity>
  </document>


  was:
Hi,

I am using solr DIH for indexing the Parent-Child relation data and using cacheImpl="SortedMapBackedCache".
For Full data indexinf I am using command clean="true" and for delta I am using command full-import and clean="false".
So the same queries are being executed for fulland delta and indexing working properly.
The issue which we are facing is where for a perticuler parent document, there not a single child document and we are adding new child document.
Following are the steps to reproduce the issue.

1. Add Child document to an existing parent document which is not having empty child document.
2. Once the child document is added with delta indexing, try to modify the parent document and run delta indexing again
3. After the delta indexing is completed, I can see the modified child documents showing in Solr DIH page in debug mode. But the it is not getting updated in Solr collection.

I am using data config as below as below.

  <document name="User">
    <entity name="Parent" dataSource="DS1" pk="id" query="[SP_GetData] '${dih.last_index_time}','${dataimporter.request.clean}'">
      <field column="id" name="id"/>
      <field column="FirstName" name="FirstName"/>
      <field column="LastName" name="LastName"/>
      <entity name="R" dataSource="DS1" query="[SP_GetDataRole] '${dih.last_index_time}','${dataimporter.request.clean}'"
      cacheKey="id" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache">
        <field column="RID" name="RID"/>
        <field column="RNAME" name="RID"/>
      </entity>
     <entity name="P" dataSource="DS2" query="[SP_GetDataTest] '${dih.last_index_time}','${dataimporter.request.clean}'"
      cacheKey="PID" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache" child="true">
        <field column="id" name="id"/>        
	<field column="PNAME" name="PNAME"/>
      </entity>
    </entity>
  </document>



> Delta indexing with child documents with help of cacheImpl="SortedMapBackedCache"
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-9774
>                 URL: https://issues.apache.org/jira/browse/SOLR-9774
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: contrib - DataImportHandler, Data-driven Schema
>    Affects Versions: 6.1
>            Reporter: Aniket Khare
>              Labels: DIH, solr
>
> Hi,
> I am using solr DIH for indexing the Parent-Child relation data and using cacheImpl="SortedMapBackedCache".
> For Full data indexinf I am using command clean="true" and for delta I am using command full-import and clean="false".
> So the same queries are being executed for fulland delta and indexing working properly.
> The issue which we are facing is where for a perticuler parent document, there not a single child document and we are adding new child document.
> Following are the steps to reproduce the issue.
> 1. Add Child document to an existing parent document which is not having empty child document.
> 2. Once the child document is added with delta indexing, try to modify the parent document and run delta indexing again
> 3. After the delta indexing is completed, I can see the modified child documents showing in Solr DIH page in debug mode. But the it is not getting updated in Solr collection.
> I am using data config as below as below.
>   <document name="User">
>     <entity name="Parent" dataSource="DS1" pk="id" query="[SP_GetData] '${dih.last_index_time}','${dataimporter.request.clean}'">
>       <field column="id" name="id"/>
>       <field column="FirstName" name="FirstName"/>
>       <field column="LastName" name="LastName"/>
>       <entity name="R" dataSource="DS1" query="[SP_GetDataRTest] '${dih.last_index_time}','${dataimporter.request.clean}'"
>       cacheKey="id" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache">
>         <field column="RID" name="RID"/>
>         <field column="RNAME" name="RID"/>
>       </entity>
>      <entity name="P" dataSource="DS2" query="[SP_GetDataTest] '${dih.last_index_time}','${dataimporter.request.clean}'"
>       cacheKey="PID" cacheLookup="Parent.id" processor="SqlEntityProcessor" cacheImpl="SortedMapBackedCache" child="true">
>         <field column="id" name="id"/>        
> 	<field column="PNAME" name="PNAME"/>
>       </entity>
>     </entity>
>   </document>



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org