You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-dev@lucene.apache.org by "Noble Paul (JIRA)" <ji...@apache.org> on 2009/08/31 09:03:33 UTC

[jira] Created: (SOLR-1396) standardize the updateprocessorchain syntax

standardize the updateprocessorchain syntax
-------------------------------------------

                 Key: SOLR-1396
                 URL: https://issues.apache.org/jira/browse/SOLR-1396
             Project: Solr
          Issue Type: Improvement
            Reporter: Noble Paul
             Fix For: 1.4


updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.

I propose to change it as follows

{code:xml}
 <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
  <arr name="chain">
      <str>custom</str>
      <str>runUpdate</str>
      <str>log</str>
  </arr>
</updateRequestProcessorChain>
<updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
   <lst name="name">
     <str name="n1">x1</str>
     <str name="n2">x2</str>
   </lst>
</updateProcessor>                                    
<updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
<updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
{code}

The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too





-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Noble Paul (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752921#action_12752921 ] 

Noble Paul commented on SOLR-1396:
----------------------------------

it is not enough to have simple configuration. we need standardized parsing also. I have modified SolrConfig in such a way that we must not need to write any xml parsing for new components as long as we stick to the standard syntax (SOLR-1198).  Going forward SolrConfig implementation should be pluggable and be xml independent, we should have a simplified interface for SolrConfig so that users can read it from anywhere (xml, zookeeper,db etct etc)

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Noble Paul
>             Fix For: 1.4
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Hoss Man (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751250#action_12751250 ] 

Hoss Man commented on SOLR-1396:
--------------------------------


* a ref by name syntax like this would make it possible for two separate chains to both contain the same processor, or for a single processor to occur multiple times in the same chain ... i have no idea if this would break things, but it's something that should be considered
* i don't see a reason why updateRequestProcessorChain needs a class attribute ... it's just a simple container around the processors, is there a use case for making it pluggable?
** unless we make the processor chain class pluggable, there's no reason for an explict array containing hte names of the processors, the top level list under the processor chain can be used (as it is now - just with names instead of the processors themselves)

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Noble Paul
>             Fix For: 1.4
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Noble Paul (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Noble Paul resolved SOLR-1396.
------------------------------

    Resolution: Won't Fix

with SOLR-1326 this kind of syntax is now standard for any type of plugin . so no changes required

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>    Affects Versions: 1.3
>            Reporter: Noble Paul
>             Fix For: 1.5
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Noble Paul (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Noble Paul updated SOLR-1396:
-----------------------------

    Affects Version/s: 1.3
        Fix Version/s:     (was: 1.5)
                       1.4

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>    Affects Versions: 1.3
>            Reporter: Noble Paul
>             Fix For: 1.4
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Hoss Man (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752899#action_12752899 ] 

Hoss Man commented on SOLR-1396:
--------------------------------

bq. This is for standardization. the searchcomponent has a similar configuration

but search components _are_ a pluggable, and have a generic init(NamedList) method ... so do request handlers, and i think what you are refering to is the way the component list for a SearchHandler instance is specified -- in that case named arrays are included in the broader NamedList of the init method to distibguish what it is (because some RequestHandlers could have all sorts of other init params.

but if updateRequestProcessorChain isn't going to be pluggable (as you said: we don't need a class attribute) then we don't need to worry about providing generalized init param support for it ... so we can keep the syntax simple.

really it just comes down to whether or not update processors can really be refrenced by name and reused in multiple chains.  if they can't this is all moot, but if they can then it would certianly make sense to give them names, and break them out like you describe, and then keep the chains simple...

{code}
  <updateRequestProcessorChain name="dedupe">
    <processor>signature<processor>
    <processor>logger</processor>
    <processor>runupdate</processor>
  </updateRequestProcessorChain>
{code}

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Noble Paul
>             Fix For: 1.4
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated SOLR-1396:
----------------------------------

    Fix Version/s:     (was: 1.4)
                   1.5

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Noble Paul
>             Fix For: 1.5
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1396) standardize the updateprocessorchain syntax

Posted by "Noble Paul (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751288#action_12751288 ] 

Noble Paul commented on SOLR-1396:
----------------------------------

bq. i don't see a reason why updateRequestProcessorChain needs a class attribute ... 

actually there is no need.

bq.here's no reason for an explict array containing hte names of the processors, 

This is for standardization. the searchcomponent has a similar configuration

> standardize the updateprocessorchain syntax
> -------------------------------------------
>
>                 Key: SOLR-1396
>                 URL: https://issues.apache.org/jira/browse/SOLR-1396
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Noble Paul
>             Fix For: 1.4
>
>
> updateprocessorChain follows a non-standard syntax in solr . Usually, all the components are initialized as top level components and they are assembled and used using a NamedList syntax .for example search components.
> I propose to change it as follows
> {code:xml}
>  <updateRequestProcessorChain name="custom" class="solr.UpdateRequestProcessorChain"> 
>   <arr name="chain">
>       <str>custom</str>
>       <str>runUpdate</str>
>       <str>log</str>
>   </arr>
> </updateRequestProcessorChain>
> <updateProcessor name="custom" class="solr.CustomUpdateRequestProcessorFactory" >
>    <lst name="name">
>      <str name="n1">x1</str>
>      <str name="n2">x2</str>
>    </lst>
> </updateProcessor>                                    
> <updateProcessor name="runUpdate" class="solr.RunUpdateProcessorFactory" />                    
> <updateProcessor name="log" class="solr.LogUpdateProcessorFactory" />   
> {code}
> The wiki documentation says this was supposed to be reviewed. If possible we should clean it up in 1.4 itself. We can support the old syntax too

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.