You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Shalin Shekhar Mangar (JIRA)" <ji...@apache.org> on 2013/07/29 17:05:48 UTC

[jira] [Resolved] (SOLR-5086) The OR operator works incorrectly in XPathEntityProcessor

     [ https://issues.apache.org/jira/browse/SOLR-5086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shalin Shekhar Mangar resolved SOLR-5086.
-----------------------------------------

    Resolution: Not A Problem

The XPathEntityProcessor does not support the OR operator in field xpaths. The OR operator is supported only in the "forEach" attribute of entity.

See the supported xpath types here:
http://wiki.apache.org/solr/DataImportHandler#Configuration_in_data-config.xml-1
                
> The OR operator works incorrectly in XPathEntityProcessor
> ---------------------------------------------------------
>
>                 Key: SOLR-5086
>                 URL: https://issues.apache.org/jira/browse/SOLR-5086
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler
>    Affects Versions: 4.4
>            Reporter: shenzhuxi
>
> I's trying to use DataImportHandler to index RSS/ATOM feed and find bizarre behaviours of the OR operator in XPathEntityProcessor. 
> Here is the configuration.  
> <?xml version="1.0" encoding="UTF-8"?>
> <dataConfig>
>   <dataSource type="FileDataSource"/>
>   <document>
>     <entity name="rss" processor="FileListEntityProcessor" baseDir="${solr.solr.home}/feed/rss" fileName="^.*\.xml$" recursive="true" rootEntity="false" dataSource="null">
>       <entity name="feed" url="${rss.fileAbsolutePath}" processor="XPathEntityProcessor" forEach="/rss/channel/item|/feed/entry" transformer="DateFormatTransformer">
>         <field column="link" xpath="/rss/channel/item/link|/feed/entry/link/@href"/>
>       </entity>
>     </entity>
>   </document>
> </dataConfig>
> The first OR operator in "/rss/channel/item|/feed/entry" works correctly. 
> But the second one in "/rss/channel/item/link|/feed/entry/link/@href" doesn't work. 
> If I rewrite it to either "/rss/channel/item/link" or "/feed/entry/link/@href", it works correctly. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org