You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Eduardo Marques (JIRA)" <ji...@apache.org> on 2015/03/10 12:31:38 UTC

[jira] [Created] (SOLR-7222) Problem with FileListEntityProcessor combined TikaEntityProcessor

Eduardo Marques created SOLR-7222:
-------------------------------------

             Summary: Problem with FileListEntityProcessor combined TikaEntityProcessor
                 Key: SOLR-7222
                 URL: https://issues.apache.org/jira/browse/SOLR-7222
             Project: Solr
          Issue Type: Bug
          Components: contrib - DataImportHandler, contrib - Solr Cell (Tika extraction)
    Affects Versions: 5.0
         Environment: Windows Server 2012 R2
Intel Xeon E5-2620 @2.00GHz
32GB
            Reporter: Eduardo Marques


I was trying to upgrade from Solr 4.9.1 to 5.0.0, but I am dealing with a possible bug, that does not allow me to continue with this migration.

I have configured the following solr-data-config.xml:
http://pastebin.com/XvyD4GDR

In version 4.9.1 the dataimporthandler fetches all the from both directories, and processes, and finishes the dataimport with no problems.

In version 5.0.0 the dataimporhandler also fetches the same files, but just processes 1 document per directory, and finishes with no errors.

Has anything changed regarding those two entity processors, that I should be aware of?

Also, I've found a similar issue here:
http://stackoverflow.com/questions/28943521/solr-dih-fetched-many-and-only-one-processed






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org