You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@solr.apache.org by Scott Derrick <sc...@tnstaafl.net> on 2024/03/08 21:25:38 UTC

reindexing fails on a particular file

HI,

     I have about 6,000 files to index using a FileDataSource as the 
dataSource.   This uses a FileListEntityProcessor to get the filenames 
in a folder to process.

     The reindexing is failing on file # 3091.   Something changed 
between version 7.3 and 8.11?

org.apache.solr.handler.dataimport.DataImportHandlerException: Exception 
in applying XSL Transformation Processing Document # 3091

     No filename given, just a number?  So I did a listing "ls -all" in 
the folder, found the 3091'st file deleted it ran the update and it 
failed on the 3091'st file again...

     I then deleted the first 16 files and it fails on file # 3080

     So I need to find out what file is causing the issue, but can't 
seem to get that to out put.  I can see a FileDataSource in the 
dataimport section of the Logging UI in the web app. I set it to ALL but 
it didn't change the output of the log file.

Any ideas how to get the logger to output something useful?

Scott


Re: reindexing fails on a particular file

Posted by Mikhail Khludnev <mk...@apache.org>.
Hi,
You can enable log on wider category, and it give you a clue:
https://github.com/SearchScale/dataimporthandler/blob/branch_9x/src/main/java/org/apache/solr/handler/dataimport/FileDataSource.java#L120
Assuming the error occurs in XPathEntityProcessor, it log in case if
onError=continue
https://github.com/SearchScale/dataimporthandler/blob/branch_9x/src/main/java/org/apache/solr/handler/dataimport/XPathEntityProcessor.java#L316

On Sat, Mar 9, 2024 at 3:32 AM Scott Derrick <sc...@tnstaafl.net> wrote:

> HI,
>
>      I have about 6,000 files to index using a FileDataSource as the
> dataSource.   This uses a FileListEntityProcessor to get the filenames
> in a folder to process.
>
>      The reindexing is failing on file # 3091.   Something changed
> between version 7.3 and 8.11?
>
> org.apache.solr.handler.dataimport.DataImportHandlerException: Exception
> in applying XSL Transformation Processing Document # 3091
>
>      No filename given, just a number?  So I did a listing "ls -all" in
> the folder, found the 3091'st file deleted it ran the update and it
> failed on the 3091'st file again...
>
>      I then deleted the first 16 files and it fails on file # 3080
>
>      So I need to find out what file is causing the issue, but can't
> seem to get that to out put.  I can see a FileDataSource in the
> dataimport section of the Logging UI in the web app. I set it to ALL but
> it didn't change the output of the log file.
>
> Any ideas how to get the logger to output something useful?
>
> Scott
>
>

-- 
Sincerely yours
Mikhail Khludnev