Posted to dev@lucene.apache.org by "Alexandre Rafalovitch (JIRA)" <ji...@apache.org> on 2016/10/08 03:16:20 UTC

[jira] [Closed] (SOLR-1113) Error reports from ExtractingRequestHandler and Co do not indicate name of rejected documents

     [ https://issues.apache.org/jira/browse/SOLR-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexandre Rafalovitch closed SOLR-1113.
---------------------------------------
    Resolution: Won't Fix

The improvement requested in this issue is not possible: the underlying Apache Commons FileUpload library does not provide the document name until after the file has been read (catch-22).

A potential improvement (in its own JIRA) would be to switch multipart file parsing from the all-in-one API to the streaming API available in the Commons library. That would make it possible, for example, to log at debug level which files were parsed. It still would not tell the middleware which file failed.
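
To illustrate the ordering problem described above, here is a minimal toy model in plain Java. It is not Solr code and does not use the Commons FileUpload API; the class name MultipartOrderSketch, the method fileNamesInOrder and the simplified line-based header scan are all assumptions made for illustration. It only shows that a size check based on the declared request length runs before any part header is read, so no file name is available when the request is rejected, whereas a streaming pass over an accepted request can report names as it meets them.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Toy model only: hypothetical names, not Solr and not Commons FileUpload.
final class MultipartOrderSketch {

    private static final Pattern FILENAME = Pattern.compile("filename=\"([^\"]*)\"");

    /**
     * Returns the file names found in part headers, in stream order.
     * The size check runs first and uses only the declared request length.
     */
    static List<String> fileNamesInOrder(long declaredLength, long maxBytes, InputStream body) {
        if (declaredLength > maxBytes) {
            // Nothing has been read from the body yet, so no file name is known here.
            throw new IllegalStateException("the request was rejected because its size ("
                    + declaredLength + ") exceeds the configured maximum (" + maxBytes + ")");
        }
        List<String> names = new ArrayList<>();
        BufferedReader reader =
                new BufferedReader(new InputStreamReader(body, StandardCharsets.ISO_8859_1));
        Iterator<String> lines = reader.lines().iterator();
        while (lines.hasNext()) {
            String line = lines.next();
            // Simplification: any line starting with "Content-Disposition:" counts as a part header.
            if (line.regionMatches(true, 0, "Content-Disposition:", 0, 20)) {
                Matcher m = FILENAME.matcher(line);
                if (m.find()) {
                    names.add(m.group(1)); // a real handler could log this at debug level
                }
            }
        }
        return names;
    }

    public static void main(String[] args) {
        String body = "--XYZ\r\n"
                + "Content-Disposition: form-data; name=\"f1\"; filename=\"a.pdf\"\r\n\r\n"
                + "AAAA\r\n"
                + "--XYZ--\r\n";
        byte[] bytes = body.getBytes(StandardCharsets.ISO_8859_1);
        // Accepted request: names become visible as the stream is read.
        System.out.println(fileNamesInOrder(bytes.length, 2097152L, new ByteArrayInputStream(bytes)));
        try {
            // Declared size and limit taken from the trace in this issue.
            fileNamesInOrder(4585774L, 2097152L, new ByteArrayInputStream(bytes));
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage()); // the message cannot mention any file name
        }
    }
}
```

In this sketch the rejection message can only carry the two sizes, which matches the shape of the SizeLimitExceededException message in the trace below.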

> Error reports from ExtractingRequestHandler and Co do not indicate name of rejected documents
> ---------------------------------------------------------------------------------------------
>
>                 Key: SOLR-1113
>                 URL: https://issues.apache.org/jira/browse/SOLR-1113
>             Project: Solr
>          Issue Type: Improvement
>          Components: update
>            Reporter: Fergus McMenemie
>
> The ExtractingRequestHandler rejects documents that are larger than the configured multipartUploadLimitInKB in solrconfig.xml. None of the generated error messages indicate the name of the rejected document or provide any way of identifying the rejected document. The failure to identify the rejected document complicates the middleware used to look after indexes.
> Here is the trace produced by a recent version of trunk.
> {code}
> SEVERE: org.apache.commons.fileupload.FileUploadBase$SizeLimitExceededException: the request was rejected because its size (4585774) exceeds the configured maximum (2097152)
> 	at org.apache.commons.fileupload.FileUploadBase$FileItemIteratorImpl.<init>(FileUploadBase.java:914)
> 	at org.apache.commons.fileupload.FileUploadBase.getItemIterator(FileUploadBase.java:331)
> 	at org.apache.commons.fileupload.FileUploadBase.parseRequest(FileUploadBase.java:349)
> 	at org.apache.commons.fileupload.servlet.ServletFileUpload.parseRequest(ServletFileUpload.java:126)
> 	at org.apache.solr.servlet.MultipartRequestParser.parseParamsAndFillStreams(SolrRequestParsers.java:343)
> 	at org.apache.solr.servlet.StandardRequestParser.parseParamsAndFillStreams(SolrRequestParsers.java:396)
> 	at org.apache.solr.servlet.SolrRequestParsers.parse(SolrRequestParsers.java:114)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:217)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:202)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
> {code} 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
