You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@lucene.apache.org by "Cao Manh Dat (JIRA)" <ji...@apache.org> on 2018/05/01 03:14:00 UTC

[jira] [Comment Edited] (SOLR-12278) Ignore very large document on indexing

    [ https://issues.apache.org/jira/browse/SOLR-12278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16459412#comment-16459412 ] 

Cao Manh Dat edited comment on SOLR-12278 at 5/1/18 3:13 AM:
-------------------------------------------------------------

[~dsmiley] some problem of that approach
 * we have to modify all other parsers,
 * each parser has its own set of parameters, which make the size of a SolrInputDocument quite different with the number of bytes of the input (ie: SOLR-6304)
 * what happens if the users have some processor in the middle which enriches the SolrInputDocument

In short vision, IgnoreLargeDocumentProcessor might handy for users who need to filter large documents.


was (Author: caomanhdat):
[~dsmiley] problem of that approach is we have to modify all other parsers, not mention that each parser has its worn set of parameters.

> Ignore very large document on indexing
> --------------------------------------
>
>                 Key: SOLR-12278
>                 URL: https://issues.apache.org/jira/browse/SOLR-12278
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>            Assignee: Cao Manh Dat
>            Priority: Major
>         Attachments: SOLR-12278.patch, SOLR-12278.patch
>
>
> Solr should be able to ignore very large document, so it won't affect the index as well as the tlog. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org