Posted to java-user@lucene.apache.org by Eduardo Botelho <ed...@gmail.com> on 2007/07/27 17:22:51 UTC

Size of field?

Hi guys,

I would like to know whether there is a size limit for the fields of a
document.

I have the following problem:
When a term occurs after a certain number of characters (approximately 87300) in
a field, the search does not find the occurrence.
If I split my field into pages, the terms are found normally.
This problem occurs when I make an exact query (query between quotes).

What could be happening?

I'm using BrazilianAnalyzer and StandardAnalyzer (for tests only) for both
searching and indexing.

thanks...

Sorry for my poor English...

Re: Size of field?

Posted by Erick Erickson <er...@gmail.com>.
Glad it worked out for you.... Did you ever have any insight into what
was magical about 87,300? Although now that I re-read your mail, that
was the number of characters, so I can imagine that your corpus
averaged 8.73 characters/word (87,300 characters / 10,000 terms) <G>....

Best
Erick
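
A rough sketch of the arithmetic behind that guess, assuming the limit counts whitespace-separated terms (real analyzers tokenize differently). The class and the helper offsetAfterTerm are hypothetical names for illustration, not part of Lucene. The helper reports the character offset at which the Nth term ends, which is roughly where a term-count limit would stop indexing.

```java
// Illustration only: not Lucene code. Terms are assumed to be whitespace-separated.
class TermLimitOffset {

    // Returns the character offset just past the maxTerms-th term,
    // or -1 if the text contains fewer than maxTerms terms.
    static int offsetAfterTerm(String text, int maxTerms) {
        int terms = 0;
        boolean inTerm = false;
        for (int i = 0; i < text.length(); i++) {
            boolean ws = Character.isWhitespace(text.charAt(i));
            if (!ws && !inTerm) {
                inTerm = true;                  // a new term starts here
            } else if (ws && inTerm) {
                inTerm = false;                 // the current term just ended
                if (++terms == maxTerms) {
                    return i;
                }
            }
        }
        // The last term may run to the end of the text.
        if (inTerm && ++terms == maxTerms) {
            return text.length();
        }
        return -1;
    }

    public static void main(String[] args) {
        // 10,000 copies of an 8-letter word plus one space: 9 characters per term.
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 10_000; i++) {
            sb.append("palavras ");
        }
        // With 9 characters per term, the 10,000th term ends near character 90,000;
        // an average of 8.73 would put the cut-off near 87,300.
        System.out.println(offsetAfterTerm(sb.toString(), 10_000));
    }
}
```

The point is only that a limit expressed in terms shows up at a character offset that depends on the average term length of the text.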

On 8/1/07, Eduardo Botelho <ed...@gmail.com> wrote:
>
> Hi Erick!!
>
> You're right, I just used setMaxFieldLength() and everything works fine.
>
> You saved my life, thanks! (y)
>

Re: Size of field?

Posted by Eduardo Botelho <ed...@gmail.com>.
Hi Erick!!

You're right, I just used setMaxFieldLength() and everything works fine.

You saved my life, thanks! (y)

On 7/30/07, Erick Erickson <er...@gmail.com> wrote:
>
> See IndexWriter.setMaxFieldLength(). 87,300 is odd, since the default
> max field length, last I knew, was 10,000 terms. But this sounds like
> it might relate to your issue.
>
> Best
> Erick
>

Re: Size of field?

Posted by Erick Erickson <er...@gmail.com>.
See IndexWriter.setMaxFieldLength(). 87,300 is odd, since the default
max field length, last I knew, was 10,000 terms. But this sounds like
it might relate to your issue.

Best
Erick
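
For anyone reading this later, here is a minimal sketch of why a term past the limit cannot be found. It is a toy model, not Lucene code: the class FieldLimitDemo and its methods are hypothetical, and it assumes whitespace tokenization and that terms beyond the limit are silently dropped at indexing time. In Lucene 2.x the real knob is IndexWriter.setMaxFieldLength(int), for example with Integer.MAX_VALUE to lift the limit in practice.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of a per-field term limit; this is NOT Lucene code.
class FieldLimitDemo {

    // Keep at most maxFieldLength whitespace-separated terms and drop the rest,
    // mimicking an index that stops reading a field after the limit.
    static List<String> indexTerms(String fieldText, int maxFieldLength) {
        List<String> kept = new ArrayList<>();
        for (String term : fieldText.trim().split("\\s+")) {
            if (kept.size() >= maxFieldLength) {
                break;                          // everything after this point is never indexed
            }
            if (!term.isEmpty()) {
                kept.add(term);
            }
        }
        return kept;
    }

    // Build a field with fillerTerms copies of a filler word followed by lastTerm.
    static String buildField(int fillerTerms, String lastTerm) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < fillerTerms; i++) {
            sb.append("palavra ");
        }
        sb.append(lastTerm);
        return sb.toString();
    }

    public static void main(String[] args) {
        String field = buildField(10_000, "needle");   // "needle" is term number 10,001

        // With a 10,000-term limit the last term never reaches the index.
        System.out.println(indexTerms(field, 10_000).contains("needle"));

        // With the limit raised, the same term is indexed and can be found.
        System.out.println(indexTerms(field, Integer.MAX_VALUE).contains("needle"));
    }
}
```

This also matches the observation that splitting the text into pages helps: each page becomes a shorter field that stays under the limit.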
