You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Brian Lamb <br...@journalexperts.com> on 2011/06/01 21:41:34 UTC
Searching using a PDF
Is it possible to do a search based on a PDF file? I know its possible to
update the index with a PDF but can you do just a regular search with it?
Thanks,
Brian Lamb
Re: Searching using a PDF
Posted by Erick Erickson <er...@gmail.com>.
Not that I know of, you'll probably have to handle this before it hits Solr.
Best
Erick
On Thu, Jun 2, 2011 at 3:10 PM, Brian Lamb
<br...@journalexperts.com> wrote:
> I mean instead of typing http://localhost:8983/?q=mysearch, I would send a
> PDF file with the contents of "mysearch" and search based on that. I am
> leaning toward handling this before it hits solr however.
>
> Thanks,
>
> Brian Lamb
>
> On Wed, Jun 1, 2011 at 3:52 PM, Erick Erickson <er...@gmail.com>wrote:
>
>> I'm not quite sure what you mean by "regular search". When
>> you index a PDF (Presumably through Tika or Solr Cell) the text
>> is indexed into your index and you can certainly search that. Additionally,
>> there may be meta data indexed in specific fields (e.g. author,
>> date modified, etc).
>>
>> But what does "search based on a PDF file" mean in your context?
>>
>> Best
>> Erick
>>
>> On Wed, Jun 1, 2011 at 3:41 PM, Brian Lamb
>> <br...@journalexperts.com> wrote:
>> > Is it possible to do a search based on a PDF file? I know its possible to
>> > update the index with a PDF but can you do just a regular search with it?
>> >
>> > Thanks,
>> >
>> > Brian Lamb
>> >
>>
>
Re: Searching using a PDF
Posted by Brian Lamb <br...@journalexperts.com>.
I mean instead of typing http://localhost:8983/?q=mysearch, I would send a
PDF file with the contents of "mysearch" and search based on that. I am
leaning toward handling this before it hits solr however.
Thanks,
Brian Lamb
On Wed, Jun 1, 2011 at 3:52 PM, Erick Erickson <er...@gmail.com>wrote:
> I'm not quite sure what you mean by "regular search". When
> you index a PDF (Presumably through Tika or Solr Cell) the text
> is indexed into your index and you can certainly search that. Additionally,
> there may be meta data indexed in specific fields (e.g. author,
> date modified, etc).
>
> But what does "search based on a PDF file" mean in your context?
>
> Best
> Erick
>
> On Wed, Jun 1, 2011 at 3:41 PM, Brian Lamb
> <br...@journalexperts.com> wrote:
> > Is it possible to do a search based on a PDF file? I know its possible to
> > update the index with a PDF but can you do just a regular search with it?
> >
> > Thanks,
> >
> > Brian Lamb
> >
>
Re: Searching using a PDF
Posted by Erick Erickson <er...@gmail.com>.
I'm not quite sure what you mean by "regular search". When
you index a PDF (Presumably through Tika or Solr Cell) the text
is indexed into your index and you can certainly search that. Additionally,
there may be meta data indexed in specific fields (e.g. author,
date modified, etc).
But what does "search based on a PDF file" mean in your context?
Best
Erick
On Wed, Jun 1, 2011 at 3:41 PM, Brian Lamb
<br...@journalexperts.com> wrote:
> Is it possible to do a search based on a PDF file? I know its possible to
> update the index with a PDF but can you do just a regular search with it?
>
> Thanks,
>
> Brian Lamb
>