You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@manifoldcf.apache.org by Shinichiro Abe <sh...@gmail.com> on 2011/08/01 13:20:34 UTC
Re: Data query of JDBC repo
Thank you. Indexing data of VARCHAR worked well. My solrconfig setting was incorrect.
Shinichiro
On 2011/07/29, at 19:06, Karl Wright wrote:
> Oh, FWIW, content data of type VARCHAR should also work.
> Karl
>
> On Fri, Jul 29, 2011 at 6:05 AM, Karl Wright <da...@gmail.com> wrote:
>> I believe the end-user documentation talks about this to some extent.
>> Nevertheless, the JDBC handler is designed to pull all the necessary
>> information for a document, including the content data, out of a
>> single database table. So it presumes the content is stored as either
>> CLOB data or BLOB data in one column of the table.
>>
>> The url field is necessary because that is what ManifoldCF uses for
>> the "id" in the target search engine. It needs this to be able to
>> remove or replace the document in the target on subsequent job runs.
>> It might as well be a URL because it presumes that the search user
>> will need some way to get to the content of the indexed document.
>>
>> Hope that answers your question.
>>
>> Karl
>>
>> 2011/7/29 Shinichiro Abe <sh...@gmail.com>:
>>> Hello.
>>>
>>> I used JDBC Repository Connection and created
>>> the following view table[1] on postgesql.
>>> I set the default setting at Queries tab in job lists.
>>> I run the job, then on the Solr, only urlfield was indexed as id field.
>>>
>>> 1)I also want to index datafield. What is needed to set?
>>> Can I use it like solr dataimporthandler?
>>> For example, can it index datafield1, datafield2, datafield…?
>>>
>>> 2)Why ingesting datafield need to know if url is valid in source code?
>>> I want to index datafield without urlfield.
>>>
>>> My usage may be wrong, I assumed that string data of datafield is indexed as contents.
>>> I want to know what kind of table Data-query assume.
>>>
>>> [1]view:documenttable
>>> | idfield | versionfield | urlfield | datafield | modifydatefield
>>> | char varying | char varying | char varying | char varying | bigint
>>> --------------------------------------------------------------------------
>>> | 1 | 1 | file:///dummy/1| test string | 1
>>> | 2 | 1 | file:///dummy/2| test info | 1
>>>
>>> Thank you,
>>> Shinichiro Abe
>>