You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Rishabh Patel <ri...@gmail.com> on 2016/07/15 20:02:29 UTC

Indexing documents stored in HDFS

Hello,

I am trying to find a way to index some documents, all located in a
directory in HDFS.

Since HDFS has a REST API, I was trying to use the DataImportHandler(DIH)
along with the datasource type as URLDataSource, to index the documents.

Is this approach wrong? If so, then is there a canonical way to index
documents present in HDFS?
-- 
Sincerely,
*Rishabh Patel*