You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by "Visser, Marc" <Ma...@ordina.nl> on 2014/05/08 08:55:45 UTC

Solr, How to index scripts *.sh and *.SQL

HI All,
Recently I have set up an image with SOLR. My goal is to index and extract files on a Windows and Linux server. It is possible for me to index and extract data from multiple file types. This is done by the SOLR CELL request handler. See the post.jar cmd below.

j ava -Dauto -Drecursive -jar post.jar Y:\ SimplePostTool version 1.5 Posting files to base url localhost:8983/solr/update.. Entering auto mode. File endings considered are xml,json,csv,pdf,doc,docx,ppt,pp tx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log Entering recursive mode, max depth=999, delay=0s 0 files indexed.

Is it possible to index and extract metadata/content from file types like .sh and .sql? If it is possible I would like to know how of course :)



Greetings

Marc


Disclaimer
Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.

This e-mail and any attachments are confidential and are solely intended for the addressee. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message.

Re: Solr, How to index scripts *.sh and *.SQL

Posted by Marc <ma...@ordina.nl>.
Thanks that worked



--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-How-to-index-scripts-sh-and-SQL-tp4135627p4136207.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr, How to index scripts *.sh and *.SQL

Posted by Alexei Martchenko <al...@martchenko.com.br>.
Same in Windows. just plain text files, no metadata, no headers.


alexei martchenko
Facebook <http://www.facebook.com/alexeiramone> |
Linkedin<http://br.linkedin.com/in/alexeimartchenko>|
Steam <http://steamcommunity.com/id/alexeiramone/> |
4sq<https://pt.foursquare.com/alexeiramone>| Skype: alexeiramone |
Github <https://github.com/alexeiramone> | (11) 9 7613.0966 |


2014-05-11 4:32 GMT-03:00 Gora Mohanty <go...@mimirtech.com>:

> On 8 May 2014 12:25, Visser, Marc <Ma...@ordina.nl> wrote:
> >
> > HI All,
> > Recently I have set up an image with SOLR. My goal is to index and
> extract files on a Windows and Linux server. It is possible for me to index
> and extract data from multiple file types. This is done by the SOLR CELL
> request handler. See the post.jar cmd below.
> >
> > j ava -Dauto -Drecursive -jar post.jar Y:\ SimplePostTool version 1.5
> Posting files to base url localhost:8983/solr/update.. Entering auto mode.
> File endings considered are xml,json,csv,pdf,doc,docx,ppt,pp
> tx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log Entering recursive
> mode, max depth=999, delay=0s 0 files indexed.
> >
> > Is it possible to index and extract metadata/content from file types
> like .sh and .sql? If it is possible I would like to know how of course :)
>
> Don't know about Windows, but on Linux these are just text files. What
> metadata are you referring to? Normally, a Linux text file only has
> content,
> unless you are talking about metadata such as obtained from:
>    file cmd.sh
>
> Regards,
> Gora
>

Re: Solr, How to index scripts *.sh and *.SQL

Posted by Gora Mohanty <go...@mimirtech.com>.
On 8 May 2014 12:25, Visser, Marc <Ma...@ordina.nl> wrote:
>
> HI All,
> Recently I have set up an image with SOLR. My goal is to index and extract files on a Windows and Linux server. It is possible for me to index and extract data from multiple file types. This is done by the SOLR CELL request handler. See the post.jar cmd below.
>
> j ava -Dauto -Drecursive -jar post.jar Y:\ SimplePostTool version 1.5 Posting files to base url localhost:8983/solr/update.. Entering auto mode. File endings considered are xml,json,csv,pdf,doc,docx,ppt,pp tx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log Entering recursive mode, max depth=999, delay=0s 0 files indexed.
>
> Is it possible to index and extract metadata/content from file types like .sh and .sql? If it is possible I would like to know how of course :)

Don't know about Windows, but on Linux these are just text files. What
metadata are you referring to? Normally, a Linux text file only has content,
unless you are talking about metadata such as obtained from:
   file cmd.sh

Regards,
Gora