You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Dan Wu <wu...@gmail.com> on 2011/10/24 19:42:39 UTC

some basic information on Solr

 Hi all,

I am doing a student project on search engine research. Right now I have
some basic questions about Slor.

1. How many types of data file Solr can support (estimate)? i.e. No. of
file types solr can look at for indexing and searching.

2. How much is estimated cost of incidents per year for Solr ?

Since the numbers could vary from different platforms, however we would like
to know the estimate answers regarding the general cases.

Thanks



-- 
Dan Wu (Fiona Wu)  武丹
Master of Engineering Management Program Degree Candidate
Duke University, North Carolina, USA
Email: dan.wu@duke.edu
Tel: 919-599-2730

RE: some basic information on Solr

Posted by "Jaeger, Jay - DOT" <Ja...@dot.wi.gov>.
It didn't look like that, but.... maybe.

Our experience has been very very good.  I don't think we have seen a crash in our prototype to date (though that prototype is also not very busy).  We have had as many a four cores, with as many as 35 million "documents".

-----Original Message-----
From: stockii [mailto:stock.jonas@googlemail.com] 
Sent: Wednesday, October 26, 2011 2:30 AM
To: solr-user@lucene.apache.org
Subject: Re: some basic information on Solr

i think with "incident" he mean, failures / downtimes / problems with solr !? 

-----
------------------------------- System ----------------------------------------

One Server, 12 GB RAM, 2 Solr Instances, 8 Cores, 
1 Core with 45 Million Documents other Cores < 200.000

- Solr1 for Search-Requests - commit every Minute  - 5GB Xmx
- Solr2 for Update-Request  - delta every Minute - 4GB Xmx
--
View this message in context: http://lucene.472066.n3.nabble.com/some-basic-information-on-Solr-tp3448957p3453837.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: some basic information on Solr

Posted by stockii <st...@googlemail.com>.
i think with "incident" he mean, failures / downtimes / problems with solr !? 

-----
------------------------------- System ----------------------------------------

One Server, 12 GB RAM, 2 Solr Instances, 8 Cores, 
1 Core with 45 Million Documents other Cores < 200.000

- Solr1 for Search-Requests - commit every Minute  - 5GB Xmx
- Solr2 for Update-Request  - delta every Minute - 4GB Xmx
--
View this message in context: http://lucene.472066.n3.nabble.com/some-basic-information-on-Solr-tp3448957p3453837.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: some basic information on Solr

Posted by Simon Willnauer <si...@googlemail.com>.
hey,

2011/10/24 Dan Wu <wu...@gmail.com>:
>  Hi all,
>
> I am doing a student project on search engine research. Right now I have
> some basic questions about Slor.
>
> 1. How many types of data file Solr can support (estimate)? i.e. No. of
> file types solr can look at for indexing and searching.
basically you can use solr to index all kinds of documents as long as
you can extract the text from the document. However, Solr ships with
content extraction support that handles a large set of different
files. AFAIK it leverages apache tika (http://tika.apache.org) which
supports a very large set of document formats
(http://tika.apache.org/0.10/formats.html). Hope this helps here?!
>
> 2. How much is estimated cost of incidents per year for Solr ?

I have to admit I don't know what you are asking for. can you
elaborate on this a bit? What is an incident in this context?

simon
>
> Since the numbers could vary from different platforms, however we would like
> to know the estimate answers regarding the general cases.
>
> Thanks
>
>
>
> --
> Dan Wu (Fiona Wu)  武丹
> Master of Engineering Management Program Degree Candidate
> Duke University, North Carolina, USA
> Email: dan.wu@duke.edu
> Tel: 919-599-2730
>

RE: some basic information on Solr

Posted by "Jaeger, Jay - DOT" <Ja...@dot.wi.gov>.
I am not a developer either.  We are just using it in a project here.

-----Original Message-----
From: Dan Wu [mailto:wudan1017@gmail.com] 
Sent: Monday, October 24, 2011 2:16 PM
To: solr-user@lucene.apache.org
Subject: Re: some basic information on Solr

 JRJ,

We did check the solr official website but found it was really technical,
since we are not on the developer side and we just want some basic
information or numbers about its usage.

Thanks for your answer, anyway.



2011/10/24 Jaeger, Jay - DOT <Ja...@dot.wi.gov>

> 1.  Solr, proper, does not index "files".  An adjunct called Solr Cel can.
>  See http://wiki.apache.org/solr/ExtractingRequestHandler .  That article
> describes which kinds of files it Solr Cel can handle.
>
> 2.  I have no idea what you mean by "incidents per year".  Please explain.
>
> 3.  Even though you didn't ask:  You are apparently a student at an
> advanced level.  At your level I would guess that your professors expect
> *YOU* to read thru the material available on the Internet on Solr and figure
> it out on your own, rather than just asking others to do your work for you.
>  ;^)
>
> In particular, before asking further questions you should probably read
> thru http://wiki.apache.org/solr/FrontPage and
> http://lucene.apache.org/solr/tutorial.html .
>
> JRJ
>
> -----Original Message-----
> From: Dan Wu [mailto:wudan1017@gmail.com]
> Sent: Monday, October 24, 2011 12:43 PM
> To: solr-user@lucene.apache.org
> Subject: some basic information on Solr
>
>  Hi all,
>
> I am doing a student project on search engine research. Right now I have
> some basic questions about Slor.
>
> 1. How many types of data file Solr can support (estimate)? i.e. No. of
> file types solr can look at for indexing and searching.
>
> 2. How much is estimated cost of incidents per year for Solr ?
>
> Since the numbers could vary from different platforms, however we would
> like
> to know the estimate answers regarding the general cases.
>
> Thanks
>

Re: some basic information on Solr

Posted by Dan Wu <wu...@gmail.com>.
 JRJ,

We did check the solr official website but found it was really technical,
since we are not on the developer side and we just want some basic
information or numbers about its usage.

Thanks for your answer, anyway.



2011/10/24 Jaeger, Jay - DOT <Ja...@dot.wi.gov>

> 1.  Solr, proper, does not index "files".  An adjunct called Solr Cel can.
>  See http://wiki.apache.org/solr/ExtractingRequestHandler .  That article
> describes which kinds of files it Solr Cel can handle.
>
> 2.  I have no idea what you mean by "incidents per year".  Please explain.
>
> 3.  Even though you didn't ask:  You are apparently a student at an
> advanced level.  At your level I would guess that your professors expect
> *YOU* to read thru the material available on the Internet on Solr and figure
> it out on your own, rather than just asking others to do your work for you.
>  ;^)
>
> In particular, before asking further questions you should probably read
> thru http://wiki.apache.org/solr/FrontPage and
> http://lucene.apache.org/solr/tutorial.html .
>
> JRJ
>
> -----Original Message-----
> From: Dan Wu [mailto:wudan1017@gmail.com]
> Sent: Monday, October 24, 2011 12:43 PM
> To: solr-user@lucene.apache.org
> Subject: some basic information on Solr
>
>  Hi all,
>
> I am doing a student project on search engine research. Right now I have
> some basic questions about Slor.
>
> 1. How many types of data file Solr can support (estimate)? i.e. No. of
> file types solr can look at for indexing and searching.
>
> 2. How much is estimated cost of incidents per year for Solr ?
>
> Since the numbers could vary from different platforms, however we would
> like
> to know the estimate answers regarding the general cases.
>
> Thanks
>

RE: some basic information on Solr

Posted by "Jaeger, Jay - DOT" <Ja...@dot.wi.gov>.
1.  Solr, proper, does not index "files".  An adjunct called Solr Cel can.  See http://wiki.apache.org/solr/ExtractingRequestHandler .  That article describes which kinds of files it Solr Cel can handle.

2.  I have no idea what you mean by "incidents per year".  Please explain.

3.  Even though you didn't ask:  You are apparently a student at an advanced level.  At your level I would guess that your professors expect *YOU* to read thru the material available on the Internet on Solr and figure it out on your own, rather than just asking others to do your work for you.  ;^)

In particular, before asking further questions you should probably read thru http://wiki.apache.org/solr/FrontPage and http://lucene.apache.org/solr/tutorial.html .

JRJ

-----Original Message-----
From: Dan Wu [mailto:wudan1017@gmail.com] 
Sent: Monday, October 24, 2011 12:43 PM
To: solr-user@lucene.apache.org
Subject: some basic information on Solr

 Hi all,

I am doing a student project on search engine research. Right now I have
some basic questions about Slor.

1. How many types of data file Solr can support (estimate)? i.e. No. of
file types solr can look at for indexing and searching.

2. How much is estimated cost of incidents per year for Solr ?

Since the numbers could vary from different platforms, however we would like
to know the estimate answers regarding the general cases.

Thanks



-- 
Dan Wu (Fiona Wu)  武丹
Master of Engineering Management Program Degree Candidate
Duke University, North Carolina, USA
Email: dan.wu@duke.edu
Tel: 919-599-2730