You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by "Musshorn, Kris T CTR USARMY RDECOM ARL (US)" <kr...@mail.mil> on 2016/07/28 14:06:32 UTC

RE: [Non-DoD Source] Re: config question (UNCLASSIFIED)

CLASSIFICATION: UNCLASSIFIED

Thank you for your reply Alexandre.

Mycollection1 is a core I made when I set up solr.
I did this because solr does not have any cores by itself and I thought that nutch would need a core to work.
Was my assumption correct?

\solr\server\solr\mycollection1\conf\managed-schema file does not have an .xml extension yet looks like a verbose schema.xml file.
Opening the managed schema file in a text editor shows :
"This is the Solr schema file. This file should be named "schema.xml" and should be in the conf directory under the solr home (i.e. ./solr/conf/schema.xml by default) or located where the classloader for the Solr webapp can find it."

Should I rename this Schema.xml and leave it in the current location? How should I handle insertion of the nutch schema?


Thanks,
Kris

~~~~~~~~~~~~~~~~~~~~~~~~~~
Kris T. Musshorn
FileMaker Developer - Contractor – Catapult Technology Inc.      
US Army Research Lab 
Aberdeen Proving Ground 
Application Management & Development Branch 
410-278-7251
kris.t.musshorn.ctr@mail.mil
~~~~~~~~~~~~~~~~~~~~~~~~~~


-----Original Message-----
From: Alexandre Rafalovitch [mailto:arafalov@gmail.com] 
Sent: Thursday, July 28, 2016 9:38 AM
To: solr-user <so...@lucene.apache.org>
Subject: [Non-DoD Source] Re: config question (UNCLASSIFIED)

All active links contained in this email were disabled.  Please verify the identity of the sender, and confirm the authenticity of all links contained within the message prior to copying and pasting the address to a Web browser.  




----

I haven't been through this tutorial before, but from a quick look I see two issues:
1) If you don't see schema.xml file in the directory, that means it uses dynamic schema instead (called managed-schema). Try copying your file and replacing managed-schema file.
2) It talks about "collection1". That used to be a default collection, so you did not need to use collection name in the URL. I can't remember when that changed, but probably has by 5.5. So, you should be able to use any other names but ensure that all URLs referencing it include collection name. Specifically, "Caution-http://localhost:8983/solr" is assuming the default collection feature. Instead, it should be Caution-http://localhost:8983/solr/collectionname everywhere.
3) Do you actually have an otherwise-working core/collection in the solr/server/solr/mycollection1? Did you create it with some command (e.g. bin/solr create_core -c mycollection1?)?

If you are still having problems, we may need a bit more specific details on the error messages and command lines used (especially the full expanded URL commands that hit Solr itself). Also, if you are on Windows, this may be worth mentioning in the follow-up messages. Some commands are not available on Windows, such as bin\post (instead post.jar is used).

Regards,
   Alex.
----
Newsletter and resources for Solr beginners and intermediates:
Caution-http://www.solr-start.com/


On 28 July 2016 at 23:22, Musshorn, Kris T CTR USARMY RDECOM ARL (US) <kr...@mail.mil> wrote:
> CLASSIFICATION: UNCLASSIFIED
>
> I am trying to integrate nutch 1.12 with solr 5.5.2.
>
> In the setup documents from here... Caution-https://wiki.apache.org/nutch/NutchTutorial it says to replace the schema.xml file in the core with the schema.xml from nutch.
>
> The install of solr does not have a schema.xml file anywhere.
>
> I moved the schema.xml from nutch to \solr\server\solr\mycollection1\conf\ and tried to index into solr from the tutorial and got errors.
>
> I really need some pro help here.
>
> I tried to post the same question to the nutch list but have not gotten any responses.
>
> Thanks,
> Kris
>
> ~~~~~~~~~~~~~~~~~~~~~~~~~~
> Kris T. Musshorn
> FileMaker Developer - Contractor - Catapult Technology Inc.
> US Army Research Lab
> Aberdeen Proving Ground
> Application Management & Development Branch
> 410-278-7251
> kris.t.musshorn.ctr@mail.mil
> ~~~~~~~~~~~~~~~~~~~~~~~~~~
>
>
>
> CLASSIFICATION: UNCLASSIFIED


CLASSIFICATION: UNCLASSIFIED

RE: [Non-DoD Source] Re: config question (UNCLASSIFIED)

Posted by "Musshorn, Kris T CTR USARMY RDECOM ARL (US)" <kr...@mail.mil>.
CLASSIFICATION: UNCLASSIFIED

Alexandre..

Deleting managed-schema file from the core conf folder and restarting solr with the schema.xml file in its place produced 29 errors and the core could not be started. 

Errors started with:

org.apache.solr.common.SolrException: Could not load conf for core collection1: Can't load schema /home/musshorn/tutorial/solr/server/solr/Mycollection1/conf/schema.xml: Plugin init failure for [schema.xml] fieldType "text_general": Plugin init failure for [schema.xml] analyzer/filter: Error instantiating class: 'org.apache.lucene.analysis.core.StopFilterFactory'


Thanks,
Kris

~~~~~~~~~~~~~~~~~~~~~~~~~~
Kris T. Musshorn
FileMaker Developer - Contractor – Catapult Technology Inc.      
US Army Research Lab 
Aberdeen Proving Ground 
Application Management & Development Branch 
410-278-7251
kris.t.musshorn.ctr@mail.mil
~~~~~~~~~~~~~~~~~~~~~~~~~~


-----Original Message-----
From: Musshorn, Kris T CTR USARMY RDECOM ARL (US) 
Sent: Thursday, July 28, 2016 10:37 AM
To: 'Alexandre Rafalovitch' <ar...@gmail.com>
Subject: RE: [Non-DoD Source] Re: config question (UNCLASSIFIED)

CLASSIFICATION: UNCLASSIFIED

I currently have:

\solr\server\solr\mycollection1\conf\managed-schema 
And
\solr\server\solr\mycollection1\conf\schema.xml

are you saying I should have just 
\solr\server\solr\mycollection1\conf\schema.xml 
and remove 
\solr\server\solr\mycollection1\conf\managed-schema 

I have no plans to use the ui to manage schema.

Sorry but im relatively new to solr.

Thanks,
Kris

~~~~~~~~~~~~~~~~~~~~~~~~~~
Kris T. Musshorn
FileMaker Developer - Contractor – Catapult Technology Inc.      
US Army Research Lab 
Aberdeen Proving Ground 
Application Management & Development Branch 
410-278-7251
kris.t.musshorn.ctr@mail.mil
~~~~~~~~~~~~~~~~~~~~~~~~~~


-----Original Message-----
From: Alexandre Rafalovitch [mailto:arafalov@gmail.com] 
Sent: Thursday, July 28, 2016 10:14 AM
To: Musshorn, Kris T CTR USARMY RDECOM ARL (US) <kr...@mail.mil>
Subject: Re: [Non-DoD Source] Re: config question (UNCLASSIFIED)

All active links contained in this email were disabled.  Please verify the identity of the sender, and confirm the authenticity of all links contained within the message prior to copying and pasting the address to a Web browser.  




----

If your Solr loads and can query (even without results) your
Mycollection1 - you've done right.

managed-schema bootstraps from schema.xml. After the first modification via UI, it gets completely reordered, rewritten and looses all its comments. In your case, I suggest you either copy your Nutch's schema.xml over managed-schema or remove managed-schema and put schema.xml to bootstrap from (while Solr is not running). Either way, you should end up with just one of those two files and it will be managed-schema once Solr gets to it.

Hope that helps.

Regards,
    Alex.
----
Newsletter and resources for Solr beginners and intermediates:
Caution-http://www.solr-start.com/


On 29 July 2016 at 00:06, Musshorn, Kris T CTR USARMY RDECOM ARL (US) <kr...@mail.mil> wrote:
> CLASSIFICATION: UNCLASSIFIED
>
> Thank you for your reply Alexandre.
>
> Mycollection1 is a core I made when I set up solr.
> I did this because solr does not have any cores by itself and I thought that nutch would need a core to work.
> Was my assumption correct?
>
> \solr\server\solr\mycollection1\conf\managed-schema file does not have an .xml extension yet looks like a verbose schema.xml file.
> Opening the managed schema file in a text editor shows :
> "This is the Solr schema file. This file should be named "schema.xml" and should be in the conf directory under the solr home (i.e. ./solr/conf/schema.xml by default) or located where the classloader for the Solr webapp can find it."
>
> Should I rename this Schema.xml and leave it in the current location? How should I handle insertion of the nutch schema?
>
>
> Thanks,
> Kris
>
> ~~~~~~~~~~~~~~~~~~~~~~~~~~
> Kris T. Musshorn
> FileMaker Developer - Contractor – Catapult Technology Inc.
> US Army Research Lab
> Aberdeen Proving Ground
> Application Management & Development Branch
> 410-278-7251
> kris.t.musshorn.ctr@mail.mil
> ~~~~~~~~~~~~~~~~~~~~~~~~~~
>
>
> -----Original Message-----
> From: Alexandre Rafalovitch [Caution-mailto:arafalov@gmail.com]
> Sent: Thursday, July 28, 2016 9:38 AM
> To: solr-user <so...@lucene.apache.org>
> Subject: [Non-DoD Source] Re: config question (UNCLASSIFIED)
>
> All active links contained in this email were disabled.  Please verify the identity of the sender, and confirm the authenticity of all links contained within the message prior to copying and pasting the address to a Web browser.
>
>
>
>
> ----
>
> I haven't been through this tutorial before, but from a quick look I see two issues:
> 1) If you don't see schema.xml file in the directory, that means it uses dynamic schema instead (called managed-schema). Try copying your file and replacing managed-schema file.
> 2) It talks about "collection1". That used to be a default collection, so you did not need to use collection name in the URL. I can't remember when that changed, but probably has by 5.5. So, you should be able to use any other names but ensure that all URLs referencing it include collection name. Specifically, "Caution-Caution-http://localhost:8983/solr" is assuming the default collection feature. Instead, it should be Caution-Caution-http://localhost:8983/solr/collectionname everywhere.
> 3) Do you actually have an otherwise-working core/collection in the solr/server/solr/mycollection1? Did you create it with some command (e.g. bin/solr create_core -c mycollection1?)?
>
> If you are still having problems, we may need a bit more specific details on the error messages and command lines used (especially the full expanded URL commands that hit Solr itself). Also, if you are on Windows, this may be worth mentioning in the follow-up messages. Some commands are not available on Windows, such as bin\post (instead post.jar is used).
>
> Regards,
>    Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> Caution-Caution-http://www.solr-start.com/
>
>
> On 28 July 2016 at 23:22, Musshorn, Kris T CTR USARMY RDECOM ARL (US) <kr...@mail.mil> wrote:
>> CLASSIFICATION: UNCLASSIFIED
>>
>> I am trying to integrate nutch 1.12 with solr 5.5.2.
>>
>> In the setup documents from here... Caution-Caution-https://wiki.apache.org/nutch/NutchTutorial it says to replace the schema.xml file in the core with the schema.xml from nutch.
>>
>> The install of solr does not have a schema.xml file anywhere.
>>
>> I moved the schema.xml from nutch to \solr\server\solr\mycollection1\conf\ and tried to index into solr from the tutorial and got errors.
>>
>> I really need some pro help here.
>>
>> I tried to post the same question to the nutch list but have not gotten any responses.
>>
>> Thanks,
>> Kris
>>
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~
>> Kris T. Musshorn
>> FileMaker Developer - Contractor - Catapult Technology Inc.
>> US Army Research Lab
>> Aberdeen Proving Ground
>> Application Management & Development Branch
>> 410-278-7251
>> kris.t.musshorn.ctr@mail.mil
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~
>>
>>
>>
>> CLASSIFICATION: UNCLASSIFIED
>
>
> CLASSIFICATION: UNCLASSIFIED


CLASSIFICATION: UNCLASSIFIED


CLASSIFICATION: UNCLASSIFIED