You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@solr.apache.org by Doug Turnbull <do...@reddit.com.INVALID> on 2023/02/22 15:49:28 UTC

Inconsistent Solr Learning to Rank Feature Store State - SolrCloud / 7.7

Hi all,

In SolrCloud mode, Solr 7.7, I uploaded a feature store, and notice it in
zookeeper under the collection's config. I've tried to upload various
feature store names with the command:

```
 curl -XPUT "http://localhost:8983/solr/collection/schema/feature-store"
--data-binary "@ltr/collection/feature_set.json" -H
'Content-type:application/json'

{
  "responseHeader":{
    "status":0,
    "QTime":19}}
```

The json file contents are:

```
[
    {
      "name" : "title_bm25",
      "class" : "org.apache.solr.ltr.feature.SolrFeature",
      "params" : {
        "q" : "title:(${keywords})"
      }
    },
    {
      "name" : "title_len",
      "class" : "org.apache.solr.ltr.feature.FieldLengthFeature",
      "params" : {
        "field": "title"
      }
    }
]
```

Everytime I curl to see what feature stores exist, I get an inconsistent
response from the node I uploaded to:

```

$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":2},
  "featureStores":[]}
(solr/search-v2/pool-v)douglas.turnbull@solr-prod-v2-

$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":0},
  "featureStores":[""]}

$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":103},
  "featureStores":[]}

$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":0},
  "featureStores":["_DEFAULT",
  "_DEFAULT_"]}

$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":0},
  "featureStores":["foo", "_DEFAULT_"]}
$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
{
  "responseHeader":{
    "status":0,
    "QTime":103},
  "featureStores":[]}

```

Similarly when I query for _DEFAULT_ directly:

```
$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'

reddit-search-config$
{
  "responseHeader":{
    "status":0,
    "QTime":2},
  "features":[]}


$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
{
  "responseHeader":{
    "status":0,
    "QTime":1},
  "features":[{
      "name":"title_bm25",
      "class":"org.apache.solr.ltr.feature.SolrFeature",
      "params":{"q":"title:(${keywords})"},
      "store":"_DEFAULT_"},
    {
      "name":"title_len",
      "class":"org.apache.solr.ltr.feature.FieldLengthFeature",
      "params":{"field":"title"},
      "store":"_DEFAULT_"}]}
$ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'

reddit-search-config$
{
  "responseHeader":{
    "status":0,
    "QTime":1},
  "features":[]}
```

Is this a known issue? Seems like a bug, but let me know if there's a
workaround or something I should be doing differently

Thanks
-Doug

Re: Inconsistent Solr Learning to Rank Feature Store State - SolrCloud / 7.7

Posted by Alessandro Benedetti <a....@sease.io>.
Doug, you are always too kind :)
I'm glad our blog post helped!
We have an additional contribution in progress that should be ready soon:
https://issues.apache.org/jira/browse/SOLR-15493

Cheers

--------------------------
*Alessandro Benedetti*
Director @ Sease Ltd.
*Apache Lucene/Solr Committer*
*Apache Solr PMC Member*

e-mail: a.benedetti@sease.io


*Sease* - Information Retrieval Applied
Consulting | Training | Open Source

Website: Sease.io <http://sease.io/>
LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter
<https://twitter.com/seaseltd> | Youtube
<https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github
<https://github.com/seaseltd>


On Wed, 22 Feb 2023 at 17:01, Doug Turnbull
<do...@reddit.com.invalid> wrote:

> Oof I mean Anna Rugerro :) but also listen to Alessandro, he's smart too!
>
> On Wed, Feb 22, 2023 at 10:57 AM Doug Turnbull <
> douglas.turnbull@reddit.com>
> wrote:
>
> > It's important to always listen to Alessandro Benedetti. Reloading the
> > collection is the solution
> >
> >
> >
> https://sease.io/2021/03/common-errors-with-apache-solr-feature-stores-and-models.html
> >
> >
> >
> > On Wed, Feb 22, 2023 at 10:49 AM Doug Turnbull <
> > douglas.turnbull@reddit.com> wrote:
> >
> >> Hi all,
> >>
> >> In SolrCloud mode, Solr 7.7, I uploaded a feature store, and notice it
> in
> >> zookeeper under the collection's config. I've tried to upload various
> >> feature store names with the command:
> >>
> >> ```
> >>  curl -XPUT "http://localhost:8983/solr/collection/schema/feature-store
> "
> >> --data-binary "@ltr/collection/feature_set.json" -H
> >> 'Content-type:application/json'
> >>
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":19}}
> >> ```
> >>
> >> The json file contents are:
> >>
> >> ```
> >> [
> >>     {
> >>       "name" : "title_bm25",
> >>       "class" : "org.apache.solr.ltr.feature.SolrFeature",
> >>       "params" : {
> >>         "q" : "title:(${keywords})"
> >>       }
> >>     },
> >>     {
> >>       "name" : "title_len",
> >>       "class" : "org.apache.solr.ltr.feature.FieldLengthFeature",
> >>       "params" : {
> >>         "field": "title"
> >>       }
> >>     }
> >> ]
> >> ```
> >>
> >> Everytime I curl to see what feature stores exist, I get an inconsistent
> >> response from the node I uploaded to:
> >>
> >> ```
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":2},
> >>   "featureStores":[]}
> >> (solr/search-v2/pool-v)douglas.turnbull@solr-prod-v2-
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":0},
> >>   "featureStores":[""]}
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":103},
> >>   "featureStores":[]}
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":0},
> >>   "featureStores":["_DEFAULT",
> >>   "_DEFAULT_"]}
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":0},
> >>   "featureStores":["foo", "_DEFAULT_"]}
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":103},
> >>   "featureStores":[]}
> >>
> >> ```
> >>
> >> Similarly when I query for _DEFAULT_ directly:
> >>
> >> ```
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
>                                                  reddit-search-config$
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":2},
> >>   "features":[]}
> >>
> >>
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":1},
> >>   "features":[{
> >>       "name":"title_bm25",
> >>       "class":"org.apache.solr.ltr.feature.SolrFeature",
> >>       "params":{"q":"title:(${keywords})"},
> >>       "store":"_DEFAULT_"},
> >>     {
> >>       "name":"title_len",
> >>       "class":"org.apache.solr.ltr.feature.FieldLengthFeature",
> >>       "params":{"field":"title"},
> >>       "store":"_DEFAULT_"}]}
> >> $ curl -XGET '
> http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
>                                                  reddit-search-config$
> >> {
> >>   "responseHeader":{
> >>     "status":0,
> >>     "QTime":1},
> >>   "features":[]}
> >> ```
> >>
> >> Is this a known issue? Seems like a bug, but let me know if there's a
> workaround or something I should be doing differently
> >>
> >> Thanks
> >> -Doug
> >>
> >>
>

Re: Inconsistent Solr Learning to Rank Feature Store State - SolrCloud / 7.7

Posted by Doug Turnbull <do...@reddit.com.INVALID>.
Oof I mean Anna Rugerro :) but also listen to Alessandro, he's smart too!

On Wed, Feb 22, 2023 at 10:57 AM Doug Turnbull <do...@reddit.com>
wrote:

> It's important to always listen to Alessandro Benedetti. Reloading the
> collection is the solution
>
>
> https://sease.io/2021/03/common-errors-with-apache-solr-feature-stores-and-models.html
>
>
>
> On Wed, Feb 22, 2023 at 10:49 AM Doug Turnbull <
> douglas.turnbull@reddit.com> wrote:
>
>> Hi all,
>>
>> In SolrCloud mode, Solr 7.7, I uploaded a feature store, and notice it in
>> zookeeper under the collection's config. I've tried to upload various
>> feature store names with the command:
>>
>> ```
>>  curl -XPUT "http://localhost:8983/solr/collection/schema/feature-store"
>> --data-binary "@ltr/collection/feature_set.json" -H
>> 'Content-type:application/json'
>>
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":19}}
>> ```
>>
>> The json file contents are:
>>
>> ```
>> [
>>     {
>>       "name" : "title_bm25",
>>       "class" : "org.apache.solr.ltr.feature.SolrFeature",
>>       "params" : {
>>         "q" : "title:(${keywords})"
>>       }
>>     },
>>     {
>>       "name" : "title_len",
>>       "class" : "org.apache.solr.ltr.feature.FieldLengthFeature",
>>       "params" : {
>>         "field": "title"
>>       }
>>     }
>> ]
>> ```
>>
>> Everytime I curl to see what feature stores exist, I get an inconsistent
>> response from the node I uploaded to:
>>
>> ```
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":2},
>>   "featureStores":[]}
>> (solr/search-v2/pool-v)douglas.turnbull@solr-prod-v2-
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":0},
>>   "featureStores":[""]}
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":103},
>>   "featureStores":[]}
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":0},
>>   "featureStores":["_DEFAULT",
>>   "_DEFAULT_"]}
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":0},
>>   "featureStores":["foo", "_DEFAULT_"]}
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":103},
>>   "featureStores":[]}
>>
>> ```
>>
>> Similarly when I query for _DEFAULT_ directly:
>>
>> ```
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'                                                     reddit-search-config$
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":2},
>>   "features":[]}
>>
>>
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":1},
>>   "features":[{
>>       "name":"title_bm25",
>>       "class":"org.apache.solr.ltr.feature.SolrFeature",
>>       "params":{"q":"title:(${keywords})"},
>>       "store":"_DEFAULT_"},
>>     {
>>       "name":"title_len",
>>       "class":"org.apache.solr.ltr.feature.FieldLengthFeature",
>>       "params":{"field":"title"},
>>       "store":"_DEFAULT_"}]}
>> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'                                                     reddit-search-config$
>> {
>>   "responseHeader":{
>>     "status":0,
>>     "QTime":1},
>>   "features":[]}
>> ```
>>
>> Is this a known issue? Seems like a bug, but let me know if there's a workaround or something I should be doing differently
>>
>> Thanks
>> -Doug
>>
>>

Re: Inconsistent Solr Learning to Rank Feature Store State - SolrCloud / 7.7

Posted by Doug Turnbull <do...@reddit.com.INVALID>.
It's important to always listen to Alessandro Benedetti. Reloading the
collection is the solution

https://sease.io/2021/03/common-errors-with-apache-solr-feature-stores-and-models.html



On Wed, Feb 22, 2023 at 10:49 AM Doug Turnbull <do...@reddit.com>
wrote:

> Hi all,
>
> In SolrCloud mode, Solr 7.7, I uploaded a feature store, and notice it in
> zookeeper under the collection's config. I've tried to upload various
> feature store names with the command:
>
> ```
>  curl -XPUT "http://localhost:8983/solr/collection/schema/feature-store"
> --data-binary "@ltr/collection/feature_set.json" -H
> 'Content-type:application/json'
>
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":19}}
> ```
>
> The json file contents are:
>
> ```
> [
>     {
>       "name" : "title_bm25",
>       "class" : "org.apache.solr.ltr.feature.SolrFeature",
>       "params" : {
>         "q" : "title:(${keywords})"
>       }
>     },
>     {
>       "name" : "title_len",
>       "class" : "org.apache.solr.ltr.feature.FieldLengthFeature",
>       "params" : {
>         "field": "title"
>       }
>     }
> ]
> ```
>
> Everytime I curl to see what feature stores exist, I get an inconsistent
> response from the node I uploaded to:
>
> ```
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":2},
>   "featureStores":[]}
> (solr/search-v2/pool-v)douglas.turnbull@solr-prod-v2-
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":0},
>   "featureStores":[""]}
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":103},
>   "featureStores":[]}
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":0},
>   "featureStores":["_DEFAULT",
>   "_DEFAULT_"]}
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":0},
>   "featureStores":["foo", "_DEFAULT_"]}
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":103},
>   "featureStores":[]}
>
> ```
>
> Similarly when I query for _DEFAULT_ directly:
>
> ```
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'                                                     reddit-search-config$
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":2},
>   "features":[]}
>
>
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":1},
>   "features":[{
>       "name":"title_bm25",
>       "class":"org.apache.solr.ltr.feature.SolrFeature",
>       "params":{"q":"title:(${keywords})"},
>       "store":"_DEFAULT_"},
>     {
>       "name":"title_len",
>       "class":"org.apache.solr.ltr.feature.FieldLengthFeature",
>       "params":{"field":"title"},
>       "store":"_DEFAULT_"}]}
> $ curl -XGET 'http://localhost:8983/solr/collection/schema/feature-store/_DEFAULT_'                                                     reddit-search-config$
> {
>   "responseHeader":{
>     "status":0,
>     "QTime":1},
>   "features":[]}
> ```
>
> Is this a known issue? Seems like a bug, but let me know if there's a workaround or something I should be doing differently
>
> Thanks
> -Doug
>
>