Posted to dev@lucene.apache.org by "Hoss Man (Updated) (JIRA)" <ji...@apache.org> on 2012/03/09 00:07:59 UTC
[jira] [Updated] (SOLR-3124) explain output is confusing when using
trie fields (or any field type where the indexed terms are not human
readable)
[ https://issues.apache.org/jira/browse/SOLR-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hoss Man updated SOLR-3124:
---------------------------
Description:
using the trunk example schema containing...
{noformat}
<fieldType name="tint" class="solr.TrieIntField" precisionStep="8" positionIncrementGap="0"/>
<dynamicField name="*_ti" type="tint" indexed="true" stored="true"/>
{noformat}
...and indexing the doc...
{noformat}
$ java -Ddata=args -jar post.jar '<add><doc><field name="id">HOSS</field><field name="foo_ti">42</field></doc></add>'
{noformat}
...results in a query for [foo_ti:42|http://localhost:8983/solr/select?q=foo_ti:42&start=0&rows=10&wt=json&debug.explain.structured=true&debugQuery=true&indent=true] producing the following debug output...
{noformat}
"debug":{
"rawquerystring":"foo_ti:42",
"querystring":"foo_ti:42",
"parsedquery":"foo_ti:42",
"parsedquery_toString":"foo_ti:`\b\u0000\u0000\u0000*",
"explain":{
"HOSS":{
"match":true,
"value":3.6741486,
"description":"weight(foo_ti:`\b\u0000\u0000\u0000* in 0) [DefaultSimilarity], result of:",
"details":[{
"match":true,
"value":3.6741486,
"description":"fieldWeight in 0, product of:",
"details":[{
"match":true,
"value":1.0,
"description":"tf(freq=1.0), with freq of:",
"details":[{
"match":true,
"value":1.0,
"description":"termFreq=1.0"}]},
{
"match":true,
"value":3.6741486,
"description":"idf(docFreq=1, maxDocs=29)"},
{
"match":true,
"value":1.0,
"description":"fieldNorm(doc=0)"}]}]}},
...
{noformat}
was:
defType=edismax&boost=query($param)&param=specialties_ids:32&debugQuery=true
<str name="2H7DF">
6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)), product of:
1.0 = (MATCH) MatchAllDocsQuery, product of:
1.0 = queryNorm
6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252
</str><str name="X5PJW">
6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)), product of:
1.0 = (MATCH) MatchAllDocsQuery, product of:
1.0 = queryNorm
6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252
</str>
Summary: explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable) (was: explain output looks unreadable when using boost and edismax - #0; ?)
Generalizing the summary & description, since the issue actually has nothing to do with "boosting", and clarifying exactly how to reproduce it (the field types used matter).
Bill: the fundamental problem is that the code that generates explain information works with the indexed terms in the queries, which for some field types are not human readable. The Solr FieldType classes know how to format those indexed terms as readable strings, but the code that generates Explanation objects lives at a lower level, in Lucene, and doesn't know about the schema at all.
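To make the mismatch concrete, here is a minimal, self-contained sketch of how an int such as 42 can end up as the term shown in the debug output above. It does not call Lucene or Solr; the class name TrieTermSketch, its method names, and the 0x60 marker constant are assumptions made for illustration, modelled on my understanding of the trie prefix coding (a shift marker char followed by 7 bits per char of the sign-flipped value). It only handles full-precision terms (shift 0).

```java
// Hypothetical illustration only: a re-implementation sketch, not Lucene's or Solr's code.
final class TrieTermSketch {
    // Assumed marker: the first char of an int term is this base plus the shift.
    static final int SHIFT_START_INT = 0x60;

    /** Encodes an int as a full-precision (shift = 0) six-char indexed term. */
    static String encode(int value) {
        char[] buf = new char[6];
        buf[0] = (char) SHIFT_START_INT;            // shift 0 gives '`'
        int sortableBits = value ^ 0x80000000;      // flip the sign bit so terms sort numerically
        for (int i = 5; i >= 1; i--) {
            buf[i] = (char) (sortableBits & 0x7f);  // 7 payload bits per char, lowest bits last
            sortableBits >>>= 7;
        }
        return new String(buf);
    }

    /** Reverses encode(): the kind of step a schema-aware layer would apply before display. */
    static int decode(String term) {
        int shift = term.charAt(0) - SHIFT_START_INT;
        int sortableBits = 0;
        for (int i = 1; i < term.length(); i++) {
            sortableBits = (sortableBits << 7) | term.charAt(i);
        }
        return (sortableBits << shift) ^ 0x80000000;
    }

    /** Renders control chars the way the JSON debug output shows them. */
    static String escape(String term) {
        StringBuilder sb = new StringBuilder();
        for (char c : term.toCharArray()) {
            if (c == '\b') {
                sb.append("\\b");
            } else if (c < 0x20) {
                sb.append(String.format("\\u%04x", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

Under these assumptions, TrieTermSketch.escape(TrieTermSketch.encode(42)) gives `\b\u0000\u0000\u0000*, the same string that appears after "foo_ti:" in parsedquery_toString, and TrieTermSketch.decode turns that term back into 42. The explain code only ever sees the encoded form; the decode step needs knowledge of the field type, which is what the schema provides.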
> explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable)
> ---------------------------------------------------------------------------------------------------------------------
>
> Key: SOLR-3124
> URL: https://issues.apache.org/jira/browse/SOLR-3124
> Project: Solr
> Issue Type: Bug
> Affects Versions: 3.5
> Reporter: Bill Bell
>