You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "wuda (Jira)" <ji...@apache.org> on 2021/07/24 15:40:00 UTC

[jira] [Created] (LUCENE-10035) Simple text codec add multi level skip list data

wuda created LUCENE-10035:
-----------------------------

             Summary: Simple text codec add  multi level skip list data 
                 Key: LUCENE-10035
                 URL: https://issues.apache.org/jira/browse/LUCENE-10035
             Project: Lucene - Core
          Issue Type: New Feature
          Components: core/codecs
    Affects Versions: main (9.0)
            Reporter: wuda


Simple text codec add skip list data( include impact) to help understand index format,For debugging, curiosity, transparency only!! When term's docFreq greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default value is 8), Simple text codec will write skip list, the *.pst (simple text term dictionary file)* file will looks like this
{code:java}
field title
  term args
    doc 2
      freq 2
      pos 7
      pos 10
    ## we omit docs for better view ......
    doc 98
      freq 2
      pos 2
      pos 6
    skipList 
?
      level 1
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end 
?
      level 0
        skipDoc 17
        skipDocFP 284
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
        impacts_end         
        skipDoc 34
        skipDocFP 624
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 14
        impacts_end         
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end         
        skipDoc 90
        skipDocFP 1311
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 10
          impact 
            freq 3
            norm 13
          impact 
            freq 4
            norm 14
        impacts_end 
END
checksum 00000000000829315543

{code}
compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, impact, freq, norm* nodes, at the same, simple text codec can support advanceShallow when search time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org