You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nlpcraft.apache.org by "Sergey Kamov (Jira)" <ji...@apache.org> on 2021/01/19 14:20:00 UTC

[jira] [Created] (NLPCRAFT-223) Suspicious log output.

Sergey Kamov created NLPCRAFT-223:
-------------------------------------

             Summary: Suspicious log output.
                 Key: NLPCRAFT-223
                 URL: https://issues.apache.org/jira/browse/NLPCRAFT-223
             Project: NLPCraft
          Issue Type: Improvement
          Components: probe
            Reporter: Sergey Kamov
            Assignee: Aaron Radzinski
             Fix For: 0.7.4


See {color:#000000}NCEnricherLimitSpec.
See variant
{color}

{color:#000000}Jan-19|17:15:52|INF| NCAsciiTable - (NCAsciiTable.scala:626) - Parsing variant #1 of 3 for: 'handful of a b'
+===================================================================================================================+
| idx | origtext | lemma | pos | quoted | stopword | freeword | wordindexes | direct | sparsity | token data |
+===================================================================================================================+
| 0 | handful of | handful of | --- | false | false | true | [0,1] | true | 0 | |
| 1 | A | a | DT | false | false | false | [2] | true | 0 | <<A>> id=A |
| 2 | B | b | NN | false | false | false | [3] | true | 0 | <<B>> id=B |
+-------------------------------------------------------------------------------------------------------------------+{color}

 

{color:#000000}The first token `handful of` seems strange.{color}

{color:#000000}It is not `stop words chunk, it isn't `limit`, but it contains 2 split words. {color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)