You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nlpcraft.apache.org by "Sergey Kamov (Jira)" <ji...@apache.org> on 2021/01/19 14:20:00 UTC
[jira] [Created] (NLPCRAFT-223) Suspicious log output.
Sergey Kamov created NLPCRAFT-223:
-------------------------------------
Summary: Suspicious log output.
Key: NLPCRAFT-223
URL: https://issues.apache.org/jira/browse/NLPCRAFT-223
Project: NLPCraft
Issue Type: Improvement
Components: probe
Reporter: Sergey Kamov
Assignee: Aaron Radzinski
Fix For: 0.7.4
See {color:#000000}NCEnricherLimitSpec.
See variant
{color}
{color:#000000}Jan-19|17:15:52|INF| NCAsciiTable - (NCAsciiTable.scala:626) - Parsing variant #1 of 3 for: 'handful of a b'
+===================================================================================================================+
| idx | origtext | lemma | pos | quoted | stopword | freeword | wordindexes | direct | sparsity | token data |
+===================================================================================================================+
| 0 | handful of | handful of | --- | false | false | true | [0,1] | true | 0 | |
| 1 | A | a | DT | false | false | false | [2] | true | 0 | <<A>> id=A |
| 2 | B | b | NN | false | false | false | [3] | true | 0 | <<B>> id=B |
+-------------------------------------------------------------------------------------------------------------------+{color}
{color:#000000}The first token `handful of` seems strange.{color}
{color:#000000}It is not `stop words chunk, it isn't `limit`, but it contains 2 split words. {color}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)