Posted to dev@nlpcraft.apache.org by "Sergey Kamov (Jira)" <ji...@apache.org> on 2021/01/25 13:11:00 UTC
[jira] [Resolved] (NLPCRAFT-223) Suspicious log output.
[ https://issues.apache.org/jira/browse/NLPCRAFT-223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sergey Kamov resolved NLPCRAFT-223.
-----------------------------------
Assignee: Aaron Radzinski (was: Sergey Kamov)
Resolution: Fixed
Ready for review in NLPCRAFT-223.
> Suspicious log output.
> ----------------------
>
> Key: NLPCRAFT-223
> URL: https://issues.apache.org/jira/browse/NLPCRAFT-223
> Project: NLPCraft
> Issue Type: Improvement
> Components: probe
> Reporter: Sergey Kamov
> Assignee: Aaron Radzinski
> Priority: Major
> Fix For: 0.7.4
>
>
> See NCEnricherLimitSpec.
> See the following variant:
> Jan-19|17:15:52|INF| NCAsciiTable - (NCAsciiTable.scala:626) - Parsing variant #1 of 3 for: 'handful of a b'
> +===================================================================================================================+
> | idx | origtext   | lemma      | pos | quoted | stopword | freeword | wordindexes | direct | sparsity | token data |
> +===================================================================================================================+
> | 0   | handful of | handful of | --- | false  | false    | true     | [0,1]       | true   | 0        |            |
> | 1   | A          | a          | DT  | false  | false    | false    | [2]         | true   | 0        | <<A>> id=A |
> | 2   | B          | b          | NN  | false  | false    | false    | [3]         | true   | 0        | <<B>> id=B |
> +-------------------------------------------------------------------------------------------------------------------+
>
> The first token `handful of` seems strange.
> It is not a `stop words` chunk and it isn't a `limit`, but it spans two separate words (wordindexes [0,1]).
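
The condition described in the report can be sketched as a small check over the logged rows. This is only an illustration, not NLPCraft code: `TokenRow` and `suspicious` are hypothetical names, the fields mirror a few column headers from the table above, and treating an empty `token data` cell as "not a limit or any other matched element" is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TokenRow:
    """One row of the logged table (hypothetical type; fields mirror the column headers)."""
    idx: int
    origtext: str
    stopword: bool
    freeword: bool
    wordindexes: List[int]
    token_data: str  # empty string when the 'token data' cell is blank


def suspicious(rows: List[TokenRow]) -> List[TokenRow]:
    """Return rows that span several words but are neither stop words nor matched elements."""
    return [
        r for r in rows
        if len(r.wordindexes) > 1 and not r.stopword and not r.token_data
    ]


# Rows copied from the log output quoted in the issue.
rows = [
    TokenRow(0, "handful of", False, True, [0, 1], ""),
    TokenRow(1, "A", False, False, [2], "<<A>> id=A"),
    TokenRow(2, "B", False, False, [3], "<<B>> id=B"),
]

flagged = suspicious(rows)
print([r.origtext for r in flagged])
```

With the three rows from the log, only the `handful of` row is flagged, which matches the token the reporter calls out.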
--
This message was sent by Atlassian Jira
(v8.3.4#803005)