Posted to commits@lucene.apache.org by dw...@apache.org on 2022/09/23 10:59:52 UTC
[lucene] branch main updated (5b24a233bdf -> 3a04aa44c22)
This is an automated email from the ASF dual-hosted git repository.
dweiss pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/lucene.git
from 5b24a233bdf LUCENE-10425:speed up IndexSortSortedNumericDocValuesRangeQuery#BoundedDocSetIdIterator construction using bkd binary search (#687)
add 3a04aa44c22 Fix repeating token sentence boundary bug (#11734)
No new revisions were added by this update.
Summary of changes:
lucene/CHANGES.txt                                 |  6 ++
.../analysis/opennlp/OpenNLPChunkerFilter.java     | 63 +++++++----------
.../analysis/opennlp/OpenNLPLemmatizerFilter.java  | 63 +++++++----------
.../lucene/analysis/opennlp/OpenNLPPOSFilter.java  | 63 ++++++++---------
.../lucene/analysis/opennlp/OpenNLPTokenizer.java  | 19 +++--
.../opennlp/SentenceAttributeExtractor.java        | 81 +++++++++++++++++++++
.../analysis/opennlp/data/early-exit-bug-input.txt | 23 ++++++
.../opennlp/data/early-exit-bug-output.txt         | 32 +++++++++
.../opennlp/TestOpenNLPChunkerFilterFactory.java   | 12 ++++
.../TestOpenNLPLemmatizerFilterFactory.java        | 82 ++++++++++++++++++++++
.../opennlp/TestOpenNLPPOSFilterFactory.java       | 24 +++++++
.../{TypeAttribute.java => SentenceAttribute.java} | 24 ++++---
...tributeImpl.java => SentenceAttributeImpl.java} | 50 +++++++------
13 files changed, 391 insertions(+), 151 deletions(-)
create mode 100644 lucene/analysis/opennlp/src/java/org/apache/lucene/analysis/opennlp/SentenceAttributeExtractor.java
create mode 100644 lucene/analysis/opennlp/src/test-files/org/apache/lucene/analysis/opennlp/data/early-exit-bug-input.txt
create mode 100644 lucene/analysis/opennlp/src/test-files/org/apache/lucene/analysis/opennlp/data/early-exit-bug-output.txt
copy lucene/core/src/java/org/apache/lucene/analysis/tokenattributes/{TypeAttribute.java => SentenceAttribute.java} (66%)
copy lucene/core/src/java/org/apache/lucene/analysis/tokenattributes/{FlagsAttributeImpl.java => SentenceAttributeImpl.java} (58%)