You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@lucene.apache.org by us...@apache.org on 2012/07/23 12:35:36 UTC
svn commit: r1364572 - in /lucene/dev/branches/branch_4x: ./ lucene/ solr/
solr/contrib/ solr/contrib/analysis-extras/ solr/contrib/clustering/
solr/contrib/dataimporthandler/ solr/contrib/extraction/
solr/contrib/langid/ solr/contrib/uima/
Author: uschindler
Date: Mon Jul 23 10:35:35 2012
New Revision: 1364572
URL: http://svn.apache.org/viewvc?rev=1364572&view=rev
Log:
Merged revision(s) 1364568, 1364570 from lucene/dev/trunk:
Copy over changes from 3.6 branch
Modified:
lucene/dev/branches/branch_4x/ (props changed)
lucene/dev/branches/branch_4x/lucene/ (props changed)
lucene/dev/branches/branch_4x/lucene/CHANGES.txt (contents, props changed)
lucene/dev/branches/branch_4x/solr/ (props changed)
lucene/dev/branches/branch_4x/solr/CHANGES.txt (contents, props changed)
lucene/dev/branches/branch_4x/solr/contrib/ (props changed)
lucene/dev/branches/branch_4x/solr/contrib/analysis-extras/CHANGES.txt
lucene/dev/branches/branch_4x/solr/contrib/clustering/CHANGES.txt
lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/CHANGES.txt
lucene/dev/branches/branch_4x/solr/contrib/extraction/CHANGES.txt
lucene/dev/branches/branch_4x/solr/contrib/langid/CHANGES.txt
lucene/dev/branches/branch_4x/solr/contrib/uima/CHANGES.txt
Modified: lucene/dev/branches/branch_4x/lucene/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/lucene/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/lucene/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/lucene/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -1238,7 +1238,60 @@ Build
tasks) to correctly encode build file names as URIs for later processing by
XSL. (Greg Bowyer, Uwe Schindler)
+
+======================= Lucene 3.6.1 =======================
+More information about this release, including any errata related to the
+release notes, upgrade instructions, or other changes may be found online at:
+ https://wiki.apache.org/lucene-java/Lucene3.6.1
+
+Bug Fixes
+
+* LUCENE-3969: Throw IAE on bad arguments that could cause confusing
+ errors in KeywordTokenizer.
+ (Uwe Schindler, Mike McCandless, Robert Muir)
+
+* LUCENE-3971: MappingCharFilter could return invalid final token position.
+ (Dawid Weiss, Robert Muir)
+
+* LUCENE-4023: DisjunctionMaxScorer now implements visitSubScorers().
+ (Uwe Schindler)
+
+* LUCENE-2566: + - operators allow any amount of whitespace (yonik, janhoy)
+
+* LUCENE-3590: Fix AIOOBE in BytesRef/CharsRef copyBytes/copyChars when
+ offset is nonzero, fix off-by-one in CharsRef.subSequence, and fix
+ CharsRef's CharSequence methods to throw exceptions in boundary cases
+ to properly meet the specification. (Robert Muir)
+
+* LUCENE-4222: TieredMergePolicy.getFloorSegmentMB was returning the
+ size in bytes not MB (Chris Fuller via Mike McCandless)
+
+API Changes
+
+* LUCENE-4023: Changed the visibility of Scorer#visitSubScorers() to
+ public, otherwise it's impossible to implement Scorers outside
+ the Lucene package. (Uwe Schindler)
+
+Optimizations
+
+* LUCENE-4163: Improve concurrency of MMapIndexInput.clone() by using
+ the new WeakIdentityMap on top of a ConcurrentHashMap to manage
+ the cloned instances. WeakIdentityMap was extended to support
+ iterating over its keys. (Uwe Schindler)
+
+Tests
+
+* LUCENE-3873: add MockGraphTokenFilter, testing analyzers with
+ random graph tokens. (Mike McCandless)
+
+* LUCENE-3968: factor out LookaheadTokenFilter from
+ MockGraphTokenFilter (Mike Mccandless)
+
+
======================= Lucene 3.6.0 =======================
+More information about this release, including any errata related to the
+release notes, upgrade instructions, or other changes may be found online at:
+ https://wiki.apache.org/lucene-java/Lucene3.6
Changes in backwards compatibility policy
@@ -1294,7 +1347,7 @@ Changes in backwards compatibility polic
* LUCENE-3712: Removed unused and untested ReaderUtil#subReader methods.
(Uwe Schindler)
-
+
* LUCENE-3672: Deprecate Directory.fileModified,
IndexCommit.getTimestamp and .getVersion and
IndexReader.lastModified and getCurrentVersion (Andrzej Bialecki,
@@ -1317,6 +1370,10 @@ Changes in backwards compatibility polic
* LUCENE-3738: All readXxx methods in BufferedIndexInput were made
final. Subclasses should only override protected readInternal /
seekInternal. (Uwe Schindler)
+
+* LUCENE-2599: Deprecated the spatial contrib module, which was buggy and not
+ well maintained. Lucene 4 includes a new spatial module that replaces this.
+ (David Smiley, Ryan McKinley, Chris Male)
Changes in Runtime Behavior
@@ -1358,7 +1415,7 @@ API Changes
query time, wrap your IndexReader using FilterIndexReader, overriding
FilterIndexReader.norms(). To persist the changes on disk, copy the
FilteredIndexReader to a new index using IndexWriter.addIndexes().
- In Lucene 4.0, Similarity will allow you to customize scoring
+ In Lucene 4.0, SimilarityProvider will allow you to customize scoring
using external norms, too. (Uwe Schindler, Robert Muir)
* LUCENE-3735: PayloadProcessorProvider was changed to return a
@@ -1383,7 +1440,7 @@ API Changes
never applying deletes). (MJB, Shai Erera, Mike McCandless)
* LUCENE-3761: Generalize SearcherManager into an abstract ReferenceManager.
- SearcherManager remains a concrete class, but due to the refactoring, the
+ SearcherManager remains a concrete class, but due to the refactoring, the
method maybeReopen has been deprecated in favor of maybeRefresh().
(Shai Erera, Mike McCandless, Simon Willnauer)
@@ -1408,7 +1465,7 @@ New Features
queries. Literal asterisks may be represented by quoting or escaping
(i.e. \* or "*") Custom QueryParser subclasses overriding getRangeQuery()
will be passed null for any open endpoint. (Ingo Renner, Adriano
- Crestani, yonik, Mike McCandless
+ Crestani, yonik, Mike McCandless
* LUCENE-3121: Add sugar reverse lookup (given an output, find the
input mapping to it) for FSTs that have strictly monotonic long
@@ -1428,7 +1485,7 @@ New Features
* LUCENE-3789: Expose MTQ TermsEnum via RewriteMethod for non package private
access (Simon Willnauer)
-
+
* LUCENE-3881: Added UAX29URLEmailAnalyzer: a standard analyzer that recognizes
URLs and emails. (Steve Rowe)
Modified: lucene/dev/branches/branch_4x/solr/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -809,7 +809,36 @@ Documentation
* SOLR-2232: Improved README info on solr.solr.home in examples
(Eric Pugh and hossman)
+================== 3.6.1 ==================
+More information about this release, including any errata related to the
+release notes, upgrade instructions, or other changes may be found online at:
+ https://wiki.apache.org/solr/Solr3.6.1
+
+Bug Fixes:
+
+* LUCENE-3969: Throw IAE on bad arguments that could cause confusing errors in
+ PatternTokenizer. CommonGrams populates PositionLengthAttribute correctly.
+ (Uwe Schindler, Mike McCandless, Robert Muir)
+
+* SOLR-3361: ReplicationHandler "maxNumberOfBackups" doesn't work if backups are triggered on commit
+ (James Dyer, Tomas Fernandez Lobbe)
+
+* SOLR-3375: Fix charset problems with HttpSolrServer (Roger HÃ¥kansson, yonik, siren)
+
+* SOLR-3436: Group count incorrect when not all shards are queried in the second
+ pass. (Francois Perron, Martijn van Groningen)
+
+* SOLR-3454: Exception when using result grouping with main=true and using
+ wt=javabin. (Ludovic Boutros, Martijn van Groningen)
+
+* SOLR-3489: Config file replication less error prone (Jochen Just via janhoy)
+
+* SOLR-3477: SOLR does not start up when no cores are defined (Tomás Fernández Löbbe via tommaso)
+
================== 3.6.0 ==================
+More information about this release, including any errata related to the
+release notes, upgrade instructions, or other changes may be found online at:
+ https://wiki.apache.org/solr/Solr3.6
Upgrading from Solr 3.5
----------------------
@@ -843,16 +872,16 @@ Upgrading from Solr 3.5
* SOLR-3161: Don't use the 'qt' parameter with a leading '/'. It probably won't work in 4.0
and it's now limited in 3.6 to SearchHandler subclasses that aren't lazy-loaded.
+* SOLR-2724: Specifying <defaultSearchField> and <solrQueryParser defaultOperator="..."/> in
+ schema.xml is now considered deprecated. Instead you are encouraged to specify these via the "df"
+ and "q.op" parameters in your request handler definition. (David Smiley)
+
* Bugs found and fixed in the SignatureUpdateProcessor that previously caused
some documents to produce the same signature even when the configured fields
contained distinct (non-String) values. Users of SignatureUpdateProcessor
are strongly advised that they should re-index as document signatures may
have now changed. (see SOLR-3200 & SOLR-3226 for details)
-* SOLR-2724: Specifying <defaultSearchField> and <solrQueryParser defaultOperator="..."/> in
- schema.xml is now considered deprecated. Instead you are encouraged to specify these via the "df"
- and "q.op" parameters in your request handler definition. (David Smiley)
-
New Features
----------------------
* SOLR-2020: Add Java client that uses Apache Http Components http client (4.x).
@@ -867,13 +896,13 @@ New Features
* SOLR-1565: StreamingUpdateSolrServer supports RequestWriter API and therefore, javabin update
format (shalin)
-
+
* SOLR-2438 added MultiTermAwareComponent to the various classes to allow automatic lowercasing
for multiterm queries (wildcards, regex, prefix, range, etc). You can now optionally specify a
"multiterm" analyzer in our schema.xml, but Solr should "do the right thing" if you don't
- specify <analyzer type="multiterm"> (Pete Sturge Erick Erickson, Mentoring from Seeley and Muir)
+ specify <fieldType="multiterm"> (Pete Sturge Erick Erickson, Mentoring from Seeley and Muir)
-* SOLR-2919: Added support for localized range queries when the analysis chain uses
+* SOLR-2919: Added support for localized range queries when the analysis chain uses
CollationKeyFilter or ICUCollationKeyFilter. (Michael Sokolov, rmuir)
* SOLR-2982: Added BeiderMorseFilterFactory for Beider-Morse (BMPM) phonetic encoder. Upgrades
@@ -976,7 +1005,7 @@ Bug Fixes
* SOLR-2819: Improved speed of parsing hex entities in HTMLStripCharFilter
(Bernhard Berger, hossman)
-
+
* SOLR-2509: StringIndexOutOfBoundsException in the spellchecker collate when the term contains
a hyphen. (Thomas Gambier caught the bug, Steffen Godskesen did the patch, via Erick Erickson)
@@ -996,17 +1025,18 @@ Bug Fixes
* SOLR-2542: Fixed DIH Context variables which were broken for all scopes other
then SCOPE_ENTITY (Linbin Chen & Frank Wesemann via hossman)
-
+
* SOLR-3042: Fixed Maven Jetty plugin configuration.
(David Smiley via Steve Rowe)
* SOLR-2970: CSV ResponseWriter returns fields defined as stored=false in schema (janhoy)
* LUCENE-3690, LUCENE-2208, SOLR-882, SOLR-42: Re-implemented
- HTMLStripCharFilter as a JFlex-generated scanner. See below for a list
- of bug fixes and other changes. To get the same behavior as
- HTMLStripCharFilter in Solr version 3.5 and earlier (including the bugs),
- use LegacyHTMLStripCharFilter, which is the previous implementation.
+ HTMLStripCharFilter as a JFlex-generated scanner and moved it to
+ lucene/contrib/analyzers/common/. See below for a list of bug fixes and
+ other changes. To get the same behavior as HTMLStripCharFilter in Solr
+ version 3.5 and earlier (including the bugs), use LegacyHTMLStripCharFilter,
+ which is the previous implementation.
Behavior changes from the previous version:
@@ -1020,7 +1050,7 @@ Bug Fixes
from Unicode character classes [:ID_Start:] and [:ID_Continue:].
- Uppercase character entities """, "©", ">", "<", "®",
and "&" are now recognized and handled as if they were in lowercase.
- - The REPLACEMENT CHARACTER U+FFFD is now used to replace numeric character
+ - The REPLACEMENT CHARACTER U+FFFD is now used to replace numeric character
entities for unpaired UTF-16 low and high surrogates (in the range
[U+D800-U+DFFF]).
- Properly paired numeric character entities for UTF-16 surrogates are now
@@ -1086,6 +1116,9 @@ Bug Fixes
and was fundamentally broken/bizarre.
(hossman, Ahmet Arslan)
+* SOLR-2291: fix JSONWriter to respect field list when writing SolrDocuments
+ (Ahmet Arslan via hossman)
+
* SOLR-3264: Fix CoreContainer and SolrResourceLoader logging to be more
clear about when SolrCores are being created, and stop misleading people
about SolrCore instanceDir's being the "Solr Home Dir" (hossman)
@@ -1101,9 +1134,9 @@ Bug Fixes
* SOLR-3200: Fix SignatureUpdateProcessor "all fields" mode to use all
fields of each document instead of the fields specified by the first
document indexed (Spyros Kapnissis via hossman)
-
-* SOLR-3316: Distributed grouping failed when rows parameter was set to 0 and sometimes returned a wrong
- hit count as matches. (Cody Young, Martijn van Groningen)
+
+* SOLR-3316: Distributed grouping failed when rows parameter was set to 0 and
+ sometimes returned a wrong hit count as matches. (Cody Young, Martijn van Groningen)
Other Changes
----------------------
@@ -1112,7 +1145,7 @@ Other Changes
* SOLR-2920: Refactor frequent conditional use of DefaultSolrParams and
AppendedSolrParams into factory methods.
(David Smiley via hossman)
-
+
* SOLR-3032: Deprecate logOnce from SolrException logOnce and all the supporting
structure will disappear in 4.0. Errors should be caught and logged at the
top-most level or logged and NOT propagated up the chain. (Erick Erickson)
@@ -1136,7 +1169,7 @@ Other Changes
* SOLR-3077: Better error messages when attempting to use "blank" field names
(Antony Stubbs via hossman)
-* SOLR-2712: expecting fl=score to return all fields is now deprecated.
+* SOLR-2712: expecting fl=score to return all fields is now deprecated.
In solr 4.0, this will only return the score. (ryan)
* SOLR-3156: Check for Lucene directory locks at startup. In previous versions
@@ -1149,6 +1182,13 @@ Other Changes
the effect of NOT specifying <indexConfig> at all gives same result as the
example config used to give in 3.5 (janhoy, gsingers)
+* SOLR-3294: In contrib/clustering/lib/, replaced the manually retrowoven
+ Java 1.5-compatible carrot2-core-3.5.0.jar (which is not publicly available,
+ except from the Solr Subversion repository), with newly released Java
+ 1.5-compatible carrot2-core-3.5.0.1.jar (hosted on the Maven Central
+ repository). Also updated dependencies jackson-core-asl and
+ jackson-mapper-asl (both v1.5.2 -> v1.7.4). (Dawid Weiss, Steve Rowe)
+
Build
----------------------
* SOLR-2487: Add build target to package war without slf4j jars (janhoy)
Modified: lucene/dev/branches/branch_4x/solr/contrib/analysis-extras/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/analysis-extras/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/analysis-extras/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/analysis-extras/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -17,6 +17,10 @@ $Id$
the Solr 3.x ICUCollationKeyFilterFactory, and also supports
Locale-sensitive range queries. (rmuir)
+================== 3.6.1 ==================
+
+(No Changes)
+
================== 3.6.0 ==================
* SOLR-2919: Added parametric tailoring options to ICUCollationKeyFilterFactory.
Modified: lucene/dev/branches/branch_4x/solr/contrib/clustering/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/clustering/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/clustering/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/clustering/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -12,6 +12,11 @@ $Id$
* SOLR-3470: Bug fix: custom Carrot2 tokenizer and stemmer factories are
respected now (Stanislaw Osinski, Dawid Weiss)
+================== Release 3.6.1 ==================
+
+* SOLR-3470: Bug fix: custom Carrot2 tokenizer and stemmer factories are
+ respected now (Stanislaw Osinski, Dawid Weiss)
+
================== Release 3.6.0 ==================
* SOLR-2937: Configuring the number of contextual snippets used for
Modified: lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -28,7 +28,12 @@ Other Changes
Bug Fixes
----------------------
-* SOLR-3336: SolrEntityProcessor substitutes most variables at query time
+* SOLR-3360: More bug fixes for the deprecated "threads" parameter. (Mikhail Khludnev, Claudio R, via James Dyer)
+
+* SOLR-3430: Added a new test against a real SQL database. Fixed problems revealed by this new test
+ related to the expanded cache support added to 3.6/SOLR-2382 (James Dyer)
+
+* SOLR-3336: SolrEntityProcessor substitutes most variables at query time.
(Michael Kroh, Lance Norskog, via Martijn van Groningen)
================== 3.6.0 ==================
Modified: lucene/dev/branches/branch_4x/solr/contrib/extraction/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/extraction/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/extraction/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/extraction/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -28,6 +28,10 @@ $Id$
* SOLR-3254: Upgrade Solr to Tika 1.1 (janhoy)
+================== Release 3.6.1 ==================
+
+(No Changes)
+
================== Release 3.6.0 ==================
* SOLR-2346: Add a chance to set content encoding explicitly via content type of stream.
Modified: lucene/dev/branches/branch_4x/solr/contrib/langid/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/langid/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/langid/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/langid/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -11,6 +11,10 @@ $Id$
(No changes)
+================== Release 3.6.1 ==================
+
+(No Changes)
+
================== Release 3.6.0 ==================
* SOLR-3107: When using the LangDetect implementation of langid, set the random
Modified: lucene/dev/branches/branch_4x/solr/contrib/uima/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/uima/CHANGES.txt?rev=1364572&r1=1364571&r2=1364572&view=diff
==============================================================================
--- lucene/dev/branches/branch_4x/solr/contrib/uima/CHANGES.txt (original)
+++ lucene/dev/branches/branch_4x/solr/contrib/uima/CHANGES.txt Mon Jul 23 10:35:35 2012
@@ -26,6 +26,10 @@ $Id$
(No Changes)
+================== 3.6.1 ==================
+
+(No Changes)
+
================== 3.6.0 ==================
(No Changes)