You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@cocoon.apache.org by "David Crossley (JIRA)" <ji...@apache.org> on 2006/03/01 07:48:39 UTC

[jira] Updated: (COCOON-996) [PATCH] LuceneIndexContentHandler.java produces CLOBs

     [ http://issues.apache.org/jira/browse/COCOON-996?page=all ]

David Crossley updated COCOON-996:
----------------------------------

    Bugzilla Id:   (was: 25934)
     Other Info: [Patch available]
    Description: 
cocoon-2.1/src/blocks/lucene/java/org/apache/cocoon/components/search/LuceneIndexContentHandler.java
produces something like a "Character Large Object", because the text from all
XML-elements is concatenated without whitespaces between them:
<list><item>Foo</item><item>Bar</item></list> gets indexed as "FooBar", which
makes searching very hard. Adding a blank after an element might solve this
"problem", but might be wrong for other cases (but I don't know any at the moment)

  was:
cocoon-2.1/src/blocks/lucene/java/org/apache/cocoon/components/search/LuceneIndexContentHandler.java
produces something like a "Character Large Object", because the text from all
XML-elements is concatenated without whitespaces between them:
<list><item>Foo</item><item>Bar</item></list> gets indexed as "FooBar", which
makes searching very hard. Adding a blank after an element might solve this
"problem", but might be wrong for other cases (but I don't know any at the moment)


> [PATCH] LuceneIndexContentHandler.java produces CLOBs
> -----------------------------------------------------
>
>          Key: COCOON-996
>          URL: http://issues.apache.org/jira/browse/COCOON-996
>      Project: Cocoon
>         Type: Bug
>   Components: - Components: Avalon
>     Versions: 2.1.8
>  Environment: Operating System: Linux
> Platform: PC
>     Reporter: Philipp Matthias Hahn
>     Assignee: Cocoon Developers Team
>     Priority: Minor
>  Attachments: LICH.shouldwork.patch.txt, LICH.vadimjoerg.patch.txt, LuceneIndexContentHandler.java.diff
>
> cocoon-2.1/src/blocks/lucene/java/org/apache/cocoon/components/search/LuceneIndexContentHandler.java
> produces something like a "Character Large Object", because the text from all
> XML-elements is concatenated without whitespaces between them:
> <list><item>Foo</item><item>Bar</item></list> gets indexed as "FooBar", which
> makes searching very hard. Adding a blank after an element might solve this
> "problem", but might be wrong for other cases (but I don't know any at the moment)

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira