You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Alex Cougarman <ac...@bwc.org> on 2012/09/10 07:48:30 UTC
Bug 53380
Hi. I'm having the same issue from this bug with hundreds of our DOC files being fed through Solr/Tika: https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
I downloaded the DOC file attached to the ticket and was able to generate the same error we've been getting (please see below for the exception).
Anyone know of a solution/workaround? Is there a timeline for a fix? I commented and voted on the ticket but not sure if it's a priority. Thanks.
org.apache.tika.exception.TikaException
: Unexpected RuntimeException from
org.apache.tika.parser.microsoft.OfficeParser@328c62ce
org.apache.solr.common.SolrException:
org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
at
org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
actingDocumentLoader.java:230)
at
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
ntentStreamHandlerBase.java:74)
at
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
erBase.java:129)
at
org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
Request(RequestHandlers.java:240)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
at
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
.java:454)
at
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
r.java:275)
at
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
Handler.java:1337)
at
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
:484)
at
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
ava:119)
at
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
at
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
er.java:233)
at
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
er.java:1065)
at
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
413)
at
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
r.java:192)
at
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
r.java:999)
at
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
ava:117)
at
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
extHandlerCollection.java:250)
at
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
ection.java:149)
at
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
.java:111)
at org.eclipse.jetty.server.Server.handle(Server.java:351)
at
org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
tHttpConnection.java:454)
at
org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
gHttpConnection.java:47)
at
org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
ctHttpConnection.java:890)
at
org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
Complete(AbstractHttpConnection.java:944)
at
org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
at
org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
at
org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
nnection.java:66)
at
org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
cketConnector.java:254)
at
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
l.java:599)
at
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
.java:534)
at java.lang.Thread.run(Unknown Source)
Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
20)
at
org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
actingDocumentLoader.java:224)
... 31 more
Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
at
org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
at
org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
at
org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
tType.java:56)
at
org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
escriptor.java:38)
at
org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
tion(CharacterSprmUncompressor.java:582)
at
org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
acterSprmUncompressor.java:65)
at
org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
at
org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
)
at
org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
at
org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
va:77)
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
:185)
at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
:160)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
)
... 34 more
Warm regards,
Alex
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Dear Sergey,
Build 50 works well. I hadn't replaced all the old POI JAR files with the new ones. Thanks.
Warm regards,
Alex
-----Original Message-----
From: Alex Cougarman [mailto:acougarm@bwc.org]
Sent: 26 September 2012 8:58 AM
To: 'POI Users List'
Subject: RE: Bug 53380
Thanks, Sergey. Sorry to not include the stack trace -- was worried it would be too long; I've added the stack trace to the ticket now :)
Warm regards,
Alex
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 25 September 2012 10:21 AM
To: POI Users List
Subject: Re: Bug 53380
Alex,
I will take a look into it a bit later. But I need to note, that it should be different bug, i.e. different reason for ArrayIndexOutOfBounds, because all previous files are "passed" now. So, please include stack trace next time :)
Best regards,
Sergey
On Tue, Sep 25, 2012 at 11:07 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Hi Sergey,
>
> The bug persists. We've uploaded a Word DOC (blank_2.doc) to the bug
> that generates the ArrayIndexOutOfBounds exception:
> https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> This is using the latest build (#50) from here:
> https://builds.apache.org/job/POI/50/
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 13 September 2012 11:55 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi,
>
> Try #47, build by Yegor:
>
> https://builds.apache.org/job/POI/47/
>
> Best regards,
> Sergey
>
> On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Any update on the bug fix for this? There's a Build #46 on this page
> > but it says "Failed" when you roll over the red circle:
> > https://builds.apache.org/job/POI/46/
> >
> > Thank you :)
> >
> > Warm regards,
> > Alex Cougarman
> >
> > Bahá’í World Centre
> > Haifa, Israel
> > Office: +972-4-835-8683
> > Cell: +972-54-241-4742
> > acougarm@bwc.org
> >
> >
> > -----Original Message-----
> > From: Alex Cougarman [mailto:acougarm@bwc.org]
> > Sent: 11 September 2012 11:42 AM
> > To: 'POI Users List'
> > Subject: RE: Bug 53380
> >
> > Hi Sergey,
> >
> > Thank you for looking into this issue. It will make a huge
> > difference for us :)
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> > Sent: 10 September 2012 2:13 PM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > Hi
> >
> > I will take a look into it today or tomorrow.
> > Sorry for the long waiting
> >
> > Best regards,
> > Sergey
> >
> > On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> >
> > > Dear Yegor,
> > >
> > > Thank you for your reply. If I knew enough about Java, I'd go in
> > > and fix it :) Just happy to have you guys providing such a great tool.
> > > Thanks and keep up the great work.
> > >
> > > Warm regards,
> > > Alex
> > >
> > > -----Original Message-----
> > > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > > Sent: 10 September 2012 10:16 AM
> > > To: POI Users List
> > > Subject: Re: Bug 53380
> > >
> > > We have all pre-requisites for fixing this bug, just need to find
> > > a person to do it :)
> > >
> > > POI is a volunteer project and if this problem is important for
> > > you, please do work on it and submit a patch. Otherwise please wait.
> > > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > > modules, so fixing may take some time.
> > >
> > > Yegor
> > >
> > > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> > wrote:
> > > > Hi. I'm having the same issue from this bug with hundreds of our
> > > > DOC files being fed through Solr/Tika:
> > > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > > >
> > > > I downloaded the DOC file attached to the ticket and was able to
> > > generate the same error we've been getting (please see below for
> > > the exception).
> > > >
> > > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > > I
> > > commented and voted on the ticket but not sure if it's a priority.
> > Thanks.
> > > >
> > > > org.apache.tika.exception.TikaException
> > > > : Unexpected RuntimeException from
> > > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > org.apache.solr.common.SolrException:
> > > > org.apache.tika.exception.TikaException: Unexpected
> > > > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:230)
> > > > at
> > > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > > ntentStreamHandlerBase.java:74)
> > > > at
> > > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > > erBase.java:129)
> > > > at
> > > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > > Request(RequestHandlers.java:240)
> > > > at
> > org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > > .java:454)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > > r.java:275)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > > Handler.java:1337)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > > :484)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:119)
> > > > at
> > > >
> > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.
> > > ja
> > > va
> > > :524)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > > er.java:233)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > > er.java:1065)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > > 413)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > > r.java:192)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > > r.java:999)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:117)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > > extHandlerCollection.java:250)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > > ection.java:149)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > > .java:111)
> > > > at
> org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > > tHttpConnection.java:454)
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > > gHttpConnection.java:47)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > > ctHttpConnection.java:890)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > > Complete(AbstractHttpConnection.java:944)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java
> > > > :2
> > > > 30
> > > > )
> > > >
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > > nnection.java:66)
> > > > at
> > > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > > cketConnector.java:254)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > > l.java:599)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > > .java:534)
> > > > at java.lang.Thread.run(Unknown Source)
> > > > Caused by: org.apache.tika.exception.TikaException:
> > > > Unexpected
> > > RuntimeException
> > > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > > )
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > at
> > > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > > 20)
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:224)
> > > > ... 31 more
> > > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > > at
> > > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > > at
> > > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > > at
> > > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > > tType.java:56)
> > > > at
> > > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > > escriptor.java:38)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > > tion(CharacterSprmUncompressor.java:582)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > > acterSprmUncompressor.java:65)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > > )
> > > > at
> > > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > > at
> > > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > > va:77)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :185)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :160)
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > ... 34 more
> > > >
> > > >
> > > > Warm regards,
> > > > Alex
> > > >
> > >
> > > ------------------------------------------------------------------
> > > --
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> > > ------------------------------------------------------------------
> > > --
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> >
> >
> > --
> > Sergey Vladimirov
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
>
>
>
> --
> Sergey Vladimirov
>
--
Sergey Vladimirov
B KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKCB [ X ܚX KK[XZ[
\ \ ][ X ܚX P K \X K ܙ B ܈Y][ۘ[ [X[ K[XZ[
\ \ Z[ K \X K ܙ B
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Thanks, Sergey. Sorry to not include the stack trace -- was worried it would be too long; I've added the stack trace to the ticket now :)
Warm regards,
Alex
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 25 September 2012 10:21 AM
To: POI Users List
Subject: Re: Bug 53380
Alex,
I will take a look into it a bit later. But I need to note, that it should be different bug, i.e. different reason for ArrayIndexOutOfBounds, because all previous files are "passed" now. So, please include stack trace next time :)
Best regards,
Sergey
On Tue, Sep 25, 2012 at 11:07 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Hi Sergey,
>
> The bug persists. We've uploaded a Word DOC (blank_2.doc) to the bug
> that generates the ArrayIndexOutOfBounds exception:
> https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> This is using the latest build (#50) from here:
> https://builds.apache.org/job/POI/50/
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 13 September 2012 11:55 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi,
>
> Try #47, build by Yegor:
>
> https://builds.apache.org/job/POI/47/
>
> Best regards,
> Sergey
>
> On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Any update on the bug fix for this? There's a Build #46 on this page
> > but it says "Failed" when you roll over the red circle:
> > https://builds.apache.org/job/POI/46/
> >
> > Thank you :)
> >
> > Warm regards,
> > Alex Cougarman
> >
> > Bahá’í World Centre
> > Haifa, Israel
> > Office: +972-4-835-8683
> > Cell: +972-54-241-4742
> > acougarm@bwc.org
> >
> >
> > -----Original Message-----
> > From: Alex Cougarman [mailto:acougarm@bwc.org]
> > Sent: 11 September 2012 11:42 AM
> > To: 'POI Users List'
> > Subject: RE: Bug 53380
> >
> > Hi Sergey,
> >
> > Thank you for looking into this issue. It will make a huge
> > difference for us :)
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> > Sent: 10 September 2012 2:13 PM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > Hi
> >
> > I will take a look into it today or tomorrow.
> > Sorry for the long waiting
> >
> > Best regards,
> > Sergey
> >
> > On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> >
> > > Dear Yegor,
> > >
> > > Thank you for your reply. If I knew enough about Java, I'd go in
> > > and fix it :) Just happy to have you guys providing such a great tool.
> > > Thanks and keep up the great work.
> > >
> > > Warm regards,
> > > Alex
> > >
> > > -----Original Message-----
> > > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > > Sent: 10 September 2012 10:16 AM
> > > To: POI Users List
> > > Subject: Re: Bug 53380
> > >
> > > We have all pre-requisites for fixing this bug, just need to find
> > > a person to do it :)
> > >
> > > POI is a volunteer project and if this problem is important for
> > > you, please do work on it and submit a patch. Otherwise please wait.
> > > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > > modules, so fixing may take some time.
> > >
> > > Yegor
> > >
> > > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> > wrote:
> > > > Hi. I'm having the same issue from this bug with hundreds of our
> > > > DOC files being fed through Solr/Tika:
> > > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > > >
> > > > I downloaded the DOC file attached to the ticket and was able to
> > > generate the same error we've been getting (please see below for
> > > the exception).
> > > >
> > > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > > I
> > > commented and voted on the ticket but not sure if it's a priority.
> > Thanks.
> > > >
> > > > org.apache.tika.exception.TikaException
> > > > : Unexpected RuntimeException from
> > > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > org.apache.solr.common.SolrException:
> > > > org.apache.tika.exception.TikaException: Unexpected
> > > > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:230)
> > > > at
> > > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > > ntentStreamHandlerBase.java:74)
> > > > at
> > > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > > erBase.java:129)
> > > > at
> > > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > > Request(RequestHandlers.java:240)
> > > > at
> > org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > > .java:454)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > > r.java:275)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > > Handler.java:1337)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > > :484)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:119)
> > > > at
> > > >
> > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.
> > > ja
> > > va
> > > :524)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > > er.java:233)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > > er.java:1065)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > > 413)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > > r.java:192)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > > r.java:999)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:117)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > > extHandlerCollection.java:250)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > > ection.java:149)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > > .java:111)
> > > > at
> org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > > tHttpConnection.java:454)
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > > gHttpConnection.java:47)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > > ctHttpConnection.java:890)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > > Complete(AbstractHttpConnection.java:944)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java
> > > > :2
> > > > 30
> > > > )
> > > >
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > > nnection.java:66)
> > > > at
> > > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > > cketConnector.java:254)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > > l.java:599)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > > .java:534)
> > > > at java.lang.Thread.run(Unknown Source)
> > > > Caused by: org.apache.tika.exception.TikaException:
> > > > Unexpected
> > > RuntimeException
> > > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > > )
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > at
> > > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > > 20)
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:224)
> > > > ... 31 more
> > > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > > at
> > > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > > at
> > > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > > at
> > > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > > tType.java:56)
> > > > at
> > > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > > escriptor.java:38)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > > tion(CharacterSprmUncompressor.java:582)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > > acterSprmUncompressor.java:65)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > > )
> > > > at
> > > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > > at
> > > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > > va:77)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :185)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :160)
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > ... 34 more
> > > >
> > > >
> > > > Warm regards,
> > > > Alex
> > > >
> > >
> > > ------------------------------------------------------------------
> > > --
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> > > ------------------------------------------------------------------
> > > --
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> >
> >
> > --
> > Sergey Vladimirov
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
>
>
>
> --
> Sergey Vladimirov
>
--
Sergey Vladimirov
Re: Bug 53380
Posted by Sergey Vladimirov <vl...@gmail.com>.
Alex,
I will take a look into it a bit later. But I need to note, that it should
be different bug, i.e. different reason for ArrayIndexOutOfBounds, because
all previous files are "passed" now. So, please include stack trace next
time :)
Best regards,
Sergey
On Tue, Sep 25, 2012 at 11:07 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Hi Sergey,
>
> The bug persists. We've uploaded a Word DOC (blank_2.doc) to the bug that
> generates the ArrayIndexOutOfBounds exception:
> https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> This is using the latest build (#50) from here:
> https://builds.apache.org/job/POI/50/
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 13 September 2012 11:55 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi,
>
> Try #47, build by Yegor:
>
> https://builds.apache.org/job/POI/47/
>
> Best regards,
> Sergey
>
> On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Any update on the bug fix for this? There's a Build #46 on this page
> > but it says "Failed" when you roll over the red circle:
> > https://builds.apache.org/job/POI/46/
> >
> > Thank you :)
> >
> > Warm regards,
> > Alex Cougarman
> >
> > Bahá’í World Centre
> > Haifa, Israel
> > Office: +972-4-835-8683
> > Cell: +972-54-241-4742
> > acougarm@bwc.org
> >
> >
> > -----Original Message-----
> > From: Alex Cougarman [mailto:acougarm@bwc.org]
> > Sent: 11 September 2012 11:42 AM
> > To: 'POI Users List'
> > Subject: RE: Bug 53380
> >
> > Hi Sergey,
> >
> > Thank you for looking into this issue. It will make a huge difference
> > for us :)
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> > Sent: 10 September 2012 2:13 PM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > Hi
> >
> > I will take a look into it today or tomorrow.
> > Sorry for the long waiting
> >
> > Best regards,
> > Sergey
> >
> > On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> >
> > > Dear Yegor,
> > >
> > > Thank you for your reply. If I knew enough about Java, I'd go in and
> > > fix it :) Just happy to have you guys providing such a great tool.
> > > Thanks and keep up the great work.
> > >
> > > Warm regards,
> > > Alex
> > >
> > > -----Original Message-----
> > > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > > Sent: 10 September 2012 10:16 AM
> > > To: POI Users List
> > > Subject: Re: Bug 53380
> > >
> > > We have all pre-requisites for fixing this bug, just need to find a
> > > person to do it :)
> > >
> > > POI is a volunteer project and if this problem is important for you,
> > > please do work on it and submit a patch. Otherwise please wait.
> > > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > > modules, so fixing may take some time.
> > >
> > > Yegor
> > >
> > > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> > wrote:
> > > > Hi. I'm having the same issue from this bug with hundreds of our
> > > > DOC files being fed through Solr/Tika:
> > > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > > >
> > > > I downloaded the DOC file attached to the ticket and was able to
> > > generate the same error we've been getting (please see below for the
> > > exception).
> > > >
> > > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > > I
> > > commented and voted on the ticket but not sure if it's a priority.
> > Thanks.
> > > >
> > > > org.apache.tika.exception.TikaException
> > > > : Unexpected RuntimeException from
> > > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > org.apache.solr.common.SolrException:
> > > > org.apache.tika.exception.TikaException: Unexpected
> > > > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:230)
> > > > at
> > > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > > ntentStreamHandlerBase.java:74)
> > > > at
> > > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > > erBase.java:129)
> > > > at
> > > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > > Request(RequestHandlers.java:240)
> > > > at
> > org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > > .java:454)
> > > > at
> > > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > > r.java:275)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > > Handler.java:1337)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > > :484)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:119)
> > > > at
> > > >
> > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.ja
> > > va
> > > :524)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > > er.java:233)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > > er.java:1065)
> > > > at
> > > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > > 413)
> > > > at
> > > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > > r.java:192)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > > r.java:999)
> > > > at
> > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > > ava:117)
> > > > at
> > > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > > extHandlerCollection.java:250)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > > ection.java:149)
> > > > at
> > > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > > .java:111)
> > > > at
> org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > > tHttpConnection.java:454)
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > > gHttpConnection.java:47)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > > ctHttpConnection.java:890)
> > > > at
> > > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > > Complete(AbstractHttpConnection.java:944)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > > at
> > > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:2
> > > > 30
> > > > )
> > > >
> > > > at
> > > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > > nnection.java:66)
> > > > at
> > > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > > cketConnector.java:254)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > > l.java:599)
> > > > at
> > > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > > .java:534)
> > > > at java.lang.Thread.run(Unknown Source)
> > > > Caused by: org.apache.tika.exception.TikaException: Unexpected
> > > RuntimeException
> > > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > > )
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > at
> > > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > > 20)
> > > > at
> > > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > > actingDocumentLoader.java:224)
> > > > ... 31 more
> > > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > > at
> > > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > > at
> > > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > > at
> > > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > > tType.java:56)
> > > > at
> > > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > > escriptor.java:38)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > > tion(CharacterSprmUncompressor.java:582)
> > > > at
> > > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > > acterSprmUncompressor.java:65)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > > at
> > > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > > )
> > > > at
> > > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > > at
> > > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > > va:77)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :185)
> > > > at
> > > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > > :160)
> > > > at
> > > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > > )
> > > > ... 34 more
> > > >
> > > >
> > > > Warm regards,
> > > > Alex
> > > >
> > >
> > > --------------------------------------------------------------------
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> > > --------------------------------------------------------------------
> > > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > > additional commands, e-mail: user-help@poi.apache.org
> > >
> > >
> >
> >
> > --
> > Sergey Vladimirov
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> > commands, e-mail: user-help@poi.apache.org
> >
>
>
>
> --
> Sergey Vladimirov
>
--
Sergey Vladimirov
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Hi Sergey,
The bug persists. We've uploaded a Word DOC (blank_2.doc) to the bug that generates the ArrayIndexOutOfBounds exception: https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
This is using the latest build (#50) from here: https://builds.apache.org/job/POI/50/
Warm regards,
Alex
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 13 September 2012 11:55 AM
To: POI Users List
Subject: Re: Bug 53380
Hi,
Try #47, build by Yegor:
https://builds.apache.org/job/POI/47/
Best regards,
Sergey
On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Any update on the bug fix for this? There's a Build #46 on this page
> but it says "Failed" when you roll over the red circle:
> https://builds.apache.org/job/POI/46/
>
> Thank you :)
>
> Warm regards,
> Alex Cougarman
>
> Bahá’í World Centre
> Haifa, Israel
> Office: +972-4-835-8683
> Cell: +972-54-241-4742
> acougarm@bwc.org
>
>
> -----Original Message-----
> From: Alex Cougarman [mailto:acougarm@bwc.org]
> Sent: 11 September 2012 11:42 AM
> To: 'POI Users List'
> Subject: RE: Bug 53380
>
> Hi Sergey,
>
> Thank you for looking into this issue. It will make a huge difference
> for us :)
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 10 September 2012 2:13 PM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi
>
> I will take a look into it today or tomorrow.
> Sorry for the long waiting
>
> Best regards,
> Sergey
>
> On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Dear Yegor,
> >
> > Thank you for your reply. If I knew enough about Java, I'd go in and
> > fix it :) Just happy to have you guys providing such a great tool.
> > Thanks and keep up the great work.
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > Sent: 10 September 2012 10:16 AM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > We have all pre-requisites for fixing this bug, just need to find a
> > person to do it :)
> >
> > POI is a volunteer project and if this problem is important for you,
> > please do work on it and submit a patch. Otherwise please wait.
> > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > modules, so fixing may take some time.
> >
> > Yegor
> >
> > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> > > Hi. I'm having the same issue from this bug with hundreds of our
> > > DOC files being fed through Solr/Tika:
> > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > >
> > > I downloaded the DOC file attached to the ticket and was able to
> > generate the same error we've been getting (please see below for the
> > exception).
> > >
> > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > I
> > commented and voted on the ticket but not sure if it's a priority.
> Thanks.
> > >
> > > org.apache.tika.exception.TikaException
> > > : Unexpected RuntimeException from
> > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > org.apache.solr.common.SolrException:
> > > org.apache.tika.exception.TikaException: Unexpected
> > > RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:230)
> > > at
> > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > ntentStreamHandlerBase.java:74)
> > > at
> > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > erBase.java:129)
> > > at
> > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > Request(RequestHandlers.java:240)
> > > at
> org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > .java:454)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > r.java:275)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > Handler.java:1337)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > :484)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:119)
> > > at
> > >
> > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.ja
> > va
> > :524)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > er.java:233)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > er.java:1065)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > 413)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > r.java:192)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > r.java:999)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:117)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > extHandlerCollection.java:250)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > ection.java:149)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > .java:111)
> > > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > tHttpConnection.java:454)
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > gHttpConnection.java:47)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > ctHttpConnection.java:890)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > Complete(AbstractHttpConnection.java:944)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:2
> > > 30
> > > )
> > >
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > nnection.java:66)
> > > at
> > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > cketConnector.java:254)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > l.java:599)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > .java:534)
> > > at java.lang.Thread.run(Unknown Source)
> > > Caused by: org.apache.tika.exception.TikaException: Unexpected
> > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > )
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > at
> > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > 20)
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:224)
> > > ... 31 more
> > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > at
> > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > at
> > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > at
> > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > tType.java:56)
> > > at
> > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > escriptor.java:38)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > tion(CharacterSprmUncompressor.java:582)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > acterSprmUncompressor.java:65)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > )
> > > at
> > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > at
> > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > va:77)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :185)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :160)
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > ... 34 more
> > >
> > >
> > > Warm regards,
> > > Alex
> > >
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
> >
>
>
> --
> Sergey Vladimirov
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
--
Sergey Vladimirov
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Hi,
The bug is fixed with this build: https://builds.apache.org/job/POI/47/
We confirmed with our files and all the DOC files are processed beautifully :)
Thank you.
Warm regards,
Alex Cougarman
Bahá’í World Centre
Haifa, Israel
Office: +972-4-835-8683
Cell: +972-54-241-4742
acougarm@bwc.org
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 13 September 2012 11:55 AM
To: POI Users List
Subject: Re: Bug 53380
Hi,
Try #47, build by Yegor:
https://builds.apache.org/job/POI/47/
Best regards,
Sergey
On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Any update on the bug fix for this? There's a Build #46 on this page
> but it says "Failed" when you roll over the red circle:
> https://builds.apache.org/job/POI/46/
>
> Thank you :)
>
> Warm regards,
> Alex Cougarman
>
> Bahá’í World Centre
> Haifa, Israel
> Office: +972-4-835-8683
> Cell: +972-54-241-4742
> acougarm@bwc.org
>
>
> -----Original Message-----
> From: Alex Cougarman [mailto:acougarm@bwc.org]
> Sent: 11 September 2012 11:42 AM
> To: 'POI Users List'
> Subject: RE: Bug 53380
>
> Hi Sergey,
>
> Thank you for looking into this issue. It will make a huge difference
> for us :)
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 10 September 2012 2:13 PM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi
>
> I will take a look into it today or tomorrow.
> Sorry for the long waiting
>
> Best regards,
> Sergey
>
> On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Dear Yegor,
> >
> > Thank you for your reply. If I knew enough about Java, I'd go in and
> > fix it :) Just happy to have you guys providing such a great tool.
> > Thanks and keep up the great work.
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > Sent: 10 September 2012 10:16 AM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > We have all pre-requisites for fixing this bug, just need to find a
> > person to do it :)
> >
> > POI is a volunteer project and if this problem is important for you,
> > please do work on it and submit a patch. Otherwise please wait.
> > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > modules, so fixing may take some time.
> >
> > Yegor
> >
> > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> > > Hi. I'm having the same issue from this bug with hundreds of our
> > > DOC files being fed through Solr/Tika:
> > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > >
> > > I downloaded the DOC file attached to the ticket and was able to
> > generate the same error we've been getting (please see below for the
> > exception).
> > >
> > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > I
> > commented and voted on the ticket but not sure if it's a priority.
> Thanks.
> > >
> > > org.apache.tika.exception.TikaException
> > > : Unexpected RuntimeException from
> > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > org.apache.solr.common.SolrException:
> > > org.apache.tika.exception.TikaException: Unexpected
> > > RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:230)
> > > at
> > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > ntentStreamHandlerBase.java:74)
> > > at
> > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > erBase.java:129)
> > > at
> > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > Request(RequestHandlers.java:240)
> > > at
> org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > .java:454)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > r.java:275)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > Handler.java:1337)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > :484)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:119)
> > > at
> > >
> > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.ja
> > va
> > :524)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > er.java:233)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > er.java:1065)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > 413)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > r.java:192)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > r.java:999)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:117)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > extHandlerCollection.java:250)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > ection.java:149)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > .java:111)
> > > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > tHttpConnection.java:454)
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > gHttpConnection.java:47)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > ctHttpConnection.java:890)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > Complete(AbstractHttpConnection.java:944)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:2
> > > 30
> > > )
> > >
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > nnection.java:66)
> > > at
> > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > cketConnector.java:254)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > l.java:599)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > .java:534)
> > > at java.lang.Thread.run(Unknown Source)
> > > Caused by: org.apache.tika.exception.TikaException: Unexpected
> > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > )
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > at
> > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > 20)
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:224)
> > > ... 31 more
> > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > at
> > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > at
> > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > at
> > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > tType.java:56)
> > > at
> > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > escriptor.java:38)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > tion(CharacterSprmUncompressor.java:582)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > acterSprmUncompressor.java:65)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > )
> > > at
> > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > at
> > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > va:77)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :185)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :160)
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > ... 34 more
> > >
> > >
> > > Warm regards,
> > > Alex
> > >
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
> >
> > --------------------------------------------------------------------
> > - To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For
> > additional commands, e-mail: user-help@poi.apache.org
> >
> >
>
>
> --
> Sergey Vladimirov
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
--
Sergey Vladimirov
Re: Bug 53380
Posted by Sergey Vladimirov <vl...@gmail.com>.
Hi,
Try #47, build by Yegor:
https://builds.apache.org/job/POI/47/
Best regards,
Sergey
On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Any update on the bug fix for this? There's a Build #46 on this page but
> it says "Failed" when you roll over the red circle:
> https://builds.apache.org/job/POI/46/
>
> Thank you :)
>
> Warm regards,
> Alex Cougarman
>
> Bahá’í World Centre
> Haifa, Israel
> Office: +972-4-835-8683
> Cell: +972-54-241-4742
> acougarm@bwc.org
>
>
> -----Original Message-----
> From: Alex Cougarman [mailto:acougarm@bwc.org]
> Sent: 11 September 2012 11:42 AM
> To: 'POI Users List'
> Subject: RE: Bug 53380
>
> Hi Sergey,
>
> Thank you for looking into this issue. It will make a huge difference for
> us :)
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
> Sent: 10 September 2012 2:13 PM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi
>
> I will take a look into it today or tomorrow.
> Sorry for the long waiting
>
> Best regards,
> Sergey
>
> On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
>
> > Dear Yegor,
> >
> > Thank you for your reply. If I knew enough about Java, I'd go in and
> > fix it :) Just happy to have you guys providing such a great tool.
> > Thanks and keep up the great work.
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> > Sent: 10 September 2012 10:16 AM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > We have all pre-requisites for fixing this bug, just need to find a
> > person to do it :)
> >
> > POI is a volunteer project and if this problem is important for you,
> > please do work on it and submit a patch. Otherwise please wait.
> > Unfortuntaly we don't have a active developer working on DOC/DOCX
> > modules, so fixing may take some time.
> >
> > Yegor
> >
> > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org>
> wrote:
> > > Hi. I'm having the same issue from this bug with hundreds of our DOC
> > > files being fed through Solr/Tika:
> > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > >
> > > I downloaded the DOC file attached to the ticket and was able to
> > generate the same error we've been getting (please see below for the
> > exception).
> > >
> > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > I
> > commented and voted on the ticket but not sure if it's a priority.
> Thanks.
> > >
> > > org.apache.tika.exception.TikaException
> > > : Unexpected RuntimeException from
> > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > org.apache.solr.common.SolrException:
> > > org.apache.tika.exception.TikaException: Unexpected RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:230)
> > > at
> > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > > ntentStreamHandlerBase.java:74)
> > > at
> > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > > erBase.java:129)
> > > at
> > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > > Request(RequestHandlers.java:240)
> > > at
> org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > > .java:454)
> > > at
> > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > > r.java:275)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > > Handler.java:1337)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > > :484)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:119)
> > > at
> > >
> > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java
> > :524)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > > er.java:233)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > > er.java:1065)
> > > at
> > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > > 413)
> > > at
> > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > > r.java:192)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > > r.java:999)
> > > at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > > ava:117)
> > > at
> > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > > extHandlerCollection.java:250)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > > ection.java:149)
> > > at
> > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > > .java:111)
> > > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > > tHttpConnection.java:454)
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > > gHttpConnection.java:47)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > > ctHttpConnection.java:890)
> > > at
> > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > > Complete(AbstractHttpConnection.java:944)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > at
> > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230
> > > )
> > >
> > > at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > > nnection.java:66)
> > > at
> > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > > cketConnector.java:254)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > > l.java:599)
> > > at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > > .java:534)
> > > at java.lang.Thread.run(Unknown Source)
> > > Caused by: org.apache.tika.exception.TikaException: Unexpected
> > RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > > )
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > at
> > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > > 20)
> > > at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > > actingDocumentLoader.java:224)
> > > ... 31 more
> > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > at
> > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > at
> > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > at
> > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > > tType.java:56)
> > > at
> > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > > escriptor.java:38)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > > tion(CharacterSprmUncompressor.java:582)
> > > at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > > acterSprmUncompressor.java:65)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > at
> > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > > )
> > > at
> > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > at
> > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > > va:77)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :185)
> > > at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > > :160)
> > > at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > > )
> > > ... 34 more
> > >
> > >
> > > Warm regards,
> > > Alex
> > >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> > commands, e-mail: user-help@poi.apache.org
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> > commands, e-mail: user-help@poi.apache.org
> >
> >
>
>
> --
> Sergey Vladimirov
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
--
Sergey Vladimirov
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Any update on the bug fix for this? There's a Build #46 on this page but it says "Failed" when you roll over the red circle: https://builds.apache.org/job/POI/46/
Thank you :)
Warm regards,
Alex Cougarman
Bahá’í World Centre
Haifa, Israel
Office: +972-4-835-8683
Cell: +972-54-241-4742
acougarm@bwc.org
-----Original Message-----
From: Alex Cougarman [mailto:acougarm@bwc.org]
Sent: 11 September 2012 11:42 AM
To: 'POI Users List'
Subject: RE: Bug 53380
Hi Sergey,
Thank you for looking into this issue. It will make a huge difference for us :)
Warm regards,
Alex
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 10 September 2012 2:13 PM
To: POI Users List
Subject: Re: Bug 53380
Hi
I will take a look into it today or tomorrow.
Sorry for the long waiting
Best regards,
Sergey
On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Dear Yegor,
>
> Thank you for your reply. If I knew enough about Java, I'd go in and
> fix it :) Just happy to have you guys providing such a great tool.
> Thanks and keep up the great work.
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> Sent: 10 September 2012 10:16 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> We have all pre-requisites for fixing this bug, just need to find a
> person to do it :)
>
> POI is a volunteer project and if this problem is important for you,
> please do work on it and submit a patch. Otherwise please wait.
> Unfortuntaly we don't have a active developer working on DOC/DOCX
> modules, so fixing may take some time.
>
> Yegor
>
> On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org> wrote:
> > Hi. I'm having the same issue from this bug with hundreds of our DOC
> > files being fed through Solr/Tika:
> > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> >
> > I downloaded the DOC file attached to the ticket and was able to
> generate the same error we've been getting (please see below for the
> exception).
> >
> > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > I
> commented and voted on the ticket but not sure if it's a priority. Thanks.
> >
> > org.apache.tika.exception.TikaException
> > : Unexpected RuntimeException from
> > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > org.apache.solr.common.SolrException:
> > org.apache.tika.exception.TikaException: Unexpected RuntimeException
> from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:230)
> > at
> > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > ntentStreamHandlerBase.java:74)
> > at
> > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > erBase.java:129)
> > at
> > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > Request(RequestHandlers.java:240)
> > at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > .java:454)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > r.java:275)
> > at
> > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > Handler.java:1337)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > :484)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:119)
> > at
> >
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java
> :524)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > er.java:233)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > er.java:1065)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > 413)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > r.java:192)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > r.java:999)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:117)
> > at
> > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > extHandlerCollection.java:250)
> > at
> > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > ection.java:149)
> > at
> > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > .java:111)
> > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > tHttpConnection.java:454)
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > gHttpConnection.java:47)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > ctHttpConnection.java:890)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > Complete(AbstractHttpConnection.java:944)
> > at
> > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > at
> > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230
> > )
> >
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > nnection.java:66)
> > at
> > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > cketConnector.java:254)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > l.java:599)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > .java:534)
> > at java.lang.Thread.run(Unknown Source)
> > Caused by: org.apache.tika.exception.TikaException: Unexpected
> RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > )
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > at
> > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > 20)
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:224)
> > ... 31 more
> > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > at
> > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > at
> > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > at
> > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > tType.java:56)
> > at
> > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > escriptor.java:38)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > tion(CharacterSprmUncompressor.java:582)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > acterSprmUncompressor.java:65)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > )
> > at
> > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > at
> > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > va:77)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :185)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :160)
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > ... 34 more
> >
> >
> > Warm regards,
> > Alex
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
>
--
Sergey Vladimirov
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional commands, e-mail: user-help@poi.apache.org
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Hi Sergey,
Thank you for looking into this issue. It will make a huge difference for us :)
Warm regards,
Alex
-----Original Message-----
From: Sergey Vladimirov [mailto:vlsergey@gmail.com]
Sent: 10 September 2012 2:13 PM
To: POI Users List
Subject: Re: Bug 53380
Hi
I will take a look into it today or tomorrow.
Sorry for the long waiting
Best regards,
Sergey
On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Dear Yegor,
>
> Thank you for your reply. If I knew enough about Java, I'd go in and
> fix it :) Just happy to have you guys providing such a great tool.
> Thanks and keep up the great work.
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> Sent: 10 September 2012 10:16 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> We have all pre-requisites for fixing this bug, just need to find a
> person to do it :)
>
> POI is a volunteer project and if this problem is important for you,
> please do work on it and submit a patch. Otherwise please wait.
> Unfortuntaly we don't have a active developer working on DOC/DOCX
> modules, so fixing may take some time.
>
> Yegor
>
> On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org> wrote:
> > Hi. I'm having the same issue from this bug with hundreds of our DOC
> > files being fed through Solr/Tika:
> > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> >
> > I downloaded the DOC file attached to the ticket and was able to
> generate the same error we've been getting (please see below for the
> exception).
> >
> > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > I
> commented and voted on the ticket but not sure if it's a priority. Thanks.
> >
> > org.apache.tika.exception.TikaException
> > : Unexpected RuntimeException from
> > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > org.apache.solr.common.SolrException:
> > org.apache.tika.exception.TikaException: Unexpected RuntimeException
> from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:230)
> > at
> > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > ntentStreamHandlerBase.java:74)
> > at
> > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > erBase.java:129)
> > at
> > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > Request(RequestHandlers.java:240)
> > at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > .java:454)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > r.java:275)
> > at
> > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > Handler.java:1337)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > :484)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:119)
> > at
> >
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java
> :524)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > er.java:233)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > er.java:1065)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > 413)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > r.java:192)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > r.java:999)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:117)
> > at
> > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > extHandlerCollection.java:250)
> > at
> > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > ection.java:149)
> > at
> > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > .java:111)
> > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > tHttpConnection.java:454)
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > gHttpConnection.java:47)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > ctHttpConnection.java:890)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > Complete(AbstractHttpConnection.java:944)
> > at
> > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > at
> > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230
> > )
> >
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > nnection.java:66)
> > at
> > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > cketConnector.java:254)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > l.java:599)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > .java:534)
> > at java.lang.Thread.run(Unknown Source)
> > Caused by: org.apache.tika.exception.TikaException: Unexpected
> RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > )
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > at
> > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > 20)
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:224)
> > ... 31 more
> > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > at
> > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > at
> > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > at
> > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > tType.java:56)
> > at
> > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > escriptor.java:38)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > tion(CharacterSprmUncompressor.java:582)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > acterSprmUncompressor.java:65)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > )
> > at
> > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > at
> > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > va:77)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :185)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :160)
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > ... 34 more
> >
> >
> > Warm regards,
> > Alex
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
>
--
Sergey Vladimirov
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org
Re: Bug 53380
Posted by Sergey Vladimirov <vl...@gmail.com>.
Hi
I will take a look into it today or tomorrow.
Sorry for the long waiting
Best regards,
Sergey
On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Dear Yegor,
>
> Thank you for your reply. If I knew enough about Java, I'd go in and fix
> it :)
> Just happy to have you guys providing such a great tool. Thanks and keep
> up the great work.
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
> Sent: 10 September 2012 10:16 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> We have all pre-requisites for fixing this bug, just need to find a person
> to do it :)
>
> POI is a volunteer project and if this problem is important for you,
> please do work on it and submit a patch. Otherwise please wait.
> Unfortuntaly we don't have a active developer working on DOC/DOCX modules,
> so fixing may take some time.
>
> Yegor
>
> On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org> wrote:
> > Hi. I'm having the same issue from this bug with hundreds of our DOC
> > files being fed through Solr/Tika:
> > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> >
> > I downloaded the DOC file attached to the ticket and was able to
> generate the same error we've been getting (please see below for the
> exception).
> >
> > Anyone know of a solution/workaround? Is there a timeline for a fix? I
> commented and voted on the ticket but not sure if it's a priority. Thanks.
> >
> > org.apache.tika.exception.TikaException
> > : Unexpected RuntimeException from
> > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > org.apache.solr.common.SolrException:
> > org.apache.tika.exception.TikaException: Unexpected RuntimeException
> from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:230)
> > at
> > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > ntentStreamHandlerBase.java:74)
> > at
> > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > erBase.java:129)
> > at
> > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > Request(RequestHandlers.java:240)
> > at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > .java:454)
> > at
> > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > r.java:275)
> > at
> > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > Handler.java:1337)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > :484)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:119)
> > at
> >
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > er.java:233)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > er.java:1065)
> > at
> > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > 413)
> > at
> > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > r.java:192)
> > at
> > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > r.java:999)
> > at
> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > ava:117)
> > at
> > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > extHandlerCollection.java:250)
> > at
> > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > ection.java:149)
> > at
> > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > .java:111)
> > at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > tHttpConnection.java:454)
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > gHttpConnection.java:47)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > ctHttpConnection.java:890)
> > at
> > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > Complete(AbstractHttpConnection.java:944)
> > at
> > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > at
> > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
> >
> > at
> > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > nnection.java:66)
> > at
> > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > cketConnector.java:254)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > l.java:599)
> > at
> > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > .java:534)
> > at java.lang.Thread.run(Unknown Source)
> > Caused by: org.apache.tika.exception.TikaException: Unexpected
> RuntimeException
> > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > )
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > at
> > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > 20)
> > at
> > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > actingDocumentLoader.java:224)
> > ... 31 more
> > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > at
> > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > at
> > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > at
> > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > tType.java:56)
> > at
> > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > escriptor.java:38)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > tion(CharacterSprmUncompressor.java:582)
> > at
> > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > acterSprmUncompressor.java:65)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > at
> > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > )
> > at
> > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > at
> > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > va:77)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :185)
> > at
> > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > :160)
> > at
> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > )
> > ... 34 more
> >
> >
> > Warm regards,
> > Alex
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional
> commands, e-mail: user-help@poi.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
> For additional commands, e-mail: user-help@poi.apache.org
>
>
--
Sergey Vladimirov
RE: Bug 53380
Posted by Alex Cougarman <ac...@bwc.org>.
Dear Yegor,
Thank you for your reply. If I knew enough about Java, I'd go in and fix it :)
Just happy to have you guys providing such a great tool. Thanks and keep up the great work.
Warm regards,
Alex
-----Original Message-----
From: Yegor Kozlov [mailto:yegor.kozlov@dinom.ru]
Sent: 10 September 2012 10:16 AM
To: POI Users List
Subject: Re: Bug 53380
We have all pre-requisites for fixing this bug, just need to find a person to do it :)
POI is a volunteer project and if this problem is important for you, please do work on it and submit a patch. Otherwise please wait.
Unfortuntaly we don't have a active developer working on DOC/DOCX modules, so fixing may take some time.
Yegor
On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Hi. I'm having the same issue from this bug with hundreds of our DOC
> files being fed through Solr/Tika:
> https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
>
> I downloaded the DOC file attached to the ticket and was able to generate the same error we've been getting (please see below for the exception).
>
> Anyone know of a solution/workaround? Is there a timeline for a fix? I commented and voted on the ticket but not sure if it's a priority. Thanks.
>
> org.apache.tika.exception.TikaException
> : Unexpected RuntimeException from
> org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> org.apache.solr.common.SolrException:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> actingDocumentLoader.java:230)
> at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> ntentStreamHandlerBase.java:74)
> at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> erBase.java:129)
> at
> org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> Request(RequestHandlers.java:240)
> at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> at
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> .java:454)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> r.java:275)
> at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> Handler.java:1337)
> at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> :484)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> ava:119)
> at
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
> at
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> er.java:233)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> er.java:1065)
> at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> 413)
> at
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> r.java:192)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> r.java:999)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> ava:117)
> at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> extHandlerCollection.java:250)
> at
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> ection.java:149)
> at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> .java:111)
> at org.eclipse.jetty.server.Server.handle(Server.java:351)
> at
> org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> tHttpConnection.java:454)
> at
> org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> gHttpConnection.java:47)
> at
> org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> ctHttpConnection.java:890)
> at
> org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> Complete(AbstractHttpConnection.java:944)
> at
> org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> at
> org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
>
> at
> org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> nnection.java:66)
> at
> org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> cketConnector.java:254)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> l.java:599)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> .java:534)
> at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException
> from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> )
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> )
> at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> 20)
> at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> actingDocumentLoader.java:224)
> ... 31 more
> Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> at
> org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> at
> org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> at
> org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> tType.java:56)
> at
> org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> escriptor.java:38)
> at
> org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> tion(CharacterSprmUncompressor.java:582)
> at
> org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> acterSprmUncompressor.java:65)
> at
> org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> at
> org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> )
> at
> org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> at
> org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> va:77)
> at
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> :185)
> at
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> :160)
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> )
> ... 34 more
>
>
> Warm regards,
> Alex
>
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org For additional commands, e-mail: user-help@poi.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org
Re: Bug 53380
Posted by Yegor Kozlov <ye...@dinom.ru>.
We have all pre-requisites for fixing this bug, just need to find a
person to do it :)
POI is a volunteer project and if this problem is important for you,
please do work on it and submit a patch. Otherwise please wait.
Unfortuntaly we don't have a active developer working on DOC/DOCX
modules, so fixing may take some time.
Yegor
On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <ac...@bwc.org> wrote:
> Hi. I'm having the same issue from this bug with hundreds of our DOC files being fed through Solr/Tika: https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
>
> I downloaded the DOC file attached to the ticket and was able to generate the same error we've been getting (please see below for the exception).
>
> Anyone know of a solution/workaround? Is there a timeline for a fix? I commented and voted on the ticket but not sure if it's a priority. Thanks.
>
> org.apache.tika.exception.TikaException
> : Unexpected RuntimeException from
> org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> org.apache.solr.common.SolrException:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> actingDocumentLoader.java:230)
> at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> ntentStreamHandlerBase.java:74)
> at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> erBase.java:129)
> at
> org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> Request(RequestHandlers.java:240)
> at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> at
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> .java:454)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> r.java:275)
> at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> Handler.java:1337)
> at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> :484)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> ava:119)
> at
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
> at
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> er.java:233)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> er.java:1065)
> at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> 413)
> at
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> r.java:192)
> at
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> r.java:999)
> at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> ava:117)
> at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> extHandlerCollection.java:250)
> at
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> ection.java:149)
> at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> .java:111)
> at org.eclipse.jetty.server.Server.handle(Server.java:351)
> at
> org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> tHttpConnection.java:454)
> at
> org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> gHttpConnection.java:47)
> at
> org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> ctHttpConnection.java:890)
> at
> org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> Complete(AbstractHttpConnection.java:944)
> at
> org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> at
> org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
>
> at
> org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> nnection.java:66)
> at
> org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> cketConnector.java:254)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> l.java:599)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> .java:534)
> at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException
> from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> )
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> )
> at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> 20)
> at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> actingDocumentLoader.java:224)
> ... 31 more
> Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> at
> org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> at
> org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> at
> org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> tType.java:56)
> at
> org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> escriptor.java:38)
> at
> org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> tion(CharacterSprmUncompressor.java:582)
> at
> org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> acterSprmUncompressor.java:65)
> at
> org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> at
> org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> )
> at
> org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> at
> org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> va:77)
> at
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> :185)
> at
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> :160)
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> )
> ... 34 more
>
>
> Warm regards,
> Alex
>
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org