You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Jebarlin Robertson <je...@gmail.com> on 2012/11/02 13:35:02 UTC

Re: Source code package to retrieve only the text from 2003 documents (word, ppt & xsl)

Thanks Nick for your reply.
Anyway I am trying to remove myself the code which is not required for text
extraction.
Thanks Nick once again.


On Wed, Oct 31, 2012 at 4:54 PM, Nick Burch <ap...@gagravarr.org> wrote:

> On Wed, 31 Oct 2012, Jebarlin Robertson wrote:
>
>> I Just want to retrieve only the text part from the 2003 documents (Word,
>> Excel and PowerPoint). There are some TextExtarctor APIs, I can use that.
>> But I do not want the whole source code (Whole parser) to use in my
>> project. Can I get only that particular code to retrieve the text part
>> from
>> the document.
>>
>
> Sadly not - the text extraction builds on top of the rest of the code of
> the project. You can only extract the text once you have processed the
> file, and that needs the bulk of the library
>
> Nick
>
> ------------------------------**------------------------------**---------
> To unsubscribe, e-mail: user-unsubscribe@poi.apache.**org<us...@poi.apache.org>
> For additional commands, e-mail: user-help@poi.apache.org
>
>