You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by io...@group.novartis.com on 2004/10/25 16:55:27 UTC

Need advice: what Word/Excel/PowerPoint lib to use?

Hello all,

I need a piece of advice/experience again..

What ms Word/Excel/PowerPoint parsers (written in java) u'd recommend?

Thanks in advance
J.



---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by Andrzej Bialecki <ab...@getopt.org>.
Ryan Ackley wrote:
> Their API is amazing. However, you run into the same problems that you 
> do when you automate MS Office using VBA. Which is instability and 
> everything is single-threaded. Your are basically automating a gui 
> application.
> 

AFAIK they don't provide a separate converters' API, which is a pity.

The results of the conversion are I think one of the best, if not the 
best, that you can get with OpenSource tools, but the fact that all 
converters are so entangled with the rest of the application practically 
excludes their use in high-volume applications...

-- 
Best regards,
Andrzej Bialecki

-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by Ryan Ackley <sa...@cfl.rr.com>.
Their API is amazing. However, you run into the same problems that you do 
when you automate MS Office using VBA. Which is instability and everything 
is single-threaded. Your are basically automating a gui application.

-Ryan

----- Original Message ----- 
From: "Genty Jean-Paul" <ge...@libertysurf.fr>
To: "Lucene Users List" <lu...@jakarta.apache.org>
Sent: Monday, October 25, 2004 1:17 PM
Subject: Re: Need advice: what Word/Excel/PowerPoint lib to use?


> At 19:42 25/10/2004, you wrote:
>>>At 17:05 25/10/2004, you wrote:
>>>
>>>>of course POI, for open source.
>>>>There are some commercial products based on POI also.
>>>>
>>>>for WORD consider textmining.org
>>>>for XLS, POI does anything you need
>>>>for powerpoint  there is one commercial (it's about 1000$), but you can 
>>>>also find some source code in archives.
>>>
>>>
>>>  And what do you think about using Open Office's UNO APIs  ?
>>
>>I didn't knew about them. Are they implemented in Java?
>
> Yes
>  Check out  http://api.openoffice.org/ , They have good examples, I can 
> also provide you my small test.
>  You can do some amazing things with their API.
>
>>Do they support all MSOffice formats (97/2000/XP)?
>
> Check http://www.openoffice.org/product/docs/OOoFlyer11s.pdf
>
> Jean-Paul
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by Genty Jean-Paul <ge...@libertysurf.fr>.
At 19:42 25/10/2004, you wrote:
>>At 17:05 25/10/2004, you wrote:
>>
>>>of course POI, for open source.
>>>There are some commercial products based on POI also.
>>>
>>>for WORD consider textmining.org
>>>for XLS, POI does anything you need
>>>for powerpoint  there is one commercial (it's about 1000$), but you can 
>>>also find some source code in archives.
>>
>>
>>  And what do you think about using Open Office's UNO APIs  ?
>
>I didn't knew about them. Are they implemented in Java?

Yes
  Check out  http://api.openoffice.org/ , They have good examples, I can 
also provide you my small test.
  You can do some amazing things with their API.

>Do they support all MSOffice formats (97/2000/XP)?

Check http://www.openoffice.org/product/docs/OOoFlyer11s.pdf

Jean-Paul




---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by Sergiu Gordea <gs...@ifit.uni-klu.ac.at>.
Genty Jean-Paul wrote:

> At 17:05 25/10/2004, you wrote:
>
>> of course POI, for open source.
>> There are some commercial products based on POI also.
>>
>> for WORD consider textmining.org
>> for XLS, POI does anything you need
>> for powerpoint  there is one commercial (it's about 1000$), but you 
>> can also find some source code in archives.
>
>
>  And what do you think about using Open Office's UNO APIs  ?

I didn't knew about them. Are they implemented in Java?
Do they support all MSOffice formats (97/2000/XP)?

 Sergiu

>
>  If someone did, does it scale well ? (I just did some unit testing )
>
> Jean-Paul 
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>



---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by Genty Jean-Paul <ge...@libertysurf.fr>.
At 17:05 25/10/2004, you wrote:
>of course POI, for open source.
>There are some commercial products based on POI also.
>
>for WORD consider textmining.org
>for XLS, POI does anything you need
>for powerpoint  there is one commercial (it's about 1000$), but you can 
>also find some source code in archives.

  And what do you think about using Open Office's UNO APIs  ?

  If someone did, does it scale well ? (I just did some unit testing )

Jean-Paul  



---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: Need advice: what Word/Excel/PowerPoint lib to use?

Posted by sergiu gordea <gs...@ifit.uni-klu.ac.at>.
of course POI, for open source.
There are some commercial products based on POI also.

for WORD consider textmining.org
for XLS, POI does anything you need
for powerpoint  there is one commercial (it's about 1000$), but you can 
also find some source code in archives.

 All the best,

  Sergiu
 

iouli.golovatyi@group.novartis.com wrote:

>Hello all,
>
>I need a piece of advice/experience again..
>
>What ms Word/Excel/PowerPoint parsers (written in java) u'd recommend?
>
>Thanks in advance
>J.
>
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
>For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>  
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org