You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Vicente <ce...@hotmail.com> on 2014/02/05 17:06:31 UTC

extract metadata when using XSSF and SAX (Event API)

I'm using XSSF and SAX (Event API) to extract the content from xlsx files, 
these files are really big, and the only way that I have seeing to extract 
the content without generating an OutOfMemory is using the SAX (Event API), 
it seems to work very nice.

But, I'm wondering if there is a way to extract the metadata from the same 
file, for example, author, lastupdate, etc, custom metadata, core metadata, 
etc.

I know that there is a way to extract it using XSSFWorkbook but I cannot use 
that, I'm using  big files and it make an OutOfMemory.

Please let me know if this approach is possible, I have been searching in the 
web and all the solutions are using XSSFWorkbook.

Thank you!!!


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org


Re: extract metadata when using XSSF and SAX (Event API)

Posted by Nick Burch <ap...@gagravarr.org>.
On Wed, 5 Feb 2014, Vicente wrote:
> But, I'm wondering if there is a way to extract the metadata from the 
> same file, for example, author, lastupdate, etc, custom metadata, core 
> metadata, etc.

Can you not just open the OPCPackage, and grab the metadata parts from 
that? Or if really needed, just treat the .xlsx file as a zip and get the 
parts directly? (They have well known names, normally...)

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org