You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2014/03/20 13:17:33 UTC

[Bug 56294] New: Cannot parse an XLSX

https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

            Bug ID: 56294
           Summary: Cannot parse an XLSX
           Product: POI
           Version: 3.10
          Hardware: PC
            Status: NEW
          Severity: critical
          Priority: P2
         Component: XSSF
          Assignee: dev@poi.apache.org
          Reporter: eamonny@thyme-it.ie

I am trying to open a workbook in an XLSX format in order to parse it and
retrieve certain elements from certain cells. The file is provided by a
customer. Please find (a modified version of) the file attached. I am able to
open the file directly in Excel, but not in POI.

I'm running Java version jdk1.6.0_23 and have installed version 3.10 of the POI
library package.

When attempting to parse the file using
"org.apache.poi.xssf.usermodel.XSSFWorkbook", Memory usage jumps from under
200KB to over 1.1GB. The CPU also jumps to 100% capacity.

The program then stalls and does not recover. The Debug window in Eclipse shows
a large number (380) of Threads as "Running".

I would really appreciate it if you could investigate the cause behind this
issue.

Thanks in advance,
Eamonn.

P.s. I looked at bug 55769, but we do not get any error message - Java just
hangs...

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

Nick Burch <ap...@gagravarr.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO
                 OS|                            |All

--- Comment #1 from Nick Burch <ap...@gagravarr.org> ---
Can you try bumping up the heap size more, to see if that helps?

Can you run the jvm so it reports when it does garbage collection, so you can
see if the hang is due to terminal-GC or something else?

If you take a thread dump, what are the threads doing when it hangs?

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

--- Comment #5 from Nick Burch <ap...@gagravarr.org> ---
Have you tried following the suggestions/strategy from the FAQ entry "I think
POI is using too much memory! What can I do?" ?
http://poi.apache.org/faq.html#faq-N10109

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

--- Comment #2 from Eamonn Young <ea...@thyme-it.ie> ---
(In reply to Nick Burch from comment #1)
> Can you try bumping up the heap size more, to see if that helps?
> 
> Can you run the jvm so it reports when it does garbage collection, so you
> can see if the hang is due to terminal-GC or something else?
> 
> If you take a thread dump, what are the threads doing when it hangs?

Nick,

Thanks a million for your swift response. We increased the heap size and that
allows us to process the file. So thanks for the suggestion. It is slow, but it
gets there.

Thanks again,
Eamonn.

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

--- Comment #4 from Arik <ar...@hp.com> ---
Hi,

We encountered the same behavior, 25 MB file with 382K rows and 15 columns
each.

anything lower than that and we get OOM exception.

We are using POI-3.10-FINAL on JDK 7 with the latest patch.

Please advise.

Thank you,
Arik

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

--- Comment #3 from Nick Burch <ap...@gagravarr.org> ---
OK, that's good. Next thing to check - is the problem with your code or file?

There's two example programs that can be used to check generation speed and
reading speed, try using SSPerformanceTest to check how long to generate a
simple file of your desired size. Next check if your program can load it
quicker or not. Now, use XLSX2CSV to read both your file and your generated
test file - how slow/quick, and are they about the same speed?

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 56294] Cannot parse an XLSX

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=56294

Andreas Beeker <an...@gmx.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |RESOLVED
         Resolution|---                         |LATER

--- Comment #6 from Andreas Beeker <an...@gmx.de> ---
There was no feedback over the last 5 month, so I'm closing this now.
We can re-open it later anyways ...

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org