You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@poi.apache.org by bu...@apache.org on 2011/11/05 11:55:13 UTC

DO NOT REPLY [Bug 51803] Content on master slide is not extracted

https://issues.apache.org/bugzilla/show_bug.cgi?id=51803

mikemccand <lu...@mikemccandless.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|WORKSFORME                  |

--- Comment #2 from mikemccand <lu...@mikemccandless.com> 2011-11-05 10:55:13 UTC ---
I think there is still a problem here: with the example PPT I
attached, I see boiler-plate text when I run PowerPointExtract (which
does set to flag to include master slide text, in its static main
method).

I see code in HSLF for detecting that a given Shape is a placeholder
(MasterSheet.isPlaceholder), so it seems possible we can avoid
extracting such text?  But I'm not familiar enough with the APIs, eg
when Sheet.findTextRuns is invoked for a MasterSlide, how can it get
the Shape for each run and then skip its text if it's a placeholder?

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org