You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2011/12/19 19:13:25 UTC
DO NOT REPLY [Bug 52367] New: PPT: text extraction missing "update
automatically" dates/times
https://issues.apache.org/bugzilla/show_bug.cgi?id=52367
Bug #: 52367
Summary: PPT: text extraction missing "update automatically"
dates/times
Product: POI
Version: 3.8-dev
Platform: PC
Status: NEW
Severity: normal
Priority: P2
Component: HSLF
AssignedTo: dev@poi.apache.org
ReportedBy: albert@logik.com
Classification: Unclassified
Created attachment 28087
--> https://issues.apache.org/bugzilla/attachment.cgi?id=28087
sample of file that fails to be text extracted
When text extracting a PPT file, dates/times that are inserted with "update
automatically" are not text extracted.
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org
DO NOT REPLY [Bug 52367] PPT: text extraction missing "update
automatically" dates/times
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=52367
Yegor Kozlov <ye...@dinom.ru> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Resolution| |WONTFIX
OS/Version| |All
--- Comment #1 from Yegor Kozlov <ye...@dinom.ru> 2012-01-15 12:07:35 UTC ---
Automatic date/time text from .ppt files cannot be extracted because it is not
stored in the file. The viewing application (PowerPoint, OpenOffice, whatever)
is responsible for interpretation of "automatic" text elements and showing
current date/time. POI does not interpret the format, it can read and extract
data but does not support all features available in MS Office.
Note the different between auto text in PPT and PPTX formats: the PPTX format
always stores cached text, i.e. the last value seen by PowerPoint and this is
the text that is extracted. The PPT format does not store cached value.
Yegor
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org