Posted to dev@poi.apache.org by bu...@apache.org on 2011/12/19 19:13:25 UTC

DO NOT REPLY [Bug 52367] New: PPT: text extraction missing "update automatically" dates/times

https://issues.apache.org/bugzilla/show_bug.cgi?id=52367

             Bug #: 52367
           Summary: PPT: text extraction missing "update automatically"
                    dates/times
           Product: POI
           Version: 3.8-dev
          Platform: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HSLF
        AssignedTo: dev@poi.apache.org
        ReportedBy: albert@logik.com
    Classification: Unclassified


Created attachment 28087
  --> https://issues.apache.org/bugzilla/attachment.cgi?id=28087
sample of file that fails to be text extracted

When extracting text from a PPT file, dates/times that were inserted with the
"update automatically" option are missing from the extracted text.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 52367] PPT: text extraction missing "update automatically" dates/times

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=52367

Yegor Kozlov <ye...@dinom.ru> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |WONTFIX
         OS/Version|                            |All

--- Comment #1 from Yegor Kozlov <ye...@dinom.ru> 2012-01-15 12:07:35 UTC ---
Automatic date/time text cannot be extracted from .ppt files because it is not
stored in the file. The viewing application (PowerPoint, OpenOffice, whatever)
is responsible for interpreting "automatic" text elements and showing the
current date/time. POI does not interpret the format; it can read and extract
data but does not support all features available in MS Office.

Note the difference between auto text in the PPT and PPTX formats: the PPTX
format always stores cached text, i.e. the last value seen by PowerPoint, and
this is the text that is extracted. The PPT format does not store a cached value.

Yegor
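
The difference described in comment #1 can be illustrated with a small,
self-contained sketch. It does not use POI and does not reflect POI's internal
classes: AutoFieldSketch, Run, literal, autoDate and extract are hypothetical
names chosen for illustration, and the date string is made-up sample data. The
assumption is only what the comment states: a PPTX-style field carries a cached
value, a PPT-style field carries none, and an extractor emits only stored text.

```java
import java.util.Arrays;
import java.util.List;

public class AutoFieldSketch {

    /** Hypothetical text run: literal text or an auto-updating date field. */
    static final class Run {
        final boolean autoField;
        final String storedText; // null when the file stores no text for the run

        private Run(boolean autoField, String storedText) {
            this.autoField = autoField;
            this.storedText = storedText;
        }

        static Run literal(String text) {
            return new Run(false, text);
        }

        /** Pass null to model a PPT-style field with no cached value. */
        static Run autoDate(String cachedValueOrNull) {
            return new Run(true, cachedValueOrNull);
        }
    }

    /** Emits only what is stored; it never computes the current date. */
    static String extract(List<Run> runs) {
        StringBuilder sb = new StringBuilder();
        for (Run run : runs) {
            if (run.storedText != null) {
                sb.append(run.storedText);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // PPT-like: the field is present but has no cached text.
        List<Run> pptLike = Arrays.asList(
                Run.literal("Updated: "), Run.autoDate(null));
        // PPTX-like: the field carries the last value the application showed.
        List<Run> pptxLike = Arrays.asList(
                Run.literal("Updated: "), Run.autoDate("12/19/2011"));

        System.out.println("[" + extract(pptLike) + "]");  // prints "[Updated: ]"
        System.out.println("[" + extract(pptxLike) + "]"); // prints "[Updated: 12/19/2011]"
    }
}
```

In this model the only way to get a date out of the PPT-like input would be for
the extractor to render the field itself, which is the interpretation step the
comment says is left to the viewing application.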
