Posted to dev@poi.apache.org by bu...@apache.org on 2009/01/21 20:46:42 UTC

DO NOT REPLY [Bug 46577] New: POI engine logs errors about invalid uri files on certain office 2007 documents

https://issues.apache.org/bugzilla/show_bug.cgi?id=46577

           Summary: POI engine logs errors about invalid uri files on
                    certain office 2007 documents
           Product: POI
           Version: 3.5-dev
          Platform: PC
        OS/Version: Windows XP
            Status: NEW
          Severity: critical
          Priority: P1
         Component: POI Overall
        AssignedTo: dev@poi.apache.org
        ReportedBy: sreeni@sendmail.com


Created an attachment (id=23152)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=23152)
PPTX file to be extracted

Please use the attached Office 2007 pptx and docx files and try to extract the
text. You will see log output like the following:


127.0.0.1:41812-m9L4iMsg015630] INFO  org.openxml4j.opc  - target contains \
therefore not a valid
URIfile:///C:\Ilias\Projects\MERIT\Beam%20Instrumentation%20and%20Optics\MERIT_OpticsSummary.xlsx
replaced by /


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 46577] POI engine logs errors about invalid uri files on certain office 2007 documents

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46577


Yegor Kozlov <ye...@dinom.ru> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID




--- Comment #2 from Yegor Kozlov <ye...@dinom.ru>  2009-04-20 10:47:59 PST ---
I don't see a bug here.

Firstly, both attached files contain absolute references to external resources,
for example, 2007_Calendar.docx contains this one:
  <Relationship Id="rId1"
Type="http://schemas.openxmlformats.org/officeDocument/2006/relationships/attachedTemplate"
Target="file:///C:\Documents%20and%20Settings\Keith%20C.%20Brown\Application%20Data\Microsoft\Templates\2007%20calendar.dotx"
TargetMode="External"/>

POI can only process embedded OPC resources.

Secondly, these absolute references are invalid as they contain backslashes
while only forward slashes are allowed. POI strictly follows the OPC spec and
issues a warning. This is expected behavior.

Yegor
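
[Editor's illustration, not part of the original comment.] The point about
backslashes can be reproduced with the JDK alone. The sketch below is not
POI's or openxml4j's actual code; the class and method names are hypothetical.
It only assumes that java.net.URI rejects a backslash in a path, and it mimics
the "replaced by /" step that the log message describes.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper class, for illustration only.
class TargetUriCheck {

    /** Returns true if java.net.URI accepts the target string as-is. */
    static boolean isValidUri(String target) {
        try {
            new URI(target);
            return true;
        } catch (URISyntaxException e) {
            // A backslash is an illegal character in a URI path.
            return false;
        }
    }

    /** Replaces every backslash with a forward slash, as the log message describes. */
    static String normalize(String target) {
        return target.replace('\\', '/');
    }

    public static void main(String[] args) {
        // Target attribute taken from the Relationship element quoted above.
        String target = "file:///C:\\Documents%20and%20Settings\\Keith%20C.%20Brown"
                + "\\Application%20Data\\Microsoft\\Templates\\2007%20calendar.dotx";

        System.out.println("as written: " + isValidUri(target));
        System.out.println("normalized: " + isValidUri(normalize(target)));
    }
}
```

With a standard JDK the first check should fail and the second should pass,
which matches the warning-then-replace behavior seen in the reporter's log.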



DO NOT REPLY [Bug 46577] POI engine logs errors about invalid uri files on certain office 2007 documents

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46577





--- Comment #1 from sreeni <sr...@sendmail.com>  2009-01-21 11:47:24 PST ---
Created an attachment (id=23153)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=23153)
docx file to be extracted

