You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@openoffice.apache.org by bu...@apache.org on 2015/03/16 04:33:58 UTC

[Issue 126178] New: mixed language with different font export pdf result in question mark

https://bz.apache.org/ooo/show_bug.cgi?id=126178

          Issue ID: 126178
        Issue Type: DEFECT
           Summary: mixed language with different font export pdf result
                    in question mark
           Product: Writer
           Version: 4.2.0-dev
          Hardware: PC
                OS: Linux
            Status: UNCONFIRMED
          Severity: normal
          Priority: P5
         Component: save-export
          Assignee: issues@openoffice.apache.org
          Reporter: zb@bisp.com

Created attachment 84591
  --> https://bz.apache.org/ooo/attachment.cgi?id=84591&action=edit
example text and style, I can only submit one attachment, you can export to pdf
youself.

When a style contains different language, with different font(one Chinese font,
one westen font, if all use Chinese font, no problem) applied to a mixed
paragraph. export to pdf will result in a missing of font. The mixed para
mentioned above contains Chinese pinyin, which with code not in ascii. 

my openoffice is:
Version: 4.2.7.2
Build ID: 420m0(Build:2)

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #2 from orcmid <or...@apache.org> ---
Created attachment 84593
  --> https://bz.apache.org/ooo/attachment.cgi?id=84593&action=edit
The PDF Exported Directly from the Test Document

After opening the document in AOO 4.1.1, I immediately exported the document to
PDF.  This image is of the PDF as opened in Adobe Reader XI 11.0.10 for
Windows.

Note that the Roman alphabet characters are now in different fonts, with the
Courier replaced with a fixed-pitch sans serif font.

>From these, it is not clear what the defect is, and whether the same defect
appears in AOO 4.1.1.

To demonstrate what happens with AOO 4.2.0-dev, it would be useful to provide
comparable screen captures so that we can discern exactly what is happening.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #7 from orcmid <or...@apache.org> ---
(In reply to zb from comment #6)
> Created attachment 84598 [details]
> my pdf export with apache openoffice 4.1.1
> 
>   As I use ubuntu, my default openoffice is libreoffice. Now I uninstall
> libreoffice, install apache openoffice 4.1.1. This is the pdf export with
> apache openoffice 4.1.1. question mark replaced with blank space.

Is Untitled2.odt different than Untitled1.odt in any respect?

You saw how Untitled1.odt and the PDF from it appears with Apache OpenOffice
4.1.1 on Windows 8.1.  I looked in the XML of Untitle1.odt and I see the text
in the file (in Unicode) that appears exactly as shown in my screen captures.
There is a font difference, but the characters are all presented correctly,
including the ā.

I will look further to see what had the PDF on Windows change the font (to
monospace sans serif) but have the characters be correct.

Someone else will have to check on what may be difference when AOO is compiled
for Linux.

But we now know the defect is not a regression in 4.2.0-dev, it is in 4.1.1.
Thank you for the effort it took to confirm that.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #12 from zb <zb...@bisp.com> ---
I don't know. You can check your pdf, if Courier is not embeded, and your
windows does not have Courier(but this seems impossible), I think adobe will
substitute Courier with other available font.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #6 from zb <zb...@bisp.com> ---
Created attachment 84598
  --> https://bz.apache.org/ooo/attachment.cgi?id=84598&action=edit
my pdf export with apache openoffice 4.1.1

  As I use ubuntu, my default openoffice is libreoffice. Now I uninstall
libreoffice, install apache openoffice 4.1.1. This is the pdf export with
apache openoffice 4.1.1. question mark replaced with blank space.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

orcmid <or...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
     Ever confirmed|0                           |1
             Status|UNCONFIRMED                 |CONFIRMED

--- Comment #11 from orcmid <or...@apache.org> ---
(In reply to zb from comment #9)
>   ā in duān is u+0101, this font is available in Courier font. 
>   In libreoffice export, Courier font is embeded in the pdf, but cmap for ā
> is not defined.
>   In AOO 4.1.1 export, Courier font is not embeded, but just a reference,
> encoding for Courier font is WinAnsiEncoding, this encoding does not include
> ā(u+0101).
> [ ... ] So I think encoding for
> English language should extended to unicode when export pdf. BTW duān
> display good in odt file, and rendered in Courier font.

Agreed.  It does not make sense to restrict to WinAnsi in the export,
especially because the ODT has no such restriction and the Unicode for ā is
present and understood in the paragraph-content stream.

It is intriguing that on Windows, there is a font substitution in the line that
is Courier in the ODT, using a Sans Serif font that covers Unicode.  Do you
think it is the Adobe Reader that is accomplishing this on Windows?

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #9 from zb <zb...@bisp.com> ---
  ā in duān is u+0101, this font is available in Courier font. 
  In libreoffice export, Courier font is embeded in the pdf, but cmap for ā is
not defined.
  In AOO 4.1.1 export, Courier font is not embeded, but just a reference,
encoding for Courier font is WinAnsiEncoding, this encoding does not include
ā(u+0101).
  The problem is whether you define ā as English language or as Asian
language(Chinese) in the style definition.  If ā defined as English language,
the encoding for Courier should not limited to ascii, if ā defined as Asian
language, ā should be rendered with Chinese font instead of Courier font(this
is why the second line is good). I prefer the first, because symbol like ā used
in Chinese pinyin are available in Courier font, and more beautiful when they
are rendered in same font. So I think encoding for English language should
extended to unicode when export pdf. BTW duān display good in odt file, and
rendered in Courier font.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

orcmid <or...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |orcmid@apache.org

--- Comment #1 from orcmid <or...@apache.org> ---
Created attachment 84592
  --> https://bz.apache.org/ooo/attachment.cgi?id=84592&action=edit
How the ODT Looks in 4.1.1 on Windows 8.1

This image is a screen capture of how the sample document opens in Apache
OpenOffice 4.1.1 directly.  I named the file differently and did nothing else
with it.

Note that the "This is Wrong" line appears in Courier.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #8 from zb <zb...@bisp.com> ---
No Untitled2.odt at all, Untitled2.pdf is exported from Untitled1.odt. just
renamed.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #10 from orcmid <or...@apache.org> ---
(In reply to zb from comment #8)
> No Untitled2.odt at all, Untitled2.pdf is exported from Untitled1.odt. just
> renamed.

Good.  So we have a stable view of things.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #4 from zb <zb...@bisp.com> ---
Created attachment 84594
  --> https://bz.apache.org/ooo/attachment.cgi?id=84594&action=edit
This is my export.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #5 from orcmid <or...@apache.org> ---
(In reply to zb from comment #4)
> Created attachment 84594 [details]
> This is my export.

There appear to be multiple defects.

 1. On the Linux system used, there are visible differences in the capabilities
of fonts, such that certain characters do not render correctly.  The "a" with
over-bar in the first line is an example of that.  These could be problems with
the Acrobat Reader or with the way font dependencies are handled in the Export.
It could be that a font substitution failed.

 2. There appear to be style conflicts that have the first line render
differently in different exports, and this might have to do with font
substitution also.

 3. The defect is still not identified clearly, so its reproduction is
reliable.

REQUEST

Please use Apache OpenOffice *4.1.1* to do a PDF export from the same document.
 Provide a *screen capture* of that result (not the PDF).  We need to eliminate
differences in how the PDF may render on different machines and we need to see
exactly what you see on your system.

By using AOO 4.1.1 we will be able to determine whether this is a regression or
it is something else having to do with style-handling that applies to your
multi-lingual document.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126178] mixed language with different font export pdf result in question mark

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=126178

--- Comment #3 from orcmid <or...@apache.org> ---
(In reply to orcmid from comment #2)
> Created attachment 84593 [details]

> From these, it is not clear what the defect is, and whether the same defect
> appears in AOO 4.1.1.
To clarify.  Is the defect also present in AOO 4.1.1, or is what I provide
considered correct behavior and there is a regression in AOO 4.2.0-dev (the
build you are checking)?

-- 
You are receiving this mail because:
You are the assignee for the issue.