Posted to issues@openoffice.apache.org by bu...@apache.org on 2016/03/23 17:55:22 UTC

[Issue 126883] New: PDF export: Invalid characters depending on fonts

https://bz.apache.org/ooo/show_bug.cgi?id=126883

          Issue ID: 126883
        Issue Type: DEFECT
           Summary: PDF export: Invalid characters depending on fonts
           Product: General
           Version: 4.1.2
          Hardware: PC
                OS: Linux 64-bit
            Status: UNCONFIRMED
          Severity: Normal
          Priority: P5 (lowest)
         Component: ui
          Assignee: issues@openoffice.apache.org
          Reporter: david.vogt@adfinis-sygroup.ch

Created attachment 85364
  --> https://bz.apache.org/ooo/attachment.cgi?id=85364&action=edit
Document for reproducing the issue

There is an issue with 4.1.2 on Linux, where apostrophes (') are rendered as
copyright signs (©) in the generated PDF output.

It seems to be related to font embedding: with embedding enabled the problem
occurs; with it disabled the PDF looks clean.

This only seems to happen with some fonts, but not others. I've successfully
reproduced this with the official AOO build on CentOS 6.7 with the "Helvetica"
font. Other fonts did not exhibit the problem however.

I'm attaching a test document as well as the generated PDF for reference.

This *could* be related to #125012, but I'm not 100% sure, so I'm creating a
new issue instead. Feel free to merge if it is indeed the same.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 126883] PDF export: Invalid characters depending on fonts


--- Comment #11 from david.vogt@adfinis-sygroup.ch ---
Hi all

I've just reproduced it on a plain CentOS 6.7 machine using the upstream RPM
packages, to verify that nothing else is causing the problem.

Reproduced as follows:

Preparations
============

1) Ensure Helvetica font is NOT installed
2) Install 4.1.1 packages (Source: [1])
3) Create test document with several test characters (' " ` ´), and set the
font deliberately to Helvetica (despite it not being installed)
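
Step 1 can also be checked from a script. The following is a minimal sketch, assuming fontconfig's `fc-list` is available and that `fc-list : family` prints one font per line with comma-separated family names. The helper names and the sample output are illustrative only and are not part of OpenOffice.

```python
import subprocess


def installed_families(fc_list_output):
    """Collect lower-cased family names from `fc-list : family` output."""
    families = set()
    for line in fc_list_output.splitlines():
        # One font per line; alternative family names are comma-separated.
        for name in line.split(","):
            name = name.strip()
            if name:
                families.add(name.lower())
    return families


def is_installed(family, fc_list_output):
    """True if `family` appears as an exact family name (case-insensitive)."""
    return family.lower() in installed_families(fc_list_output)


def query_fontconfig():
    """Run fc-list (assumed to be on PATH) and return its text output."""
    result = subprocess.run(
        ["fc-list", ":", "family"], capture_output=True, text=True, check=True
    )
    return result.stdout


# Hypothetical sample output, for illustration only.
SAMPLE = "DejaVu Sans\nLiberation Sans\nNimbus Sans L\nURW Palladio L\n"
print(is_installed("Helvetica", SAMPLE))  # False for this sample
```

On a real machine, `is_installed("Helvetica", query_fontconfig())` would perform the same check against the installed fonts. Note that fontconfig may still substitute another family when Helvetica is requested, which this exact-name check deliberately ignores.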


Test case 1
===========

1) Export as PDF with defaults (no font embedding, no PDF/A-1a)
2) Export as PDF with font embedding
3) Export as PDF/A-1a


Test case 2
===========

1) Uninstall all 4.1.1 packages (yum remove openoffice-*)
2) Install 4.1.2 packages (Source: [2])
3) Repeat steps from test case 1 above


Results
=======

4.1.1 - defaults      - all look good
4.1.1 - PDF/A-1a      - all look good
4.1.1 - font embedded - all look good
4.1.2 - defaults      - all look good
4.1.2 - PDF/A-1a      - broken chars
4.1.2 - font embedded - broken chars
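
A check like this can be automated once the text has been extracted from each exported PDF (for example with a tool such as `pdftotext`; that extraction step is assumed and not shown). The sketch below compares the characters typed into the test document with what came out and reports each substitution. The function name and the sample strings are illustrative, and the "broken" string is only modelled on the apostrophe-to-copyright symptom from the original report.

```python
EXPECTED = "' \" ` ´"


def find_substitutions(expected, extracted):
    """Return (position, expected_char, actual_char) for each mismatch.

    Assumes both strings have the same length, i.e. characters were
    replaced one-for-one rather than dropped or inserted.
    """
    if len(expected) != len(extracted):
        raise ValueError("length differs; one-for-one comparison not possible")
    return [
        (i, want, got)
        for i, (want, got) in enumerate(zip(expected, extracted))
        if want != got
    ]


# Illustrative input: the apostrophe has been replaced by a copyright sign.
broken = "© \" ` ´"
for pos, want, got in find_substitutions(EXPECTED, broken):
    print(f"position {pos}: expected U+{ord(want):04X}, got U+{ord(got):04X}")
# prints: position 0: expected U+0027, got U+00A9
```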


Please let me know if you need anything more to move this to "CONFIRMED" stage.



[1]:
http://downloads.sourceforge.net/project/openofficeorg.mirror/4.1.1/binaries/en-US/Apache_OpenOffice_4.1.1_Linux_x86-64_install-rpm_en-US.tar.gz
[2]:
http://downloads.sourceforge.net/project/openofficeorg.mirror/4.1.2/binaries/en-US/Apache_OpenOffice_4.1.2_Linux_x86-64_install-rpm_en-US.tar.gz

[Issue 126883] PDF export: Invalid characters depending on fonts

mroe <mr...@gmx.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|CONFIRMED                   |UNCONFIRMED
           Keywords|regression                  |
     Ever confirmed|1                           |0

--- Comment #6 from mroe <mr...@gmx.net> ---
Hmm. It's strange.
I have installed AOO 4.1.1 and 4.1.2 both with the same printer settings.
In 4.1.1 it shows me Helvetica, Times and some other fonts as installed printer
fonts. But in 4.1.2 it does not. :-( (I verified it only in 4.1.1 ... Sorry.)
I need to find the reason first, so I am removing the regression keyword.

But I see the problem: Why is there no warning for fonts that are not installed?
It is a long-standing wish of mine that the font listbox shows whether a font is
installed or not (maybe with another colour or a coloured background).

[Issue 126883] PDF export: Invalid characters depending on fonts

oooforum <oo...@free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |oooforum@free.fr

--- Comment #3 from oooforum <oo...@free.fr> ---
I was not able to reproduce this problem with 4.1.2 on Win7.
Are you sure that the Helvetica font is installed on your CentOS system?

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #16 from oooforum (fr) <oo...@free.fr> ---
(In reply to Kay from comment #15)
> Thank you all for your work on this issue. I will investigate what r1705192
> was attempting to fix and see what can be done.
@Kay: have you found a fix?

[Issue 126883] PDF export: Invalid characters depending on fonts

Oliver Sauder <os...@esite.ch> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |os@esite.ch

--- Comment #14 from Oliver Sauder <os...@esite.ch> ---
I have tested this: after reverting the change from
http://svn.apache.org/viewvc?view=revision&revision=1705192 the issue
disappears.

Revision 1705192 seems to have caused this regression.

[Issue 126883] PDF export: Invalid characters depending on fonts

orcmid <or...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |orcmid@apache.org

--- Comment #10 from orcmid <or...@apache.org> ---
(In reply to Kay from comment #9)
> (In reply to mroe from comment #8)
> > (In reply to Kay from comment #7)
> > > (In reply to mroe from comment #6)
> This MAY be possible but we would need to ascertain difficulty. Many
> applications work in much this same way -- attempting to find the "closest"
> match for a non-available font.
> 
> In any case, there is an outdated, though still useful wiki page that delves
> into this very issue. The page could use some updates. I think this
> situation might apply to more platforms and not just *nix.
> 
> https://wiki.openoffice.org/wiki/Font-FAQ#What_is_Font_Fallback_in_OpenOffice.org_2.3F

It's also a little weird in this case because PDFs are expected to have a
soft substitution for classes of fonts too. So there may be more going on in
how PDFs are being produced as well, or there is a settings error in the export
options.

[Issue 126883] PDF export: Invalid characters depending on fonts

Matthias Seidel <ms...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           See Also|                            |https://bz.apache.org/ooo/s
                   |                            |how_bug.cgi?id=127849

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #2 from david.vogt@adfinis-sygroup.ch ---
Created attachment 85366
  --> https://bz.apache.org/ooo/attachment.cgi?id=85366&action=edit
Working PDF (font embedding disabled)

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #1 from david.vogt@adfinis-sygroup.ch ---
Created attachment 85365
  --> https://bz.apache.org/ooo/attachment.cgi?id=85365&action=edit
Broken PDF (with font embedding enabled)

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #19 from Matthias Seidel <ms...@apache.org> ---
(In reply to c.kruk from comment #18)
> Sorry. I posted the above in the wrong thread.

No problem, both issues are related.

[Issue 126883] PDF export: Invalid characters depending on fonts

Matthias Seidel <ms...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |mseidel@apache.org

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #12 from david.vogt@adfinis-sygroup.ch ---
Created attachment 85565
  --> https://bz.apache.org/ooo/attachment.cgi?id=85565&action=edit
Test document and PDF output from reproducing the steps in comment 11

[Issue 126883] PDF export: Invalid characters depending on fonts

oooforum (fr) <oo...@free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |myosotis.arae@gmail.com

--- Comment #20 from oooforum (fr) <oo...@free.fr> ---
*** Issue 127866 has been marked as a duplicate of this issue. ***

[Issue 126883] PDF export: Invalid characters depending on fonts

dyo <ol...@bger.ch> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |olivier.duffey@bger.ch

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #13 from david.vogt@adfinis-sygroup.ch ---
I searched the changes between 4.1.1 and 4.1.2 and found this one that could be
the culprit:

   http://svn.apache.org/viewvc?view=revision&revision=1705192

I'll do some tests to verify this.

[Issue 126883] PDF export: Invalid characters depending on fonts

orcmid <or...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|orcmid@apache.org           |

[Issue 126883] PDF export: Invalid characters depending on fonts

Kay <ks...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |kschenk@apache.org

--- Comment #7 from Kay <ks...@apache.org> ---
(In reply to mroe from comment #6)
> Hmm. It's strange.
> I have installed AOO 4.1.1 and 4.1.2 both with the same printer settings.
> In 4.1.1 it shows me Helvetica, Times and some other fonts as installed
> printer fonts. But in 4.1.2 it does not. :-( (I verified it only in 4.1.1
> ... Sorry.)
> I need to find the reason first, so I am removing the regression keyword.
> 
> But I see the problem: Why is there no warning for fonts that are not
> installed? It is a long-standing wish of mine that the font listbox shows
> whether a font is installed or not (maybe with another colour or a coloured
> background).

Are you saying you want AOO to warn the user when a document specifies a font
that is NOT installed on the user's system?

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #9 from Kay <ks...@apache.org> ---
(In reply to mroe from comment #8)
> (In reply to Kay from comment #7)
> > (In reply to mroe from comment #6)
> > > But I see the problem: Why is there no warning for fonts that are not
> > > installed? It is a long-standing wish of mine that the font listbox
> > > shows whether a font is installed or not (maybe with another colour or a
> > > coloured background).
> > 
> > Are you saying you want AOO to warn the user when a document specifies a
> > font that is NOT installed on the user's system?
> 
> Yes. The user expects the same visual result when exporting the document as
> PDF, whether a font is embedded or not.
> 
> So there are 2 choices: AOO embeds the available font that is actually used,
> with a warning, or it embeds nothing (and exports nothing) and warns that the
> user should disable embedding or reformat the document with available fonts.
> 
> But for the last point the user needs a visual hint that a font in use is not
> installed. (I searched for an existing issue on that, but it seems I sent
> this wish to StarDivision/Sun a long time ago.)

This MAY be possible but we would need to ascertain difficulty. Many
applications work in much this same way -- attempting to find the "closest"
match for a non-available font.

In any case, there is an outdated, though still useful wiki page that delves
into this very issue. The page could use some updates. I think this situation
might apply to more platforms and not just *nix.

https://wiki.openoffice.org/wiki/Font-FAQ#What_is_Font_Fallback_in_OpenOffice.org_2.3F

[Issue 126883] PDF export: Invalid characters depending on fonts

mroe <mr...@gmx.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |CONFIRMED
     Ever confirmed|0                           |1
           Keywords|                            |regression

--- Comment #4 from mroe <mr...@gmx.net> ---
Confirmed with AOO 4.1.2 on Ubuntu 64-bit.
The issue does not occur with AOO 4.1.1, so it is a regression.

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #5 from oooforum (fr) <oo...@free.fr> ---
(In reply to mroe from comment #4)
> Confirmed with AOO 4.1.2 on Ubuntu 64-bit.
The Helvetica font is not free and does not exist under Linux, so could you
explain how you did this?

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #8 from mroe <mr...@gmx.net> ---
(In reply to Kay from comment #7)
> (In reply to mroe from comment #6)
> > But I see the problem: Why is there no warning for fonts that are not
> > installed? It is a long-standing wish of mine that the font listbox shows
> > whether a font is installed or not (maybe with another colour or a coloured
> > background).
> 
> Are you saying you want AOO to warn the user when a document specifies a font
> that is NOT installed on the user's system?

Yes. The user expects the same visual result when exporting the document as PDF,
whether a font is embedded or not.

So there are 2 choices: AOO embeds the available font that is actually used,
with a warning, or it embeds nothing (and exports nothing) and warns that the
user should disable embedding or reformat the document with available fonts.

But for the last point the user needs a visual hint that a font in use is not
installed. (I searched for an existing issue on that, but it seems I sent this
wish to StarDivision/Sun a long time ago.)

[Issue 126883] PDF export: Invalid characters depending on fonts

Kay <ks...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |CONFIRMED
     Ever confirmed|0                           |1

--- Comment #15 from Kay <ks...@apache.org> ---
Thank you all for your work on this issue. I will investigate what r1705192
was attempting to fix and see what can be done.

[Issue 126883] PDF export: Invalid characters depending on fonts

c.kruk <ce...@poczta.wp.pl> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |cezary.kruk@poczta.wp.pl

--- Comment #17 from c.kruk <ce...@poczta.wp.pl> ---
I checked valid and invalid PDFs with the pdffonts program. They differ in
encoding: the valid ones use the Builtin encoding, while the invalid ones use
WinAnsi. It seems I have found the culprit.
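
One way to see how an encoding label alone could produce the symptom from the original report: in Adobe StandardEncoding, which many Type 1 fonts use as their built-in encoding, the straight apostrophe (`quotesingle`) sits at code 169, while in WinAnsiEncoding code 169 is `copyright`. The sketch below hard-codes only those few entries from the encoding tables in the PDF specification. Whether the affected fonts really use StandardEncoding internally, and whether the exporter writes codes that way, are assumptions and have not been established in this thread.

```python
# Partial tables (byte code -> PostScript glyph name), limited to the codes
# relevant here.
STANDARD_ENCODING = {39: "quoteright", 96: "quoteleft", 169: "quotesingle"}
WIN_ANSI_ENCODING = {39: "quotesingle", 96: "grave", 169: "copyright"}


def code_for(glyph, encoding):
    """Return the byte code that selects `glyph` in `encoding`, or None."""
    for code, name in encoding.items():
        if name == glyph:
            return code
    return None


def rendered_glyph(glyph, written_with, labelled_as):
    """Glyph a viewer would pick if the text was encoded with one table
    but the PDF declares another."""
    code = code_for(glyph, written_with)
    return labelled_as.get(code, ".notdef")


print(rendered_glyph("quotesingle", STANDARD_ENCODING, WIN_ANSI_ENCODING))
# prints: copyright
```

When both tables agree (for example WinAnsi on both sides), the same call returns `quotesingle`, which would match the PDFs that look clean.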

$ pdffonts URW_Palladio_L-OpenOffice_4.1.1.pdf
name                                 type              encoding         emb sub uni object ID
------------------------------------ ----------------- ---------------- --- --- --- ---------
URWPalladioL-Bold                    Type 1            Builtin          yes no  yes     19  0
URWPalladioL-Roma                    Type 1            Builtin          yes no  yes     24  0
URWPalladioL-Ital                    Type 1            Builtin          yes no  yes      9  0
URWPalladioL-BoldItal                Type 1            Builtin          yes no  yes     14  0

$ pdffonts URW_Palladio_L-OpenOffice_4.1.2.pdf
name                                 type              encoding         emb sub uni object ID
------------------------------------ ----------------- ---------------- --- --- --- ---------
URWPalladioL-Bold                    Type 1            WinAnsi          yes no  yes     19  0
URWPalladioL-Roma                    Type 1            WinAnsi          yes no  yes     24  0
URWPalladioL-Ital                    Type 1            WinAnsi          yes no  yes      9  0
URWPalladioL-BoldItal                Type 1            WinAnsi          yes no  yes     14  0

$ pdffonts URW_Palladio_L-OpenOffice_4.1.5.pdf
name                                 type              encoding         emb sub uni object ID
------------------------------------ ----------------- ---------------- --- --- --- ---------
URWPalladioL-Bold                    Type 1            WinAnsi          yes no  yes     19  0
URWPalladioL-Roma                    Type 1            WinAnsi          yes no  yes     24  0
URWPalladioL-Ital                    Type 1            WinAnsi          yes no  yes      9  0
URWPalladioL-BoldItal                Type 1            WinAnsi          yes no  yes     14  0
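
The comparison above can be scripted. This is a minimal sketch, assuming the column layout shown in these listings (a header line followed by a dashed ruler line); it parses `pdffonts` text output and reports fonts whose encoding differs between two exports. The function names and the generated sample text are illustrative, and running `pdffonts` itself is left out so the example stays self-contained.

```python
def parse_pdffonts(output):
    """Map font name -> encoding from `pdffonts` text output.

    Column boundaries are derived from the dashed ruler line, so values
    containing spaces (such as "Type 1") do not shift the columns.
    Raises StopIteration if no ruler line is present.
    """
    lines = output.splitlines()
    ruler_idx = next(
        i for i, line in enumerate(lines)
        if line.startswith("---") and set(line) <= {"-", " "}
    )
    # Each run of dashes marks one column: record its (start, end) offsets.
    spans, pos = [], 0
    for run in lines[ruler_idx].split(" "):
        if run:
            spans.append((pos, pos + len(run)))
        pos += len(run) + 1
    headers = [lines[ruler_idx - 1][a:b].strip() for a, b in spans]
    name_a, name_b = spans[headers.index("name")]
    enc_a, enc_b = spans[headers.index("encoding")]
    fonts = {}
    for line in lines[ruler_idx + 1:]:
        if line.strip():
            fonts[line[name_a:name_b].strip()] = line[enc_a:enc_b].strip()
    return fonts


def encoding_changes(before, after):
    """Fonts present in both listings whose encoding differs."""
    return {
        name: (before[name], after[name])
        for name in before
        if name in after and before[name] != after[name]
    }


# Sample text shaped like the listings above (one font only).
RULER = " ".join("-" * width for width in (36, 17, 16, 3, 3, 3, 9))
HEADER = f"{'name':<36} {'type':<17} {'encoding':<16} emb sub uni object ID"


def sample(encoding):
    row = f"{'URWPalladioL-Bold':<36} {'Type 1':<17} {encoding:<16} yes no  yes     19  0"
    return "\n".join([HEADER, RULER, row])


old = parse_pdffonts(sample("Builtin"))
new = parse_pdffonts(sample("WinAnsi"))
print(encoding_changes(old, new))
# prints: {'URWPalladioL-Bold': ('Builtin', 'WinAnsi')}
```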

[Issue 126883] PDF export: Invalid characters depending on fonts

--- Comment #18 from c.kruk <ce...@poczta.wp.pl> ---
Sorry. I posted the above in the wrong thread.
