You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Georg Datterl <ge...@geneon.de> on 2009/07/27 17:23:37 UTC

PDF without a page root?

Hello everybody,

P0900842.pdf obvously has two pages, but PDDocument.load(pdfFile).getDocumentCatalog().getAllPages().size(); returns only one page. The two pages should be Objects 801 and 981. In PDFParser: 451 I can see Object 981 is parsed, but I can't see where it is connected to the parent element. According to Jeremias who had a quick look over it the parent element should be 797 which is not present in the PDF. Root->Pages->Kids contains only Object 801. 

Could somebody on the list please have a look at the PDF and tell me, why it does not work as expected?

Regards,
 
Georg Datterl
 
------ Kontakt ------
 
Georg Datterl
 
Geneon media solutions gmbh
Gutenstetter Straße 8a
90449 Nürnberg
 
HRB Nürnberg: 17193
Geschäftsführer: Yong-Harry Steiert 

Tel.: 0911/36 78 88 - 26
Fax: 0911/36 78 88 - 20
 
www.geneon.de
 
Weitere Mitglieder der Willmy MediaGroup:
 
IRS Integrated Realization Services GmbH:    www.irs-nbg.de 
Willmy PrintMedia GmbH:                            www.willmy.de
Willmy Consult & Content GmbH:                 www.willmycc.de 

Re: PDF without a page root?

Posted by Iain Clapham <ia...@googlemail.com>.
Georg,

I ran it thru my slightly modified PDFTextStripper and it shows up as 
two pages :-

1.1_98.56           SQS herewith certifies that the company named below 
has a management system which meets the 
1.2_111.56          requirements of the normative bases specified below.
1.3_157.198         Georg Fischer Piping Systems Ltd.
1.4_179.198         CH-8201 Schaffhausen
1.5_216.198         Certified area
1.6_243.198         Georg Fischer Piping Systems Ltd., Schaffhausen
1.7_259.198         Georg Fischer Rohrleitungssysteme (Schweiz) AG,
1.8_276.198         Sales Company
1.9_293.198         Georg Fischer Wavin Ltd., Schaffhausen and
1.10_310.198         Subingen
1.11_327.198        Georg Fischer Building Technology Ltd., Schaffhausen
1.12_343.198        Georg Fischer Fluorpolymer Products GmbH,
1.13_360.198         E t t e n h e i m
1.14_401.198        Field of activity
1.15_428.198        The Piping Systems Group develops,  manufactures
1.16_445.198        and distributes plastic or metal components and
1.17_461.198        systems for  conducting, pumping, controlling,
1.18_478.198         measuring and  regulating liquids and gases and  to
1.19_495.198        this end provides comprehensive engineering and
1.20_512.198        related services
1.21_553.198        Normative bases
1.22_584.56         ISO 9001:2008 Quality Management System
1.23_611.56         ISO 14001:2004 Environmental Management System
1.24_776.69         SWISS
1.25_788.67         TIFI C A
1.26_789.80         T
1.27_757.198        X. Edelmann, President SQS T. Zahner, Managing 
Director SQS
1.28_66.194         Certificate
1.29_653.198        Swiss Association for Quality and 
1.30_665.198        Management Systems SQS
1.31_677.198        Bernstrasse 103, CH-3052 Zollikofen
1.32_689.198        Issue date: May 28, 2009
1.33_653.391        This SQS Certificate is valid up to 
1.34_665.391        and including May 27, 2012
1.35_677.391        Scope number 14
1.36_689.397        Registration number 10684
2.1_695.187         René Wasmer
2.2_710.175         President of IQNet
2.3_695.340         Theodor Zahner
2.4_710.321         Managing Director SQS
2.5_217.144         CERTIFICATE
2.6_234.249         IQNet and SQS
2.7_249.213         hereby certify that the organisation
2.8_315.264         Certified area
2.9_419.261         Field of activity
2.10_510.215        has implemented and maintains a
2.11_529.222        Management System
2.12_544.164        which fulfills the requirements of the following 
standard
2.13_573.149        ISO 9001:2008 / ISO 14001:2004
2.14_593.264        Scope No: 14
2.15_606.241        Issued on: 2009-05-28
2.16_619.237        Validity date: 2012-05-27
2.17_632.218        Registration Number: CH-10684
2.18_330.131        Georg Fischer Piping Systems Ltd., Schaffhausen
2.19_346.75         Georg Fischer Rohrleitungssysteme (Schweiz) AG, 
Sales Company
2.20_363.115        Georg Fischer Wavin Ltd., Schaffhausen and Subingen
2.21_380.115        Georg Fischer Building Technology Ltd., Schaffhausen
2.22_397.110        Georg Fischer Fluorpolymer Products GmbH, Ettenheim
2.23_277.132        Georg Fischer Piping Systems Ltd.
2.24_296.210        CH-8201 Schaffhausen
2.25_436.58         he Piping Systems Group develops, manufactures and 
distributes  plastic
2.26_453.56         or metal components and systems for conducting, 
pumping,  controlling,
2.27_469.68         measuring and regulating liquids and gases and to 
this end provides
2.28_486.135         comprehensive engineering and related services
2.29_733.269        IQNet Partners*:
2.30_746.63         AENOR Spain  AFNOR Certification France  
AIB-Vinçotte International Belgium  ANCE Mexico  APCER Portugal  CISQ 
Italy  CQC China  
2.31_756.86         CQM China  CQS Czech Republic  Cro Cert Croatia  DQS 
Holding GmbH Germany  DS Denmark  ELOT Greece  FCAV Brazil  
2.32_765.59         FONDONORMA Venezuela  HKQAA Hong Kong China  ICONTEC 
Colombia  IMNC Mexico  Inspecta Certification Finland  IRAM Argentina  
2.33_775.68         JQA Japan  KFQ Korea  MSZT Hungary  Nemko AS Norway  
NSAI Ireland  PCBC Poland  Quality Austria Austria  RR Russia  SII Israel  
2.34_784.64         SIQ Slovenia  SIRIM QAS International Malaysia  SQS 
Switzerland  SRAC Romania  TEST St Petersburg Russia  TSE Turkey  YUQS 
Serbia
2.35_797.134        IQNet is represented in the USA by: AFNOR 
Certification, CISQ, DQS Holding GmbH and NSAI Inc.

So you know the the PDFStreamEngine is working fine.

Cheers --- Iain


Georg Datterl wrote:
> Hello everybody,
>
> P0900842.pdf obvously has two pages, but PDDocument.load(pdfFile).getDocumentCatalog().getAllPages().size(); returns only one page. The two pages should be Objects 801 and 981. In PDFParser: 451 I can see Object 981 is parsed, but I can't see where it is connected to the parent element. According to Jeremias who had a quick look over it the parent element should be 797 which is not present in the PDF. Root->Pages->Kids contains only Object 801. 
>
> Could somebody on the list please have a look at the PDF and tell me, why it does not work as expected?
>
> Regards,
>  
> Georg Datterl
>  
> ------ Kontakt ------
>  
> Georg Datterl
>  
> Geneon media solutions gmbh
> Gutenstetter Straße 8a
> 90449 Nürnberg
>  
> HRB Nürnberg: 17193
> Geschäftsführer: Yong-Harry Steiert 
>
> Tel.: 0911/36 78 88 - 26
> Fax: 0911/36 78 88 - 20
>  
> www.geneon.de
>  
> Weitere Mitglieder der Willmy MediaGroup:
>  
> IRS Integrated Realization Services GmbH:    www.irs-nbg.de 
> Willmy PrintMedia GmbH:                            www.willmy.de
> Willmy Consult & Content GmbH:                 www.willmycc.de 
>   


AW: PDF without a page root?

Posted by Georg Datterl <ge...@geneon.de>.
Hi again, 

I'm sorry, I forgot attachments are not allowed. Here's a link to the pdf file: http://www.geneon.de/test/P0900842.pdf

Regards,
 
Georg Datterl
 
------ Kontakt ------
 
Georg Datterl
 
Geneon media solutions gmbh
Gutenstetter Straße 8a
90449 Nürnberg
 
HRB Nürnberg: 17193
Geschäftsführer: Yong-Harry Steiert 

Tel.: 0911/36 78 88 - 26
Fax: 0911/36 78 88 - 20
 
www.geneon.de
 
Weitere Mitglieder der Willmy MediaGroup:
 
IRS Integrated Realization Services GmbH:    www.irs-nbg.de 
Willmy PrintMedia GmbH:                            www.willmy.de
Willmy Consult & Content GmbH:                 www.willmycc.de 
-----Ursprüngliche Nachricht-----
Von: Georg Datterl [mailto:georg.datterl@geneon.de] 
Gesendet: Montag, 27. Juli 2009 17:24
An: pdfbox-users@incubator.apache.org
Betreff: PDF without a page root?

Hello everybody,

P0900842.pdf obvously has two pages, but PDDocument.load(pdfFile).getDocumentCatalog().getAllPages().size(); returns only one page. The two pages should be Objects 801 and 981. In PDFParser: 451 I can see Object 981 is parsed, but I can't see where it is connected to the parent element. According to Jeremias who had a quick look over it the parent element should be 797 which is not present in the PDF. Root->Pages->Kids contains only Object 801. 

Could somebody on the list please have a look at the PDF and tell me, why it does not work as expected?

Regards,
 
Georg Datterl
 
------ Kontakt ------
 
Georg Datterl
 
Geneon media solutions gmbh
Gutenstetter Straße 8a
90449 Nürnberg
 
HRB Nürnberg: 17193
Geschäftsführer: Yong-Harry Steiert 

Tel.: 0911/36 78 88 - 26
Fax: 0911/36 78 88 - 20
 
www.geneon.de
 
Weitere Mitglieder der Willmy MediaGroup:
 
IRS Integrated Realization Services GmbH:    www.irs-nbg.de 
Willmy PrintMedia GmbH:                            www.willmy.de
Willmy Consult & Content GmbH:                 www.willmycc.de