You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@pdfbox.apache.org by ju...@apache.org on 2008/03/08 15:24:46 UTC

svn commit: r634972 [3/5] - in /incubator/pdfbox/trunk/migration: ./ fontbox/ jempbox/ pdfbox/

Added: incubator/pdfbox/trunk/migration/pdfbox/features.xml
URL: http://svn.apache.org/viewvc/incubator/pdfbox/trunk/migration/pdfbox/features.xml?rev=634972&view=auto
==============================================================================
--- incubator/pdfbox/trunk/migration/pdfbox/features.xml (added)
+++ incubator/pdfbox/trunk/migration/pdfbox/features.xml Sat Mar  8 06:24:45 2008
@@ -0,0 +1,7017 @@
+<tracker version="1.0" xmlns:xsi="http://www.w3.org/2000/10/XMLSchema-instance" xsi:noNamespaceSchemaLocation="http://sourceforge.net/export/sf_tracker_export.xsd">
+	<artifact id="1878543">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2008-01-23 14:38</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Insert jbig2 image into PDF</summary>
+		<detail>Please add support for embedding a jbig2 image into a PDF file.</detail>
+	</artifact>
+	<artifact id="1748641">
+		<submitted_by>robertsearle</submitted_by>
+		<submitted_date>2007-07-05 10:13</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Inserting TIF / BMP / GIF into PDF</summary>
+		<detail>I would like three new classes---PDtif, PDbmp, and PDgif.  These classes should work like PDJpeg.  I really hope the constructor is the same (pdfDoc, inputStream).</detail>
+	</artifact>
+	<artifact id="1719860">
+		<submitted_by>ulifu</submitted_by>
+		<submitted_date>2007-05-16 01:43</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>add digital signature support</summary>
+		<detail>please add support to include a digital signature in a PDF file</detail>
+	</artifact>
+	<artifact id="1710857">
+		<submitted_by>cvonsee</submitted_by>
+		<submitted_date>2007-05-01 15:40</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Identify text rotation angle in TextPosition</summary>
+		<detail>Applications that use the TextPosition objects generated by PDFStreamEngine sometimes have to be aware of the text rotation angle so that they can handle text that is not horizontal.  For example, we process many PDFs that have vertically-oriented copyright info that appears either next to an image or at the right/left size of the page; this content doesn't extract correctly because it's vertical rather than horizontal.  Applications that attempt to join TextPosition objects have difficulty because they cannot currently distinguish between vertical and horizontal text.
+
+Thanks
+Chris von See
+TechAdapt, Inc.</detail>
+	</artifact>
+	<artifact id="1708294">
+		<submitted_by>rrufai</submitted_by>
+		<submitted_date>2007-04-26 11:13</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Updated PDFText2HTML</summary>
+		<detail>Hi Ben,
+
+I was wondering, are you accepting member to the project? 
+
+I'm using the PDFBox for importing PDF documents and would need more formatting information that is currently supported by PDFBox. The attached is what I've done so far: handles line breaks, bold, italics. Also added some comment delimiters for page boundaries.
+
+Two things I'll want to handle next are: 
+
+1. Underline
+2. Subscripts and superscripts
+
+Later on, I'll want to also handle the following:
+
+1. Images
+2. Hyperlinks
+3. Tables (I know this might be hard)
+
+I'll need all the help I can get in the form of pointers and clues.
+
+I look forward to reading from you soon.
+
+Many, many thanks for providing us with a great library.
+
+Regards,
+
+Raimi Rufai</detail>
+		<follow_ups>
+			<item>
+				<date>1177711497</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: YES
+
+File Added: TextPosition.java</text>
+			</item>
+			<item>
+				<date>1177711469</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: YES
+
+File Added: PDFStreamEngine.java</text>
+			</item>
+			<item>
+				<date>1177710704</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: YES
+
+File Added: PDFText2HTML.java</text>
+			</item>
+		</follow_ups>
+		<existingfiles>
+			<file>
+				<id>226946</id>
+				<name>TextPosition.java</name>
+				<description></description>
+				<filesize>6743</filesize>
+				<filetype>text/x-java-source</filetype>
+				<adddate>1177711497</adddate>
+				<submitted_by>1776491</submitted_by>
+			</file>
+			<file>
+				<id>226945</id>
+				<name>PDFStreamEngine.java</name>
+				<description></description>
+				<filesize>19797</filesize>
+				<filetype>text/x-java-source</filetype>
+				<adddate>1177711469</adddate>
+				<submitted_by>1776491</submitted_by>
+			</file>
+			<file>
+				<id>226943</id>
+				<name>PDFText2HTML.java</name>
+				<description></description>
+				<filesize>10342</filesize>
+				<filetype>text/x-java-source</filetype>
+				<adddate>1177710704</adddate>
+				<submitted_by>1776491</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>File Added</field>
+				<old_value>226946: TextPosition.java</old_value>
+				<date>1177711497</date>
+				<by>rrufai</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>226945: PDFStreamEngine.java</old_value>
+				<date>1177711469</date>
+				<by>rrufai</by>
+			</item>
+			<item>
+				<field>File Deleted</field>
+				<old_value>226760: </old_value>
+				<date>1177710704</date>
+				<by>rrufai</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>226943: PDFText2HTML.java</old_value>
+				<date>1177710704</date>
+				<by>rrufai</by>
+			</item>
+			<item>
+				<field>is_private</field>
+				<old_value>1</old_value>
+				<date>1177611319</date>
+				<by>rrufai</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>226760: PDFText2HTML.java</old_value>
+				<date>1177611228</date>
+				<by>rrufai</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1703715">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2007-04-19 08:14</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Import a part of a pdf page</summary>
+		<detail>It will be great to have the possibility to import a part of a pdf page defined by a rectangle.
+Everithing contained in this rectangle (text, immage...) can be copied elsewhere (same or other pdf).
+I found a such possibility in the commercial component pdfLib</detail>
+	</artifact>
+	<artifact id="1695597">
+		<submitted_by>matpil</submitted_by>
+		<submitted_date>2007-04-06 06:37</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>colorspace as an array entry</summary>
+		<detail>there is a problem when PDFStripper try to extract text from a pdf file.
+
+stacktrace:
+
+java.io.IOException: Unknown colorspace array type:COSName{DeviceRGB} 
+at org.pdfbox.pdmodel.graphics.color.PDColorSpaceFactory.createColorSpace(PDColorSpaceFactory.java:116) 
+at org.pdfbox.pdmodel.PDResources.getColorSpaces(PDResources.java:264) 
+at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:193) 
+at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174) 
+at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336) 
+at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259) 
+at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
+
+
+in attachment the pdf file that generate the error</detail>
+		<existingfiles>
+			<file>
+				<id>223969</id>
+				<name>errorColorspace.pdf</name>
+				<description></description>
+				<filesize>145133</filesize>
+				<filetype>application/pdf</filetype>
+				<adddate>1175866670</adddate>
+				<submitted_by>1549951</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>File Added</field>
+				<old_value>223969: errorColorspace.pdf</old_value>
+				<date>1175866670</date>
+				<by>matpil</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1695565">
+		<submitted_by>matpil</submitted_by>
+		<submitted_date>2007-04-06 05:30</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Problem during load pdf</summary>
+		<detail>There is a problem on load of some pdf file.
+
+this is a stacktrace of error:
+java.io.IOException: expected='/' actual='M'-77 org.pdfbox.io.PushBackInputStream@ebd7a9 
+at org.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:730) 
+at org.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:205) 
+at org.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:858) 
+at org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:448) 
+at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:176) 
+at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:703) 
+at org.pdfbox.pdmodel.PDDocument.load(PDDocument.java:687) 
+
+
+in attachment the "impeached" pdf
+I use PDFBox 0.7.4</detail>
+		<existingfiles>
+			<file>
+				<id>223962</id>
+				<name>errorLoad.pdf</name>
+				<description></description>
+				<filesize>31409</filesize>
+				<filetype>application/pdf</filetype>
+				<adddate>1175862644</adddate>
+				<submitted_by>1549951</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>File Added</field>
+				<old_value>223962: errorLoad.pdf</old_value>
+				<date>1175862644</date>
+				<by>matpil</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1685008">
+		<submitted_by>nunnu</submitted_by>
+		<submitted_date>2007-03-21 02:28</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="9">9</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Need support for PNG images</summary>
+		<detail>I have noticed 1277052, but I can't vote for it or change the priority, so I create a new feature request.
+
+I use PDBBox to extract images from PDF files, it works fine for jpg and tiff, but some customers of ours also need png.</detail>
+		<follow_ups>
+			<item>
+				<date>1180425914</date>
+				<sender>travisgu</sender>
+				<text>Logged In: YES 
+user_id=1743013
+Originator: NO
+
+Seems some types of PNG files are supported. Somes are not.</text>
+			</item>
+			<item>
+				<date>1174478972</date>
+				<sender>nunnu</sender>
+				<text>Logged In: YES 
+user_id=1726493
+Originator: YES
+
+Urgent :-)</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1174469319</date>
+				<by>nunnu</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1602056">
+		<submitted_by>jesusaplsoft</submitted_by>
+		<submitted_date>2006-11-23 16:45</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>bad print</summary>
+		<detail>As I print the attached pdf file I get printed a black spot where images are: upper left and right corners.</detail>
+		<existingfiles>
+			<file>
+				<id>204202</id>
+				<name>ALBARÁN.pdf</name>
+				<description>form background</description>
+				<filesize>46965</filesize>
+				<filetype>application/pdf</filetype>
+				<adddate>1164329116</adddate>
+				<submitted_by>821972</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1171404793</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>204202: ALBARÁN.pdf</old_value>
+				<date>1164329116</date>
+				<by>jesusaplsoft</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1657698">
+		<submitted_by>rodricksin</submitted_by>
+		<submitted_date>2007-02-11 20:44</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="9">9</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Pdf Printing with/without Annotations</summary>
+		<detail>Hi all... can i print a PDF File with/without Annotations
+
+I need to give a print to printer by selecting with/ without annotations...
+
+Please give some idea or code..... I want to implement in java...
+
+Thanks in advance...
+
+-Rodricks
+
+</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1171255795</date>
+				<by>rodricksin</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1171255795</date>
+				<by>rodricksin</by>
+			</item>
+			<item>
+				<field>category_id</field>
+				<old_value>100</old_value>
+				<date>1171255795</date>
+				<by>rodricksin</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1601785">
+		<submitted_by>isai</submitted_by>
+		<submitted_date>2006-11-23 06:24</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="9">9</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>xml extraction like in adobe professional</summary>
+		<detail>Adobe professional has save pdf as xml plug-in, the extraction can map tables in the pdfs, but this plug-in can olnly be invike since adobe.
+
+with this type of parse can be parse to enyting you wan.</detail>
+		<existingfiles>
+			<file>
+				<id>204148</id>
+				<name>IMSSComproBienesNoTerapeuticosVRS.rar</name>
+				<description>orginal pdf</description>
+				<filesize>244671</filesize>
+				<filetype>application/octet-stream</filetype>
+				<adddate>1164292382</adddate>
+				<submitted_by>1652385</submitted_by>
+			</file>
+			<file>
+				<id>204147</id>
+				<name>IMSSComproBienesNoTerapeuticosVRS.xml</name>
+				<description>example of save as xml in adobe</description>
+				<filesize>203738</filesize>
+				<filetype>text/xml</filetype>
+				<adddate>1164291851</adddate>
+				<submitted_by>1652385</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1164292435</date>
+				<by>isai</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>204148: IMSSComproBienesNoTerapeuticosVRS.rar</old_value>
+				<date>1164292382</date>
+				<by>isai</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>204147: IMSSComproBienesNoTerapeuticosVRS.xml</old_value>
+				<date>1164291851</date>
+				<by>isai</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1597881">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2006-11-16 09:31</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Sign .NET DLLs for security</summary>
+		<detail>I`m using your tool at work. It is part of a bigger software solution and it works fine!
+
+Now we have a problem concerning your extracttext.exe – it cannot run from a network drive
+
+because of .net security policy. I wrote a tool that allows application to start from network drives, but
+
+only with one preliminary : it must be strong named.
+
+Could you sign your extracttext.exe and the dll`s used by it ?
+
+You really would save me a lot of work, otherwise I have to replace your tool and so on…</detail>
+		<follow_ups>
+			<item>
+				<date>1180560737</date>
+				<sender>danielwilson</sender>
+				<text>Logged In: YES 
+user_id=1737686
+Originator: NO
+
+I have added a -keyfile line in my build.xml:
+
+&lt;echo&gt;Building PDFBox&lt;/echo&gt; 
+&lt;exec executable="${ikvmc}"&gt; 
+&lt;arg value="-reference:${ikvm.dir}/bin/IKVM.GNU.Classpath.dll" /&gt; 
+&lt;arg value="-reference:${ikvm.dir}/bin/IKVM.AWT.WinForms.dll" /&gt; 
+&lt;arg value="-reference:bin/${fontbox.name}.dll" /&gt; 
+&lt;arg value="-reference:bin/${lucene.name}.dll" /&gt; 
+&lt;arg value="-reference:bin/${lucene-demos.name}.dll" /&gt; 
+&lt;arg value="-reference:bin/${bcprov.name}.dll" /&gt; 
+&lt;arg value="-reference:bin/${bcmail.name}.dll" /&gt; 
+&lt;arg value="-target:library" /&gt; 
+&lt;arg value="-compressresources" /&gt; 
+&lt;arg value="-out:bin\${project.name}.dll" /&gt; 
+&lt;arg value="-keyfile:\PerforceCode\StitchViewer\EmsVwrCtl\Viewer.snk" /&gt; 
+&lt;arg value="lib\${project.name}.jar" /&gt; 
+&lt;/exec&gt; 
+
+Your path &amp; filename will, of course, vary.
+
+The same -keyfile line should work when compiling the executable.</text>
+			</item>
+			<item>
+				<date>1179320705</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+Originator: YES
+
+Today I found this post; need to try this and verify that this will solve this issue.  If anyone could help verify this that would be great.
+Ben
+
+http://forums.microsoft.com/MSDN/ShowPost.aspx?PostID=222499&amp;SiteID=1
+
+
+
+Hey all,
+
+I had a problem, where i strong named my project and then i got the error: Cannot emit assembly as referenced assembly is not strong named.
+
+My referenced assembly is created by IKVM tool compiling Java bytecode into .NET MSIL code. I have got one solution to strong name this dll:
+
+1. Create Messages.il using ildasm Messaged.dll /out:Messages.il
+2. Use the same key pair (used to strong name the project) to create back the Messages.dll: ilasm Messages.il /dll key="D:\sn.key"
+
+This worked for me, is there any better solution out there that i have missed out.
+
+Also, the relative path isn't working for me.
+I am using this: &lt;Assembly: AssemblyKeyFile("D:\sn.key")&gt;  instead of
+
+&lt;Assembly: AssemblyKeyFile("..\\..\\sn.key")&gt; 
+
+Thanks for any help
+Vinay Kant
+
+
+Hi Vinay,
+
+The ildasm / ilasm round trip sounds like your best bet here.  The AssemblyKeyFile path should be the relative path from the Visual Studio output location.
+
+-Shawn
+</text>
+			</item>
+		</follow_ups>
+	</artifact>
+	<artifact id="1589018">
+		<submitted_by>cetinsert</submitted_by>
+		<submitted_date>2006-11-01 17:50</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Text Extraction with Formatting</summary>
+		<detail>Is it possible to extract text from a PDF without
+ignoring the formatting?
+
+HTML tags might be used for example. I thought the
+PDFText2Html class would do the trick but it does not.
+Thank you for reading.</detail>
+		<follow_ups>
+			<item>
+				<date>1177634119</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: NO
+
+It's sent.</text>
+			</item>
+			<item>
+				<date>1177617028</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: NO
+
+What email address should I send it to? 
+
+</text>
+			</item>
+			<item>
+				<date>1177614928</date>
+				<sender>cetinsert</sender>
+				<text>Logged In: YES 
+user_id=1562185
+Originator: YES
+
+@ rruffai
+
+&gt; You might send a compiled 32-bit windows or linux binary personally to me.
+&gt; (I'm a user of pdftohtml.)
+
+I messed things up. This was also PDFBox. Hehe, sorry.</text>
+			</item>
+			<item>
+				<date>1177614254</date>
+				<sender>cetinsert</sender>
+				<text>Logged In: YES 
+user_id=1562185
+Originator: YES
+
+@ rrufai
+what is the trouble you have with handling underlines?
+
+You might send a compiled 32-bit windows or linux binary personally to me. (I'm a user of pdftohtml.)</text>
+			</item>
+			<item>
+				<date>1177605415</date>
+				<sender>rrufai</sender>
+				<text>Logged In: YES 
+user_id=1776491
+Originator: NO
+
+Hi Ben,
+&lt;p&gt;
+I've extended PDFText2Html to handle bold, new lines (with &amp;lt;br&amp;gt; tags). However, I'm having trouble figuring out how to handle underlines.
+&lt;/p&gt;
+
+&lt;p&gt;
+Also, I don't know how to post updates. 
+&lt;/p&gt;
+
+Regards,
+
+Raimi
+
+</text>
+			</item>
+			<item>
+				<date>1162437027</date>
+				<sender>cetinsert</sender>
+				<text>Logged In: YES 
+user_id=1562185
+
+Uhmm... well bold, italic, underlined etc... would be a good
+beginning but my ultimate wish would be something like
+quoted below:
+
+&lt;?xml version="1.0" encoding="ISO-8859-1"?&gt;
+&lt;!DOCTYPE pdf2xml SYSTEM "pdf2xml.dtd"&gt;
+
+&lt;pdf2xml&gt;
+&lt;page number="1" position="absolute" top="0" left="0"
+height="1262" width="892"&gt;
+	&lt;fontspec id="0" size="16" family="Times" color="#000000"/&gt;
+	&lt;fontspec id="1" size="16" family="Times" color="#000000"/&gt;
+	&lt;fontspec id="2" size="16" family="Times" color="#000000"/&gt;
+&lt;text top="110" left="106" width="137" height="18"
+font="0"&gt;&lt;i&gt;She &lt;/i&gt;told &lt;b&gt;me&lt;/b&gt;. äµß &lt;/text&gt;
+&lt;/page&gt;
+&lt;/pdf2xml&gt;
+
+I think I have made a mistake by naming it "Text Extraction
+with Formatting"... I should have put my question under a
+more fitting title, something like "PDF to (HTML/)XML
+Conversion with formatting".
+
+Thank you very much for your prompt replies. ^_^</text>
+			</item>
+			<item>
+				<date>1162436166</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Specifically are you looking only for bold &amp; italic or other things?</text>
+			</item>
+			<item>
+				<date>1162435647</date>
+				<sender>cetinsert</sender>
+				<text>Logged In: YES 
+user_id=1562185
+
+That's exactly what I am looking for. But is this not a
+priority issue for the PDFBox package? It would take me
+quite a time to extend the stripper on my own. One of the
+PDFBox developers might do it better I think.
+
+If you insist that it's a user's issue and PDFBox developers
+would not invest their time in such an extension, could you
+at least tell me whether you have any links to any
+information regarding this matter?</text>
+			</item>
+			<item>
+				<date>1162434962</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+HTML tags are not used to format a PDF document.  Font information is available but can be tricky to get what you 
+want.  You will need to extend PDFTextStripper and override writeCharacters to get formatting such as bold/italic.  
+Is that what you are looking for?
+
+Ben</text>
+			</item>
+		</follow_ups>
+	</artifact>
+	<artifact id="1576966">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2006-10-13 17:18</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Better metadata in conversion to HTML</summary>
+		<detail>It would be great to have better support for metadata 
+in conversion to HTML.
+
+- Being able to create a HTML page with the proper 
+document title in (not one simply guessed from the 
+text of the document).
+
+- Author, keywords, category etc. extracted from the 
+document and placed into metafields in the HTML
+
+- Chosen encoding included in the HTML header.
+
+I am using PDFbox in conjunction with mnoGoSearch to 
+index PDFs on a site. This additional metadata would 
+be extremely handy, since it would form a part of the 
+indexed details for the documents.
+
+Even if a simple tool could be created that would 
+*just* extract the metadata from a document [into 
+some kind of text format], that would be great. 
+External tools could then be built around that, e.g. 
+a templating tool that could create a final format of 
+any form, using the extracted text and the extracted 
+metadata.</detail>
+		<follow_ups>
+			<item>
+				<date>1160785210</date>
+				<sender>nobody</sender>
+				<text>Logged In: NO 
+
+BTW I've not used Java before, so don't have any code to 
+contribute, but if I do come up with anything, I'll post 
+it here.
+
+-- Jason
+(sorry - mislaid my login too)</text>
+			</item>
+		</follow_ups>
+	</artifact>
+	<artifact id="1546399">
+		<submitted_by>cvonsee</submitted_by>
+		<submitted_date>2006-08-24 20:01</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Use get/set functions for separators in PDFTextStripper</summary>
+		<detail>Instead of directly using the constants that represent
+the page, line and word separators in PDFTextStripper:
+
+    private String lineSeparator =
+System.getProperty("line.separator");
+    private String pageSeparator =
+System.getProperty("line.separator");
+    private String wordSeparator = " ";
+
+call the getLineSeparator(), getPageSeparator() and
+getWordSeparator() functions so that if these are
+overridden in subclasses that the base PDFTextStripper
+logic will pick up the overridden methods.
+
+Thanks for a great tool!
+
+
+Cheers
+Chris</detail>
+		<follow_ups>
+			<item>
+				<date>1158801052</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+just added to CVS, check it out.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1158801052</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1158801052</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="826042">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-10-18 09:57</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>support inherited attributes at the page level</summary>
+		<detail>The page object has several inherited attributes( 
+MediaBox,CropBox...), the pdmodel needs to expose 
+these attributes and support the the inherited 
+structure.  
+
+This request comes from bug 823216.</detail>
+		<follow_ups>
+			<item>
+				<date>1066701088</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+now available in CVS.</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1066701088</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1066701088</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="815957">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2003-10-01 09:04</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>PDFTextStripper</summary>
+		<detail>Would it be possible to add a page seperator string?</detail>
+		<follow_ups>
+			<item>
+				<date>1066430004</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+see PDFTextStripper.setPageSeparator in CVS</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1066430004</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1066430004</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>category_id</field>
+				<old_value>519373</old_value>
+				<date>1065277982</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1469124">
+		<submitted_by>govardhana</submitted_by>
+		<submitted_date>2006-04-12 02:20</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>PS TO PDF CONVERSION</summary>
+		<detail>Hi,
+
+There's no postscript to pdf file converter in the 
+open source which is built using Java or .Net. Only 
+one thing that is present is GhostScript which is 
+developed using C++. Can it be possible to use PDFBox 
+to build such a project which converts the PostScript 
+files to PDF format.
+
+Thank You
+
+Regards,
+Govardhana</detail>
+		<follow_ups>
+			<item>
+				<date>1158839966</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Hi Govardhana,
+
+As I mentioned in my earlier comment, this is certainly 
+possible but will be a significant amount of work.
+
+To do this you'll need to be familiar with the internals of 
+both PS and PDF, so review these two documents
+
+Postscript Reference
+http://www.adobe.com/products/postscript/pdfs/PLRM.pdf
+
+PDF Reference
+http://partners.adobe.com/public/developer/pdf/index_referen
+ce.html
+
+Then just get started,
+Ben
+
+"To change the world
+Start with one step.
+However small,
+The first step is hardest of all." -DMB</text>
+			</item>
+			<item>
+				<date>1158830501</date>
+				<sender>govardhana</sender>
+				<text>Logged In: YES 
+user_id=1452645
+
+Hi,
+
+I would be very much interested in doing so. But i am a 
+begginer and i would like to know all the information 
+which would be helpful in achieveing this kind of feature. 
+I am waiting for ur response</text>
+			</item>
+			<item>
+				<date>1154393353</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+FYI, this is possible but PDFBox is far away from being 
+able to support this.  This is a great feature and will 
+stay as a request but if you want to see this in the near 
+future then I'll need some implementation help.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1145294729</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1047923">
+		<submitted_by>thansson</submitted_by>
+		<submitted_date>2004-10-15 10:34</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Import/Export of XML Data Package files (XDP)   </summary>
+		<detail>Please, add support for import and export of XDP forms 
+data. Attached is a sample PDF form and the exported 
+XDP file. The PDF file was created using Adobe Designer 
+6.0
+
+--
+Tomas
+</detail>
+		<follow_ups>
+			<item>
+				<date>1106197893</date>
+				<sender>gnatware</sender>
+				<text>Logged In: YES 
+user_id=73363
+
+I just started playing with Adobe Designer (60-day trial). 
+What we need (but will probably run up against Adobe patents
+trying to implement) is an XDP processor that does some or
+all of what Adobe Form Server (or LiveCycle Forms) does:
+take the form specified in the XDP file and generate client
+side HTML/Javascript via servlets and JSPs.  The
+HTML/Javascript (I'm guessing, since I don't have $30000
+required to license Form Server) will include all the
+validation provided in the PDF form (through Adobe Reader),
+and the backend servlet will also be responsible for
+connecting to data sources (ODBC, XML w/ schema, etc),
+posting the updated data, providing pre-filled PDF and or
+XML versions of the data to be downloaded, emailed, etc.
+
+I suppose we could start by transforming the XDP using XSLT
+to generate simple text, checkbox, radio, select and submit
+inputs...</text>
+			</item>
+		</follow_ups>
+		<existingfiles>
+			<file>
+				<id>105253</id>
+				<name>Forms.zip</name>
+				<description></description>
+				<filesize>111975</filesize>
+				<filetype></filetype>
+				<adddate>1097861665</adddate>
+				<submitted_by>197342</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1138844443</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>105253: Forms.zip</old_value>
+				<date>1097861666</date>
+				<by>thansson</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1050558">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2004-10-20 01:44</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>extract the text of certain page at certain line</summary>
+		<detail>sometimes,it is unneeded to extraction all the text in a 
+pdf file :-)</detail>
+	</artifact>
+	<artifact id="1333383">
+		<submitted_by>cannonbeach</submitted_by>
+		<submitted_date>2005-10-20 09:20</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary> In memory COSDocument  </summary>
+		<detail>This is an RFE submitted as a result of a post by user
+cannonbeach on 2005-19 called 'In memory COSDocument'.
+
+It involves the creation of an interface with two
+implementations, one for RAF and one for in memory.
+
+Here is Ben's response to the post:
+
+Yes, the correct implementation is to create an
+interface with two implementations, one for RAF and one
+for in memory. 
+ 
+This should be a straightforward implementation, can
+one of you create an RFE for it and I will get it in. 
+ 
+An interesting third implementation might be one that
+starts with in memory and then switches over to RAF
+after some threshold. I'd be curious if there were any
+performance gains. </detail>
+		<follow_ups>
+			<item>
+				<date>1138475446</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Sorry for the delay in doing this, but it is implemented 
+and will be available in tonights build.
+
+See org.pdfbox.pdfparserPDFParser#PDFParser( InputStream, 
+RandomAccess )
+and
+org.pdfbox.io.RandomAccessBuffer
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1138475446</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1138475446</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1129843446</date>
+				<by>cannonbeach</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1023133">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-09-06 08:24</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="1">1</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Support PDF Functions</summary>
+		<detail>Add support for PDF functions.  Fix 
+pdmodel.graphics.color.PDSeparation.getTintTransform
+
+Ben</detail>
+		<follow_ups>
+			<item>
+				<date>1143078600</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+implemented in CVS!
+
+Woo hoo</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1143078600</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1143078600</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1096504960</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="719773">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-04-11 09:21</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Output extracted text to console</summary>
+		<detail>I want to see the output from PDFBox on shell (console) 
+rather than dumping it to a text file. This is similar to 
+what XPDF does when you give - as an
+argument at shell instead of text file name.
+
+Maybe have the same command line arguments as 
+xpdf, but I am not sure.</detail>
+		<follow_ups>
+			<item>
+				<date>1063469848</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+org.pdfbox.Main now has a &amp;quot;-console&amp;quot; option.</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1063469848</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1063469848</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="729563">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-04-29 06:42</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Split PDF's</summary>
+		<detail>Allow for the ability to split up a pdf.  Options might 
+include splitting every x number of pages.</detail>
+		<follow_ups>
+			<item>
+				<date>1090415746</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Released in version 0.6.6</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1090415746</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1090415746</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="912459">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-03-08 19:25</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Set label on PDPushButton</summary>
+		<detail>Allow for ability to change the label on a push button.
+</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978841</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1045899">
+		<submitted_by>caisd</submitted_by>
+		<submitted_date>2004-10-12 22:24</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Add new form fields to an existing pdf file</summary>
+		<detail>To add some new form fields into an existing pdf file.
+
+There is a shareware named bfop(The Big Faceless PDF
+Library) can do it in an easy way, for example: 
+1. PDF pdf = new PDF(new PDFReader(new
+FileInputStream("example.pdf")));
+2.PDFPage page = pdf.getLastPage();
+3. Form form = pdf.getForm();
+4. FormText address = new FormText(page, 400, 660, 550,
+720);
+5. form.addElement("address", address);
+6. pdf.render(new FileOutputStream("FormCreation.pdf"));
+
+</detail>
+		<change_log>
+			<item>
+				<field>category_id</field>
+				<old_value>100</old_value>
+				<date>1097646279</date>
+				<by>caisd</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1097646279</date>
+				<by>caisd</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1037395">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-09-29 17:21</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="1">1</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>support interactive actions</summary>
+		<detail>See PDF Reference 1.5 section 8.5 about actions that 
+exist.  Package has been created but need to implement.</detail>
+	</artifact>
+	<artifact id="1309441">
+		<submitted_by>cambalud2</submitted_by>
+		<submitted_date>2005-09-30 05:29</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Print PDF</summary>
+		<detail>Hi Ben,
+
+I send you two PDF documents. 
+
+First-(V2135-2.pdf) Where some parts of the code are 
+printed badly. 
+
+Second-(1_1.pdf) This pdf, I cant not print it. 
+
+Best regards,
+Nayra.</detail>
+		<existingfiles>
+			<file>
+				<id>150907</id>
+				<name>V2135-2.pdf</name>
+				<description></description>
+				<filesize>6084</filesize>
+				<filetype>application/pdf</filetype>
+				<adddate>1128083365</adddate>
+				<submitted_by>1353091</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1128185598</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>150907: V2135-2.pdf</old_value>
+				<date>1128083366</date>
+				<by>cambalud2</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="852035">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-12-01 04:25</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Nobody/Anonymous</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Encrypt PDF/Decrypt PDF</summary>
+		<detail>Add high level API for setting security settings, provide 
+command line utility for encrypting/decrypting a PDF</detail>
+		<follow_ups>
+			<item>
+				<date>1091335007</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Created command line utilities org.pdfbox.Encrypt and 
+org.pdfbox.Decrypt </text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1091335007</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1091335006</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1021012">
+		<submitted_by>jkronegg</submitted_by>
+		<submitted_date>2004-09-02 02:00</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Provide nighlty builds</summary>
+		<detail>Hi ben,
+
+For people that doesn't rebuild PDFBox from the CVS
+source but that can wait until the next release, it
+will be convenient to have access to nightly builds.
+
+Is it possible to have access to such builds ?
+
+
+Thanks
+
+
+Julien</detail>
+		<follow_ups>
+			<item>
+				<date>1094395623</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+builds are available off of
+
+http://www.csh.rit.edu/~ben/projects/pdfbox/nightly-release/
+
+also link to from the http://www.pdfbox.org main page.
+
+</text>
+			</item>
+			<item>
+				<date>1094131593</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+This was actually on my todo list, I will do it sooner rather 
+than later now.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1094395623</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1094395623</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1094131593</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1000109">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-07-29 06:27</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Add ability to digitally sign a PDF</summary>
+		<detail>
+Implementation Notes:
+What about the info on this site:
+http://www.codeproject.com/useritems/PdfDigiPad.asp
+
+Adobe PDF Public-Key Digital Signature and Encryption 
+Specification
+http://partners.adobe.com/asn/developer/pdfs/tn/ppk_pd
+fspec.pdf
+
+Adobe Acrobat Digital Signature API Reference
+http://partners.adobe.com/asn/acrobat/docs/digsig.pdf
+
+http://www.mail-archive.com/itext-
+questions@lists.sourceforge.net/msg11084.html
+
+http://groups.google.de/groups?
+q=sign+openssl+group:comp.text.pdf&amp;hl=de&amp;lr=lang_de|l
+ang_en&amp;ie=UTF-
+8&amp;group=comp.text.pdf&amp;selm=f55510dc.0403111256.f0a6
+513%40posting.google.com&amp;rnum=1
+
+</detail>
+		<follow_ups>
+			<item>
+				<date>1111115159</date>
+				<sender>notessensei</sender>
+				<text>Logged In: YES 
+user_id=675521
+
+I second the request</text>
+			</item>
+			<item>
+				<date>1096242484</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+The org.pdfbox.pdmodel.interactive.digitalsignature package 
+has been created but needs to be implemented.
+
+Ben</text>
+			</item>
+			<item>
+				<date>1093977957</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+java example of signing a pdf using an x509 certificate from a 
+file or a X506Certificate instance </text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978840</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1010721">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-08-17 06:24</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Add ability to extract comments</summary>
+		<detail>Create command line app to extract comments from a 
+document.
+
+Ben</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978838</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1016101">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-08-25 09:16</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Replace existing font with new font</summary>
+		<detail>Create a class, maybe an example that allows a user to 
+replace a font with a font in the filesystem or an existing 
+font in the PDF.  This is useful when a collection of PDF 
+documents have been appended and contain many 
+embedded fonts like 
+'BDED50A3 Times New Roman'
+'GH34DFH Times New Roman'
+
+In order to reduce file size it would be nice to be able to 
+reduce these to one embedded font.
+
+Where the font is embedded multiple times but is really 
+the same font over and over.
+
+Ben</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978837</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="804468">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-09-11 07:38</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="9">9</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>FDF export</summary>
+		<detail>Add ability to create a .fdf file from a PDF file.  Maybe 
+give the ability to do batch exporting.</detail>
+		<follow_ups>
+			<item>
+				<date>1096855184</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Single export now available in CVS.
+
+Ben</text>
+			</item>
+			<item>
+				<date>1096508630</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+The export really should be 
+ExportFormsToFDF
+ExportFormsToXFDF
+ExportCommentsToFDF
+ExportCommentsToXFDF</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1096855184</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1096855184</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>7</old_value>
+				<date>1096504895</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1096051263</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978845</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="939868">
+		<submitted_by>ramyamd</submitted_by>
+		<submitted_date>2004-04-22 01:19</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>text from box</summary>
+		<detail>I am attaching a file with this message. 
+
+problem:
+
+The text from the rectangles are not read sequentially 
+i.e not extract from a single rectangle at a time. it is 
+extracting randomly from different rectangles. I want to 
+get the text rectangle wise.
+
+
+for example - PDF page no - 89
+
+The text is to be extracted in this way
+
+A
+767
+FAULT ISOLATION/MAINT MANUAL
+
+PASSENGER ADDRESS
+AMPLIFIER BITE
+PROCEDURE
+
+PREREQUISITES
+MAKE SURE THIS CIRCUIT BREAKER IS CLOSED:
+11C22
+MAKE SURE THE AIRPLANE IS IN THIS CONFIGURATION:
+  ELECTRICAL POWER IS ON (AMM 24-22-00/201)
+
+1   SET THE FUNCTION SELECTOR NO
+SWITCH TO THE "LEVEL" POSITION
+ON THE PA AMPLIFIER FRONT
+PANEL AT E2-5.
+    DOES THE PA AMPLIFIER
+FRONT PANEL SHOW 69 TO 71
+VRMS?
+
+10  ADJUST THE "MASTER GAIN"
+FOR 69 TO 71 VRMS.
+    DOES THE PA AMPLIFIER
+FRONT PANEL SHOW 69 TO 71
+VRMS?
+YES
+NO
+ 20  REPLACE THE PA AMPLIFIER,
+M177 (AMM 23-31-01/401).
+YES
+2   SET THE FUNCTION SELECTOR
+SWITCH TO THE "LOAD" POSITION.
+    DOES THE PA AMPLIFIER NO
+FRONT PANEL SHOW 30 OHMS OR
+MORE?
+YES
+21  EXAMINE THE SPEAKER WIRING
+FOR SHORT CIRCUITS FROM
+PIN A13 TO B13 (IF USED) AND
+PIN A15 TO B15, OF CONNECTOR
+D455B, AT E2-5 (WDM 23-31-14
+THRU 23-31-17).
+    REPAIR THE PROBLEMS THAT
+YOU FIND.
+
+3   SET THE FUNCTION SELECTOR NO
+SWITCH TO THE "TONE" POSITION.
+    DO YOU HEAR SOUND FROM ALL
+THE PA SPEAKERS?
+YES
+4   SET THE FUNCTION SELECTOR
+SWITCH TO THE "OPERATE" POSI-
+TION.
+    THE SYSTEM IS OK.
+11  DO YOU HEAR NO SOUND AT
+ONE OF THE SPEAKERS?
+NO
+12  DO YOU HEAR NO SOUND FROM
+ALL OF THE SPEAKERS?
+NO
+YES
+YES
+22  REPLACE THE BAD SPEAKER.
+REFER TO TABLE 101.
+23  REPLACE THE PA AMPLIFIER,
+M177 (AMM 23-31-01/401).
+24  EXAMINE THE SPEAKER WIRING
+FOR OPEN CIRCUITS FROM A
+SPEAKER WITH THE SOUND TO A
+SPEAKER WITHOUT (WDM 23-31-14
+THRU 23-31-17).
+    REPAIR THE PROBLEMS THAT
+YOU FIND.
+
+NOTE:
+BITE DOES A TEST OF THESE SYSTEM COMPONENTS:
+  PA AMPLIFIER 
+  SPEAKERS
+  SPEAKER WIRING. 
+BITE DOES NOT DO A TEST OF THESE SYSTEM 
+COMPONENTS:
+  AUDIO ACCESSORY UNIT 
+  ZONE MULTIPLEXER. 
+SPEAKER
+LOCATION
+PSU
+GALLEY
+LAVATORY
+CEILING
+AMM
+REFERENCE
+23-31-02/401
+23-31-04/401
+23-31-05/401
+23-31-08/401
+TABLE 101
+Passenger Address Amplifier BITE Procedure
+Figure 103
+       
+
+its just the outline of how i need the information. The 
+text should  be read completely from one rectanble and 
+then switched to next rectangle etc.,
+
+Similar pages are also in this PDF document. pls test it 
+with that also.
+
+it will be useful for me if u give the details of the images 
+in this file(how it is stored and which format)
+
+pls give importance to this message.
+Thanks in advance. Waiting for ur reply.
+
+</detail>
+		<existingfiles>
+			<file>
+				<id>84628</id>
+				<name>task23.zip</name>
+				<description></description>
+				<filesize>138214</filesize>
+				<filetype></filetype>
+				<adddate>1082623690</adddate>
+				<submitted_by>1005150</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978840</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>84628: task23.zip</old_value>
+				<date>1082623690</date>
+				<by>ramyamd</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="931628">
+		<submitted_by>vadimbit</submitted_by>
+		<submitted_date>2004-04-08 02:45</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="293150"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>CID to Unicode mapping</summary>
+		<detail>For extracting CJK text it would be usefull to map CID-
+keyed cheracters to Unicode. 
+
+For example, "90ms-RKSJ-UCS2" cmap file can be use 
+for retrieving unicodes for "90ms-RKSJ-H" and "90ms-
+RKSJ-V" encoding of CID-fonts.
+
+Now CMapParser parse "bfrange" and "bfchar". If is 
+enough for parsing ToUnicode CMap files. 
+
+So, as I understand, "encoding name to ToUnicode 
+CMap file name" mapping is needed only.</detail>
+		<follow_ups>
+			<item>
+				<date>1081498403</date>
+				<sender>vadimbit</sender>
+				<text>Logged In: YES 
+user_id=958555
+
+Some additional information...
+
+There is better way to retrieve unicode symbol:
+
+We should get CID, using natural CMap file and map it to 
+Unicode, using appropriate Uni* CMap file (backward 
+mapping)...
+
+So, "cidrange" and "cidchar" should be parsered too...</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978841</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="841169">
+		<submitted_by>nobody</submitted_by>
+		<submitted_date>2003-11-12 20:03</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="293150"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>HTML -&amp;gt; PDF</summary>
+		<detail>It would be really nice to take a html and create a PDF 
+from it.</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978842</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1000668">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-07-30 05:22</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="1">1</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Add ability to set document information using PDFViewer</summary>
+		<detail>Enhance PDFViewer to show a dialog with the document 
+information, allow the user to edit this information and 
+save it back to the file system.  
+
+Also give the ability to change a set of PDF documents, 
+such as a subdirectory or something.
+
+</detail>
+		<change_log>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1096504786</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978840</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="830521">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-10-26 07:13</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>alternate header form</summary>
+		<detail>Support PDF Reference 1.5 Impl Note 14 
+
+14. Acrobat viewers will also accept a header of the form
+%!PS&amp;amp;#8722;Adobe&amp;amp;#8722;N.n PDF&amp;amp;#8722;M.m</detail>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978843</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="794289">
+		<submitted_by>elfring</submitted_by>
+		<submitted_date>2003-08-24 12:50</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Notification about output progress</summary>
+		<detail>Does your programming interface provide hooks or callback 
+services to notify an application or an other service about the PDF 
+file creation progress?
+I would like that a function or an object 
+gets automatically informed if a new page will be started or 
+finished. An user interface can display this change in a progress 
+dialog.</detail>
+		<follow_ups>
+			<item>
+				<date>1107400351</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+output progress of text extraction can be achieved by 
+extending PDFTextStripper, PDF creation progress depends on 
+how PDFBox is being used, so the client should track that.
+
+Ben</text>
+			</item>
+			<item>
+				<date>1063626789</date>
+				<sender>elfring</sender>
+				<text>Logged In: YES 
+user_id=572001
+
+Does your output process offer page events during the PDF 
+creation?</text>
+			</item>
+			<item>
+				<date>1063470482</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Do you mean text extraction or PDF creation?  </text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1107400353</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1107400349</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978845</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="838444">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-11-08 07:34</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Color Space support</summary>
+		<detail>Handle ColorSpaces including ICC et all.</detail>
+		<follow_ups>
+			<item>
+				<date>1096051321</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+ColorSpaces are not supported in the CVS version of PDFBox.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1096051321</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1096051321</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978842</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="805623">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-09-13 07:38</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>extract information from tagged PDF</summary>
+		<detail>Add the ability to extract information from a tagged PDF 
+document.  See taggedPDF.pdf for an example.</detail>
+		<follow_ups>
+			<item>
+				<date>1146050898</date>
+				<sender>qumar</sender>
+				<text>Logged In: YES 
+user_id=1468838
+
+Hi,
+we have to parse the PDF object structure tree; all
+structural elements are inside the object tree (see e.g.
+PDFReference 1.4 chapter 9.6 "Logical Structure").
+- parse the PDF page streams to extract drawing and text
+operations;these contain the actual content of the
+structural elements. This content is surrounded by BMC/EMC
+tags which contain information to which element object the
+contained content belongs.This is what i got from pdf reference.
+
+Regards,
+Qumar.
+</text>
+			</item>
+			<item>
+				<date>1145496779</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+http://www.irs.gov/pub/irs-access/f1040ez_accessible.pdf
+would be a good form to start with.
+
+If you notice they are putting labels on the form fields.  
+these labels contain meta data critical to building tax 
+software in rapid fashion.  Without this meta data, the 
+name of the form field is meaningless. It would be nice to 
+extract this information so I can combine it with other 
+data about the field (name, type, location, etc).  I 
+already know PDFBox can extract the other information about 
+the fields.  I haven't done it with PDFBox, but I did it 
+with iText.</text>
+			</item>
+			<item>
+				<date>1143812760</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+More comments from users
+
+Tagged PDF will be a big thing in government because 
+federal government procurement of Acrobat publishing 
+technology falls under Section 508.  States will likely 
+follow.
+
+ see:
+www.section508.gov
+
+http://www.irs.gov/pub/irs-access/
+or
+ftp://ftp.irs.gov/pub/irs-access/
+</text>
+			</item>
+			<item>
+				<date>1142420081</date>
+				<sender>qumar</sender>
+				<text>Logged In: YES 
+user_id=1468838
+
+Hi,
+
+ i was seeing the specification of pdf and came to know the
+structure information of pdf will be in PDSEdit
+layer,PDSEdit Layer gives access to structure tree with in a
+pdf and methods methods and objects are prefixed by PDS.So
+how can we get access to PDSEdit layer of pdf.</text>
+			</item>
+			<item>
+				<date>1142243392</date>
+				<sender>qumar</sender>
+				<text>Logged In: YES 
+user_id=1468838
+
+It would be nice if pdfbox can provide the ability to
+extract information from tagged PDF.As Adobre Acrobat Reader
+provides the tags for the pdf, pdfbox should also try to get
+the tagged pdfs.
+
+for example if iwe have a pdf file with a para1 under
+header1 and para2 under header 2 and a table with rows and
+columns.something like 
+ 
+Header1 
+This is a para 1 ,it describes about a disease.  
+Header2 
+This is a para2,describes remedies of disease. 
+Table 
+A B  
+C D 
+ 
+ 
+Now the tagged pdf looks like below in adobe acrobat reader
+ 
+&lt;Heading 1&gt; 
+Header1 
+&lt;Normal&gt;  
+This is a para 1 ,it describes about a disease. 
+&lt;Heading 1&gt; 
+Header1 
+&lt;Normal&gt;  
+This is a para2,describes remedies of disease. 
+&lt;Heading 1&gt; 
+Table 
+&lt;Table&gt; 
+&lt;TBody&gt; 
+&lt;TR&gt; 
+&lt;TD&gt; 
+&lt;Normal&gt; 
+A 
+&lt;TD&gt; 
+&lt;Normal&gt; 
+B 
+&lt;TR&gt; 
+&lt;TD&gt; 
+&lt;Normal&gt; 
+C 
+&lt;TD&gt; 
+&lt;Normal&gt; 
+D 
+
+how can we extract the Heading1 ,Heading 2 and tabular data
+using pdfbox.
+
+This is a good feature which should be added to the armory
+pdfbox.
+
+Please provide this feature.
+
+ </text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978844</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="771559">
+		<submitted_by>elfring</submitted_by>
+		<submitted_date>2003-07-15 05:23</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Support for PDF/X-3</summary>
+		<detail>Would you like to support the standard 
+&amp;quot;http://pdfx3.org/faq.html&amp;quot; for high quality printing?</detail>
+		<follow_ups>
+			<item>
+				<date>1107400771</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+This is a nice feature, but way out of the bounds of PDFBox.  
+I would gladly take any contributions that you have.  If there 
+is a need for this, then maybe a customer could fund the 
+development.
+
+Closing this issue for now, contribute a patch if you would like 
+it to be part of PDFBox.
+
+Ben</text>
+			</item>
+			<item>
+				<date>1063626294</date>
+				<sender>elfring</sender>
+				<text>Logged In: YES 
+user_id=572001
+
+What is the result if you check your PDF files with the 
+following tools?
+- Free PDF/X Test Patch
+  http://pdfx3.org/pdfxtestpatch.html
+
+- PDF/X-3 Inspector
+  http://pdfx3.org/download.html</text>
+			</item>
+			<item>
+				<date>1063470359</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Is this offer?  I have quite a bit of work with just the PDF 
+specification, not sure if my time would be best spent 
+supporting PDF/X specification as well.</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1107400771</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1107400771</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978846</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="804658">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-09-11 12:18</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="8">8</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Extract Images</summary>
+		<detail>Add the ability to extract images to various formats.</detail>
+		<follow_ups>
+			<item>
+				<date>1102337792</date>
+				<sender>mariuso</sender>
+				<text>Logged In: YES 
+user_id=109405
+
+I'm interested too in such a feature. 
+Ben, do you think is easy to modify PDFTextStripper in order 
+to aquire that ?
+</text>
+			</item>
+			<item>
+				<date>1102327962</date>
+				<sender>nobody</sender>
+				<text>Logged In: NO 
+
+I have a pdf containing text and images on the same page.
+
+I need a posibility to know the text and the place of images. 
+For example i need a output like that:
+
+Text text text
+&lt;image /&gt;
+
+text another text
+&lt;image /&gt;
+...
+text text 
+
+
+Is there any posibility to do that using pdfbox ?
+
+(I'm using TextStripper right now, but this in not giving me 
+any image info).
+
+Thanx in advance
+</text>
+			</item>
+			<item>
+				<date>1100260167</date>
+				<sender>ninki</sender>
+				<text>Logged In: YES 
+user_id=370177
+
+Ben, I did some testing and I'm getting the following problems:
+1. PDDeviceGray.createColorModel is not implemented. I need
+this to read PDFs produced by our scanner (black and white
+only).
+Exception in thread "main" java.io.IOException: Not implemented
+        at
+org.pdfbox.pdmodel.graphics.color.PDDeviceGray.createColorModel(PDDeviceGray.java:101)
+
+2. CCITTFaxDecode.decode is not implemented.
+
+3. DCTFilter.decode is not implemented
+
+4. Running ExtractImages on one of my PDFs, I got the
+following  error:
+Exception in thread "main" java.lang.ClassCastException:
+org.pdfbox.cos.COSArray
+        at
+org.pdfbox.pdmodel.graphics.image.PDXObjectImageFactory.getImage(PDXObjectImageFactory.ja
+va:38)
+        at
+org.pdfbox.pdmodel.PDResources.getImages(PDResources.java:155)
+        at
+org.pdfbox.ExtractImages.extractImages(ExtractImages.java:159)
+
+5. I had partial success on one PDF, and produced an image!
+However, the image was upside down, and I used Paint to flip
+it vertically to correct this.
+
+Finally, I will send you the 3 test PDFs I tried this on.</text>
+			</item>
+			<item>
+				<date>1100224996</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+pdfbox now contains a command line app called 
+org.pdfbox.ExtractImages.</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1100224996</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1100224996</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>6</old_value>
+				<date>1100023232</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1100023113</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978844</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="802407">
+		<submitted_by>winstanley_john</submitted_by>
+		<submitted_date>2003-09-08 04:42</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="525555">utilities</category>
+		<artifact_group id="293150"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>PDF to HTML conversion </summary>
+		<detail>PDF to HTML conversion. 
+Conserve formating.
+
+check out www.sourceforge.net/projects/pdftohtml 
+for a hack of this process.
+
+</detail>
+		<follow_ups>
+			<item>
+				<date>1066046163</date>
+				<sender>winstanley_john</sender>
+				<text>Logged In: YES 
+user_id=747013
+
+Also conversion to xml or word etc would be amazing.
+</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978845</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="765686">
+		<submitted_by>bguan</submitted_by>
+		<submitted_date>2003-07-03 17:57</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="519376">text extraction</category>
+		<artifact_group id="293150"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>CJK decoding</summary>
+		<detail>Another feature I need a lot is the correct interpretation 
+of CJK encoding.
+
+Yes, I know PDF can be a pain when it comes to 
+correctly interpreting CJK charsets, as many factors are 
+involved, including whether a font (or its subset) is 
+embeded or not.
+
+Attached is a simple Korean PDF that so far has not 
+been correctly interpreted by any java based 
+opensource libraries.  Though it could be rendered 
+correctly by XPDF on linux and also Windows.</detail>
+		<follow_ups>
+			<item>
+				<date>1079382000</date>
+				<sender>bguan</sender>
+				<text>Logged In: YES 
+user_id=815589
+
+Hello Ben,
+
+Thanks for the response.  I just downloaded PDFBox 0.6.5 and 
+wrote a little sample program to test it against 3 CJK PDF files 
+I have, and the output is still no good.  I have attached my 
+sample program, the 3 PDFs and the output in the attached 
+zip file.
+
+Can you tell me what I am foing wrong?
+
+The PDF files were generated by using Adobe Acrobat 5.0 
+using embeded fonts I believe.
+
+Thank you.</text>
+			</item>
+			<item>
+				<date>1079293585</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+There was no attachment with this.  I have done some CJK 
+work in the 0.6.5 release.  Please attach the document and I 
+can take a look at it.(Make sure you check the 'attach file' 
+checkbox)
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<existingfiles>
+			<file>
+				<id>80181</id>
+				<name>CJK.zip</name>
+				<description>CJK PDF, output and test program</description>
+				<filesize>142061</filesize>
+				<filetype></filetype>
+				<adddate>1079382000</adddate>
+				<submitted_by>815589</submitted_by>
+			</file>
+		</existingfiles>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978846</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>File Added</field>
+				<old_value>80181: CJK.zip</old_value>
+				<date>1079382001</date>
+				<by>bguan</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1019898">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-08-31 11:15</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="9">9</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Convert FDF to XFDF and vice versa</summary>
+		<detail>title says it all</detail>
+		<follow_ups>
+			<item>
+				<date>1096855207</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Now available in CVS.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1096855207</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1096855207</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>7</old_value>
+				<date>1096504885</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1096051369</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978837</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1033054">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-09-22 17:42</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="1">1</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Linearize command line tool</summary>
+		<detail>PDFBox should come with a utility to convert a pdf to a 
+linearized pdf document.
+
+</detail>
+		<follow_ups>
+			<item>
+				<date>1167786164</date>
+				<sender>renaudw</sender>
+				<text>Logged In: YES 
+user_id=609291
+Originator: NO
+
+I wish to voice my support for this feature request. It would be very useful to us too. Thanks for the great work!
+</text>
+			</item>
+			<item>
+				<date>1120458956</date>
+				<sender>meshcurrent</sender>
+				<text>Logged In: YES 
+user_id=989425
+
+I am involved in a free project where we are digitizing a very 
+large number of books and turning them into PDF using open 
+source software to serve on the Net. I thought I'd post a 
+comment to say that linearization would be a really attractive 
+feature for us in PDFBox if implemented.
+
+Youssef Eldakar
+Bibliotheca Alexandrina</text>
+			</item>
+			<item>
+				<date>1107721514</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Some example pdfs (linearized and not):
+	The linearized version is created by an evaluation 
+version of
+PdfLib. Don't worry about the blank page.
+	The pdf is also beeing validated by our in-house pdf 
+expert. I've
+tryed it today.
+	Sadly, it is urgent for us to deliver a correct version 
+of the pdf
+to our customer, I think we will buy a version of 
+	PdfLib (we control
+it via a JNI Bridge)..
+	Anyhow, if you find a way of implementing the 
+linearization in
+PDFBox, I will be happy to throw away PdfLib.
+	I think a constructor like 
+org.pdfbox.pdmodel.PDDocument(COSDocument
+doc, boolean linearize) would be nice, and sorry 	I 
+don't have time to
+help you in enhancing PdfBox now. (Maybe I'll write some 
+examples of basic
+usage pattern of your 	library)
+
+
+See the following for examples.
+Linearized_c_14720040602en00010001.pdf
+Not_Linearized_c_14720040602en00010001.pdf
+
+</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1096051448</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1035196">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-09-26 16:09</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="1">1</priority>
+		<status>Open</status>
+		<resolution>None</resolution>
+		<summary>Implement File Specification</summary>
+		<detail>org.pdfbox.pdmodel.common.filespecification has been 
+created but not implemented.
+
+See PDF Reference 1.5 section 3.10 for File Specification 
+Details.
+
+</detail>
+	</artifact>
+	<artifact id="830508">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-10-26 06:35</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="8">8</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Handle new PDF 1.5 cross reference streams</summary>
+		<detail>see PDF Reference 1.5 Cross-Reference Streams</detail>
+		<follow_ups>
+			<item>
+				<date>1098037373</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+supports reading them, does not support writing cross 
+reference streams.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1098037373</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1098037373</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>6</old_value>
+				<date>1096504927</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1095168152</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978843</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="807179">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-09-16 07:57</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="9">9</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>XFDF Import/Export</summary>
+		<detail>Allow uses to export/import xfdf data.</detail>
+		<follow_ups>
+			<item>
+				<date>1096855150</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Now available in CVS.
+
+Ben</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1096855150</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1096855150</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>7</old_value>
+				<date>1096504904</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>6</old_value>
+				<date>1095168125</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1095168087</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978844</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="830505">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2003-10-26 06:34</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="100">None</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="8">8</priority>
+		<status>Closed</status>
+		<resolution>None</resolution>
+		<summary>Handle new PDF 1.5 object streams</summary>
+		<detail>See PDF Reference section 3.4.6</detail>
+		<follow_ups>
+			<item>
+				<date>1098037418</date>
+				<sender>benlitchfield</sender>
+				<text>Logged In: YES 
+user_id=601708
+
+Now supported tonights nightly build.
+
+Ben</text>
+			</item>
+			<item>
+				<date>1068326456</date>
+				<sender>adrianhumphreys</sender>
+				<text>Logged In: YES 
+user_id=904851
+
+This is essential since you can't tell a 1.5 pdf from
+earlier versions when you're doing automated parsing. 
+
+For example I attempted to parse &amp;quot;PDFReference15_v6.pdf&amp;quot; from
+http://partners.adobe.com/asn/tech/pdf/specifications.jsp.
+Adobe warns that this is &amp;quot;compressed using Acrobat 6
+compression&amp;quot;.
+
+0.6.3 PDFparser.parse() throws an IOException &amp;quot;error:
+Expected integer type, actual=&amp;quot;&amp;quot;.</text>
+			</item>
+		</follow_ups>
+		<change_log>
+			<item>
+				<field>status_id</field>
+				<old_value>1</old_value>
+				<date>1098037418</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>close_date</field>
+				<old_value>0</old_value>
+				<date>1098037418</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>6</old_value>
+				<date>1096504915</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>priority</field>
+				<old_value>5</old_value>
+				<date>1095168172</date>
+				<by>benlitchfield</by>
+			</item>
+			<item>
+				<field>assigned_to</field>
+				<old_value>100</old_value>
+				<date>1093978843</date>
+				<by>benlitchfield</by>
+			</item>
+		</change_log>
+	</artifact>
+	<artifact id="1035635">
+		<submitted_by>benlitchfield</submitted_by>
+		<submitted_date>2004-09-27 11:35</submitted_date>
+		<artifact_type id="552835">552835</artifact_type>
+		<category id="565624">PDModel</category>
+		<artifact_group id="100"></artifact_group>
+		<assigned_to>Ben Litchfield</assigned_to>
+		<priority id="5">5</priority>
+		<status>Open</status>

[... 4020 lines stripped ...]