You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by Andreas Lehmkuehler <an...@lehmi.de> on 2012/03/06 19:43:23 UTC

Re: PDFBox Project for GSoC 2012

Hi,

Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
> Hi,
>
>
> I'm university student in Sri Lanka and a newbie to Open Source
> Development. I would like to participate for Google Summer of Code 2012
> with an Apache Project. Since I'm familiar with Java and I have used PDFBox
> Library for my academic project, I like to develop new feature for PDFBox
> as my GSoC project. First of all I want to know that is it possible to
> participate for GSoC 2012 with PDFBox project?
>
>
> If it is yes, I want help from PDFBox development community to select
> appropriate PDFBox task for GSoC.
There is a lot to do and I'm sure that some of those jobs should qualify as GSoC 
task.

> If you have any idea about good project or advice for me, please reply to
> this.
You will need a mentor and I'm not sure if you will find one among our ranks. 
I'd like to support you, but I can't do it due to personal reasons.

Anybody else?


BR
Andreas Lehmkühler

Re: PDFBox Project for GSoC 2012

Posted by Jeremias Maerki <de...@jeremias-maerki.ch>.
Hey guys

On 18.03.2012 03:16:14 Tharaka Nayanajith Wijebandara wrote:
> Hi,
> 
> 
> Thanks mehdi.
> 
> 
> I have two ideas for a GSoC task, but need all of your help to select
> suitable one.
> 
> 
>    - One project is HTML to PDF and vise versa converter. This feature can
>    be found in JIRA also (https://issues.apache.org/jira/browse/PDFBOX-6,
>    https://issues.apache.org/jira/browse/PDFBOX-9)

HTML to PDF sounds like it requires a full layout engine which is a big
undertaking. Using (parts of) Apache FOP as a base would seem to me to
be the better base than PDFBox because of its infrastructure to generate
many output formats, not just PDF. Please note that there are already
tools (like Flying Saucer) that do that although they have the "wrong"
license. :-) Anyway, having a good HTML/CSS engine @Apache would be a
killer. But it's something that clearly goes beyong a GSoC project.

As an alternative to that, some of you may remember recent discussions
about the desire for an API to create simple layouts with PDFBox. I
think that was coming from the XDocReport and Apache ODF Toolkit corners
(search for "Angelo Zerr").

PDF to HTML is surely a great use case for PDFBox. One thing that could
be very interesting in this context would be to use the structure tree 
(tagged PDF) if it is available to improve the HTML output. Pure text
extraction might also profit from that.

> 
>    - Other one is enhancing features of PDF reader and zooming features,
>    page display features, bookmark navigator, page thumbnail viewer can be
>    very much useful. Since I have previous experience in awt, swing and
>    java2d, it will be easy for me.

Improving the PDF Viewer would be soooo cool! I'm still dreaming of that
Adobe Acrobat Professional analog using PDFBox, i.e. an well-designed
GUI-application base that can easily be extended with plug-ins for more
than just PDF viewing: integrated PDF Debugger (which I use all the time),
type writer feature, object inspection by point and click, page rotation,
insert/remove/move pages, extract images, image to PDF etc. etc. After
all, PDFBox already has so many of the features required but they are
mostly accessible only to developers or from the command-line. If only I
had time to do it, I would already have started with it.

> 
> There might be several other tasks which are important than this. So all of
> you are welcome, to reply with good ideas.
> 
> On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand <me...@gmail.com> wrote:
> 
> > Hi Tharaka,
> >
> > Have you had any more thoughts on a project you'd like to undertake?
> > Have you applied and been through all the admin needed to be accepted
> > into GSoC 2012? Let me know if you need any help.
> >
> > Mehdi
> >
> > On 9 March 2012 06:25, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> > > Hi,
> > >
> > > Am 07.03.2012 07:40, schrieb mehdi houshmand:
> > >>
> > >> Hi Andreas,
> > >>
> > >> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
> > >> PDFBox.. . I do have domain expertise being a full-time FOP developer
> > >> and having dealt with PDFs and fonts quite a bit. Should I pop an
> > >> email to dev-community to see if it's ok? It seems like such a waste
> > >> to have an interested applicant but no mentor...
> > >
> > > I'm not an GSoC expert but that sounds good to me. You may double check
> > with
> > > the dev-community, but IMHO it's not necessary.
> > > I'm glad that you volunteer to help us, thanks in advance. I'll try to
> > help
> > > as much as I can.
> > >
> > >
> > > BR
> > > Andreas Lehmkühler
> > >
> > >
> > >> Mehdi
> > >>
> > >> On 6 March 2012 21:32, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
> > >>>
> > >>> Hi,
> > >>>
> > >>>
> > >>> Am 06.03.2012 21:24, schrieb mehdi houshmand:
> > >>>
> > >>>> Hi Andreas,
> > >>>>
> > >>>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
> > >>>> putting myself forward as a candidate... Of course, that is if no one
> > >>>> else does.
> > >>>
> > >>>
> > >>> Thanks for the offer, but AFAIKT it's not possible. According to [1]
> > the
> > >>> mentor has to be an ASF member or committer.
> > >>>
> > >>>
> > >>>> Mehdi
> > >>>>
> > >>>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>
> >  wrote:
> > >>>>>
> > >>>>>
> > >>>>> Hi,
> > >>>>>
> > >>>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
> > >>>>>
> > >>>>>> Hi,
> > >>>>>>
> > >>>>>>
> > >>>>>> I'm university student in Sri Lanka and a newbie to Open Source
> > >>>>>> Development. I would like to participate for Google Summer of Code
> > >>>>>> 2012
> > >>>>>> with an Apache Project. Since I'm familiar with Java and I have used
> > >>>>>> PDFBox
> > >>>>>> Library for my academic project, I like to develop new feature for
> > >>>>>> PDFBox
> > >>>>>> as my GSoC project. First of all I want to know that is it possible
> > to
> > >>>>>> participate for GSoC 2012 with PDFBox project?
> > >>>>>>
> > >>>>>>
> > >>>>>> If it is yes, I want help from PDFBox development community to
> > select
> > >>>>>> appropriate PDFBox task for GSoC.
> > >>>>>
> > >>>>>
> > >>>>>
> > >>>>> There is a lot to do and I'm sure that some of those jobs should
> > >>>>> qualify
> > >>>>> as
> > >>>>> GSoC task.
> > >>>>>
> > >>>>>
> > >>>>>> If you have any idea about good project or advice for me, please
> > reply
> > >>>>>> to
> > >>>>>> this.
> > >>>>>
> > >>>>>
> > >>>>>
> > >>>>> You will need a mentor and I'm not sure if you will find one among
> > our
> > >>>>> ranks. I'd like to support you, but I can't do it due to personal
> > >>>>> reasons.
> > >>>>>
> > >>>>> Anybody else?
> > >>>>>
> > >>>>>
> > >>>>> BR
> > >>>>> Andreas Lehmkühler
> > >>>
> > >>>
> > >>>
> > >>> BR
> > >>> Andreas Lehmkühler
> > >>>
> > >>> [1] http://community.apache.org/guide-to-being-a-mentor.html
> > >
> > >
> >
> 
> 
> 
> -- 
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.




Jeremias Maerki


Re: PDFBox Project for GSoC 2012

Posted by Maruan Sahyoun <sa...@fileaffairs.de>.
Hi,

suggestions:

1) Mapping PDF features to version and standards e.g. layers in PDF are a PDF 1.5 feature, AES encryption for PDF 1.6. 
2) PDF Writer to support versions and standards
3) OpenType support
4) Performance improvements (e.g. with some applications we developed iText is approx. 30% faster merging PDFs than PDFBox)
5) Widgets for PDF generation (Text, Tables …) as there seems to be some demand in using PDFBox for generation of PDFs from scratch although I think that one could use e.g FOP for that purpose.
6) Documentation. 

With kind regards

Maruan Sahyoun


Am 19.03.2012 um 07:45 schrieb Andreas Lehmkuehler:

> Hi,
> 
> Am 18.03.2012 03:16, schrieb Tharaka Nayanajith Wijebandara:
>> Hi,
>> 
>> 
>> Thanks mehdi.
>> 
>> 
>> I have two ideas for a GSoC task, but need all of your help to select
>> suitable one.
>> 
>> 
>>    - One project is HTML to PDF and vise versa converter. This feature can
>>    be found in JIRA also (https://issues.apache.org/jira/browse/PDFBOX-6,
>>    https://issues.apache.org/jira/browse/PDFBOX-9)
> Good idea, but complicated, as some of the feature you would need aren't yet implemented.
> 
>>    - Other one is enhancing features of PDF reader and zooming features,
>>    page display features, bookmark navigator, page thumbnail viewer can be
>>    very much useful. Since I have previous experience in awt, swing and
>>    java2d, it will be easy for me.
> I like this idea. It would be a nice feature.
> 
>> There might be several other tasks which are important than this. So all of
>> you are welcome, to reply with good ideas.
> Yes there are a lot things to do, probably someone else might come up with a wish?
> 
>> On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand<me...@gmail.com>  wrote:
>> 
>>> Hi Tharaka,
>>> 
>>> Have you had any more thoughts on a project you'd like to undertake?
>>> Have you applied and been through all the admin needed to be accepted
>>> into GSoC 2012? Let me know if you need any help.
>>> 
>>> Mehdi
>>> 
>>> On 9 March 2012 06:25, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>>> Hi,
>>>> 
>>>> Am 07.03.2012 07:40, schrieb mehdi houshmand:
>>>>> 
>>>>> Hi Andreas,
>>>>> 
>>>>> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
>>>>> PDFBox.. . I do have domain expertise being a full-time FOP developer
>>>>> and having dealt with PDFs and fonts quite a bit. Should I pop an
>>>>> email to dev-community to see if it's ok? It seems like such a waste
>>>>> to have an interested applicant but no mentor...
>>>> 
>>>> I'm not an GSoC expert but that sounds good to me. You may double check
>>> with
>>>> the dev-community, but IMHO it's not necessary.
>>>> I'm glad that you volunteer to help us, thanks in advance. I'll try to
>>> help
>>>> as much as I can.
>>>> 
> <SNIP>
> 
> BR
> Andreas Lehmkühler


Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi,

If you're looking for more ideas, I've got a couple suggestions concerning
PDF post-processing which are available with other PDF libraries:

1) PDF optimizations - This could be PDF linearization, image-resolution
reduction, font subsetting (when fonts are fully embedded) etc... There are
quite a few things that could be done here, we could look at what Adobe
Acrobat Pro does as a post-process action for inspiration on projects.

2) Font de-duplication -  This isn't indifferent to the suggestion above,
but is probably a project on it's own. This has caught out a couple people
(myself included) of late and could be a very technically interesting
project. Basically when PDFs are merged/overlaid all resources are blindly
copied, if we could find some way of re-subsetting fonts to reduce
duplication it would significantly reduce the overhead of RIP-ing merged
PDFs. (FYI, Adobe Acrobat Pro does this very badly, it's not a simple
project, but I think we can do it better with a little diligence).

I can't think of anything else, if anyone else has any thoughts, please do
come forward with project proposals.

Mehdi

On 19 March 2012 06:45, Andreas Lehmkuehler <an...@lehmi.de> wrote:

> Hi,
>
> Am 18.03.2012 03:16, schrieb Tharaka Nayanajith Wijebandara:
>
>> Hi,
>>
>>
>> Thanks mehdi.
>>
>>
>> I have two ideas for a GSoC task, but need all of your help to select
>> suitable one.
>>
>>
>>    - One project is HTML to PDF and vise versa converter. This feature can
>>    be found in JIRA also (https://issues.apache.org/**
>> jira/browse/PDFBOX-6 <https://issues.apache.org/jira/browse/PDFBOX-6>,
>>    https://issues.apache.org/**jira/browse/PDFBOX-9<https://issues.apache.org/jira/browse/PDFBOX-9>
>> )
>>
> Good idea, but complicated, as some of the feature you would need aren't
> yet implemented.
>
>
>     - Other one is enhancing features of PDF reader and zooming features,
>>    page display features, bookmark navigator, page thumbnail viewer can be
>>    very much useful. Since I have previous experience in awt, swing and
>>    java2d, it will be easy for me.
>>
> I like this idea. It would be a nice feature.
>
>
>  There might be several other tasks which are important than this. So all
>> of
>> you are welcome, to reply with good ideas.
>>
> Yes there are a lot things to do, probably someone else might come up with
> a wish?
>
>
>  On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand<me...@gmail.com>
>>  wrote:
>>
>>  Hi Tharaka,
>>>
>>> Have you had any more thoughts on a project you'd like to undertake?
>>> Have you applied and been through all the admin needed to be accepted
>>> into GSoC 2012? Let me know if you need any help.
>>>
>>> Mehdi
>>>
>>> On 9 March 2012 06:25, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>>
>>>> Hi,
>>>>
>>>> Am 07.03.2012 07:40, schrieb mehdi houshmand:
>>>>
>>>>>
>>>>> Hi Andreas,
>>>>>
>>>>> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
>>>>> PDFBox.. . I do have domain expertise being a full-time FOP developer
>>>>> and having dealt with PDFs and fonts quite a bit. Should I pop an
>>>>> email to dev-community to see if it's ok? It seems like such a waste
>>>>> to have an interested applicant but no mentor...
>>>>>
>>>>
>>>> I'm not an GSoC expert but that sounds good to me. You may double check
>>>>
>>> with
>>>
>>>> the dev-community, but IMHO it's not necessary.
>>>> I'm glad that you volunteer to help us, thanks in advance. I'll try to
>>>>
>>> help
>>>
>>>> as much as I can.
>>>>
>>>>  <SNIP>
>
> BR
> Andreas Lehmkühler
>

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

I've just been trawling through my emails and there's been some confusion
here. The google-melange deadline is 20th April for project submissions,
however, because the ASF is a large and single organization it gets an
allocation of GSoC places. The ASF then needs to go through all the project
proposals and their respective ranking reviews to decide which projects get
allocated places.

As such the deadline for ranking the projects was the 16th, I'll send you
the email explaining as such.

Mehdi

On 17 April 2012 21:07, mehdi houshmand <me...@gmail.com> wrote:

> Hi Tharaka,
>
> Really? I could have sworn there was a 16th April deadline! I haven't had
> an email with your request (
> https://issues.apache.org/jira/secure/IssueNavigator!executeAdvanced.jspa),
> the dev-community gets spammed with everyone's projects proposals, and
> there have been a lot. I just found yours at (for anyone else looking for
> it):
>
> http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/tharaka/1
>
> The ASF protocol is to create an issue on JIRA (link above) and then do
> one on google-melange, it allows a link for searching and keeping track of
> things. I think there are other reasons, but I forget them, anywho, my
> sincerest apologies, I'll review it in the morning.
>
> Mehdi
>
>
> On 17 April 2012 17:01, Tharaka Nayanajith Wijebandara <
> tharaka.nw@gmail.com> wrote:
>
>> Hi Mehdi,
>>
>>
>> I can't understand what actually happened. I submitted my proposal before
>> the deadline (April 6:19:00 UTC) and still it's available in
>> google-melange
>> site. Also according to timeline mentors are supposed to review the
>> proposal before the 20th April. I may be wrong, but don't have any idea
>> about why you said that we have missed the deadline. Anyway I'm still
>> ready
>> to continue with this project.
>>
>> On Tue, Apr 17, 2012 at 4:31 PM, Dexter Mishra <dexter.mishra@gmail.com
>> >wrote:
>>
>> > Well Still who ever wants want to work can work on that. It will be
>> their
>> > credit. GSc is not the only way.
>> >
>> > On Tue, Apr 17, 2012 at 12:24 PM, mehdi houshmand <me...@gmail.com>
>> > wrote:
>> >
>> > > Ok, well, disappointingly we've missed the deadline to submit a
>> proposal
>> > > and review said proposal. Thanks for all the support from everyone,
>> sorry
>> > > we couldn't arrange it this year.
>> > >
>> > > Mehdi
>> > >
>> > > On 10 April 2012 08:22, mehdi houshmand <me...@gmail.com> wrote:
>> > >
>> > > > Hi Tharaka,
>> > > >
>> > > > Sorry for the slow response, my bad, I don't have any worthwhile
>> excuse
>> > > > other than it's holiday season over here. As for the proposal, thats
>> > > fine,
>> > > > I'd be happy with that proposal. If no one else has any comments on
>> > this,
>> > > > I'd suggest moving writing this up on google-melange and getting the
>> > ball
>> > > > rolling.
>> > > >
>> > > > Keep up the good work,
>> > > >
>> > > > Mehdi
>> > > >
>> > > >
>> > > > On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara <
>> > > > tharaka.nw@gmail.com> wrote:
>> > > >
>> > > >> Hi,
>> > > >>
>> > > >> Thank you very much for your feedbacks and suggestions.
>> > > >>
>> > > >> After going through all of your feedbacks, I modified the proposal
>> > with
>> > > >> them and used some my own ideas also. Additionally, I included the
>> > > >> proposed
>> > > >> time line of the project and I would like to see your comments
>> here.
>> > > >>
>> > > >>
>> > > >> *Project Title*
>> > > >>
>> > > >> Implement PDF object inspector and enhance features of PDFReader
>> GUI.
>> > > >>
>> > > >>
>> > > >> *Abstract*
>> > > >>
>> > > >> PDFReader is small GUI application which comes with Apache PDFBox
>> pure
>> > > >> java
>> > > >> PDF library. Currently it is only capable of rendering PDFs. This
>> > > project
>> > > >> is to redesign PDFReader GUI with a plugin framework and implement
>> > some
>> > > >> developer oriented features for it. Primary objectives of project
>> are
>> > as
>> > > >> follows.
>> > > >>
>> > > >> (1). Design a plugin framework for the PDFReader.
>> > > >>
>> > > >> (2). Implement PDF basic viewing features. (Zooming, rotating, page
>> > > >> thumbnail viewer)
>> > > >>
>> > > >> (3). Implement a PDF object inspector to display PDF object
>> > properties.
>> > > >>
>> > > >> (4). Implement Export command for converting PDFs into different
>> file
>> > > >> formats (JPEG, PNG, TXT).
>> > > >>
>> > > >>
>> > > >>
>> > > >> *Project Details*
>> > > >>
>> > > >> *
>> > > >> *
>> > > >>
>> > > >> PDFBox already has so many features required, but currently most of
>> > them
>> > > >> only available for developers and from command line. Aim of this
>> > project
>> > > >> is
>> > > >> extending PDFReader into well designed application which allows
>> access
>> > > to
>> > > >> some PDFBox features through GUI and integrating some developer
>> > oriented
>> > > >> features. However, project itself is not going to consider GUI
>> access
>> > to
>> > > >> all available PDFBox features. Instead of that it re-architects
>> > > PDFReader
>> > > >> into pluggable application which developer can easily integrate
>> other
>> > > >> features by developing plugins.
>> > > >>
>> > > >>
>> > > >> In that case, most important primary objective of the project is
>> > > designing
>> > > >> a simple plugin framework for the PDFReader. In the
>> implementation, we
>> > > can
>> > > >> use JAR Service Provider mechanism as the base of the plugin
>> > framework.
>> > > >> For
>> > > >> that first, it's required to identify what are pluggable classes
>> which
>> > > is
>> > > >> going to use in the application and define interfaces for those
>> > classes
>> > > >> with the basic methods. Then we can implement some mechanism for
>> > plugin
>> > > >> developer to define what are the classes to use, and where (which
>> > > >> JAR/directory) to load them from. When application loads, it will
>> > check
>> > > >> specific locations for plugins and create instance of those plugins
>> > > using
>> > > >> class loader. Here after application can use those plugin
>> instances as
>> > > >> components of application itself.
>> > > >>
>> > > >>
>> > > >> Rotating and zooming functions are basic features which is
>> available
>> > in
>> > > >> every PDF viewer and it allow user to view PDF pages from different
>> > > >> perspectives. However current PDFReader is unable to provide those
>> > > >> features. This project will address to this problem as part of
>> second
>> > > >> primary objective by implementing some menu commands for those
>> > features.
>> > > >> In
>> > > >> the implementation, it will be matter of changing some codes in
>> > > PageDrawer
>> > > >> and other related class which are responsible to render PDF pages
>> > using
>> > > >> Java2D.
>> > > >>
>> > > >>
>> > > >> Other interesting feature which is going to archive as another
>> part of
>> > > >> second objective is developing page thumbnail viewer for the
>> > PDFReader.
>> > > >> This will be cool GUI component for user to navigate through the
>> PDF
>> > > >> Document very easily. It will be image list which has small image
>> of
>> > the
>> > > >> every PDF page of the current document with page numbers. User can
>> > > simply
>> > > >> click on the image to jump the particular page. This component
>> will be
>> > > >> developed as plugin for PDFReader and user will be able to show and
>> > hide
>> > > >> using the menu command.
>> > > >>
>> > > >>
>> > > >> Third objective is implementing an object inspector for PDFReader
>> and
>> > it
>> > > >> will be very much useful to developer who wants to debug PDF
>> > documents.
>> > > >> User can enable click one the any object on the PDF viewer and then
>> > in a
>> > > >> small window, it will show the information about particular object
>> > while
>> > > >> highlighting the object inside the viewer. As an example if user
>> click
>> > > on
>> > > >> image of the PDF it will show the information such as image size,
>> > color
>> > > >> space, resolution and bit depth. If object is text, it will show
>> the
>> > > >> details about font, color space and color values. In implementation
>> > this
>> > > >> also will be developed as another plugin for the PDFReader.
>> > > >>
>> > > >>
>> > > >> Export feature is last primary objective which I'm going to
>> implement
>> > > >> within this project and it also will be developed as several
>> plugins.
>> > As
>> > > >> most of the available software, it's good idea to add 'Export' menu
>> > item
>> > > >> into file menu. This menu item will have submenu which contains
>> > > different
>> > > >> commands for different file formats (JPEG, PNG, TXT) and each of
>> those
>> > > >> commands will popup small dialog box which allows user to change
>> > export
>> > > >> settings such as page range, password, encoding type, file name and
>> > > >> location. After change those settings, user can simply click 'OK'
>> > button
>> > > >> of
>> > > >> the settings dialog box to start the converting process. Every
>> command
>> > > for
>> > > >> particular file format will be developed as a plugin and in future
>> > > >> developers can easily integrate commands for new file formats to
>> > convert
>> > > >> PDFs.
>> > > >>
>> > > >>
>> > > >> In addition to these primary objectives there are few secondary
>> > > objectives
>> > > >> which I'm going to consider with this project and major one will be
>> > the
>> > > >> 'Add Text' feature. It will allow user to draw textbox in any
>> place of
>> > > the
>> > > >> PDF page and type new text. There will be Small window or view will
>> > > appear
>> > > >> in this mode and user can change the font size, color and other
>> > related
>> > > >> properties. These new text objects will added to PDF page by
>> inserting
>> > > new
>> > > >> optional content group to the page and in case user wants to
>> remove or
>> > > >> edit
>> > > >> this newly added text, it's possible before saving PDF permanently.
>> > > >>
>> > > >>
>> > > >>
>> > > >> Add 'print' menu item, develop plugin for mange (add, edit, delete)
>> > > >> bookmarks and View for the basic PDF properties will be the set of
>> > > another
>> > > >> secondary objectives which I'm going to look at within this
>> project.
>> > > Since
>> > > >> most of them are already available in command line, it will be
>> easy to
>> > > add
>> > > >> those features as plugins for GUI if there will be enough time
>> after
>> > > >> achieving primary objectives.
>> > > >>
>> > > >>
>> > > >>
>> > > >> *Benefits to PDFBox*
>> > > >>
>> > > >> Most of the modern code libraries are delivered with GUI
>> application
>> > to
>> > > >> test the features before studying API. However, features of PDFBox
>> are
>> > > >> mostly accessible through the command line interface and it
>> > discourages
>> > > >> users and new developers from using PDFBox. By having GUI for
>> access
>> > > these
>> > > >> features any user can simply use them and developer can test the
>> > > features
>> > > >> without going through the code. Since designing plugin framework
>> is a
>> > > >> primary objective of the project, developers will be able to easily
>> > > append
>> > > >> new features for GUI in future.
>> > > >>
>> > > >> PDF object inspector will be great tool for developer and it will
>> make
>> > > >> easy
>> > > >> debugging PDFs. Furthermore, it will help to new developers to
>> study
>> > the
>> > > >> structure of PDF file and understanding PDFBox core
>> functionalities.
>> > > >>
>> > > >>
>> > > >>
>> > > >> *Project Timeline and Deliverables*
>> > > >>
>> > > >> *May 21 - May 28*
>> > > >>
>> > > >> ·   Read and understand the current PDFReader Code
>> > > >>
>> > > >> ·   Create basic design for plugin framework
>> > > >>
>> > > >> *May 28 - Jun 08*
>> > > >>
>> > > >>                Discuss plugin framework design with community
>> > > >>
>> > > >>  *Jun 08 - Jun 18*
>> > > >>
>> > > >>                Implement plugin framework
>> > > >>
>> > > >>  *Jun 18 - Jun 25*
>> > > >>
>> > > >>                Code some basic examples and test cases for plugin
>> > > >> framework
>> > > >>
>> > > >>  *Jun 25 - Jul 02*
>> > > >>
>> > > >>                Implement basic zooming and rotating features
>> > > >>
>> > > >>  *Jul 02 - Jul 09*
>> > > >>
>> > > >>                Implement page thumbnail viewer
>> > > >>
>> > > >>  *Jul 09 - Jul 11*
>> > > >>
>> > > >>                Study and Create design for Object Inspector
>> > > >>
>> > > >>  *Jul 12 - Jul 13*
>> > > >>
>> > > >>                Submit mid-term evaluation
>> > > >>
>> > > >>  *Jul 13 - Jul 23*
>> > > >>
>> > > >>                Implement PDF Object Inspector
>> > > >>
>> > > >>  *Jul 23 - Jul 30*
>> > > >>
>> > > >>                Implement export PDF feature
>> > > >>
>> > > >>  *Jul 30 - Aug 06*
>> > > >>
>> > > >>                Implement some secondary objectives
>> > > >>
>> > > >>  *Aug 06 - Aug 13*
>> > > >>
>> > > >>                Test, debug the code and complete documentation
>> > > >>
>> > > >>  *Aug 13 - Aug 20*
>> > > >>
>> > > >>                Submit the design and conclusion
>> > > >>
>> > > >>
>> > > >>
>> > > >> *Exams and other commitments*
>> > > >>
>> > > >> I have semester end examination of University from 28th May to 8th
>> > June.
>> > > >> After that I can work fulltime (40+ hours per week) on this project
>> > > until
>> > > >> end.
>> > > >>
>> > > >>
>> > > >>
>> > > >>
>> > > >> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <
>> med1985@gmail.com>
>> > > >> wrote:
>> > > >>
>> > > >> > >
>> > > >> > >
>> > > >> > > <snip>
>> > > >> > > How about just implementing the "add text" feature? That
>> should be
>> > > >> > > relatively easy. Add a new optional content group to the page
>> and
>> > > add
>> > > >> > > all text as part of that. That makes this a cool tool to fill
>> out
>> > > >> forms
>> > > >> > > that were scanned, or that are not AcroForms. I fully agree
>> that
>> > the
>> > > >> > > replace feature is going to be tough, relatively speaking.
>> > > >> > >
>> > > >> >
>> > > >> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to
>> > > agree
>> > > >> > with Andreas. If we implemented deleting text, it could get
>> pretty
>> > > >> tricksy,
>> > > >> > especially when CID coded fonts are used. This is a perfect
>> > candidate
>> > > >> of a
>> > > >> > secondary objective.
>> > > >> >
>> > > >> >
>> > > >> > > I wouldn't put the bar too high for this project. Get some
>> > > low-hanging
>> > > >> > > fruit that adds some cool and useful functionality to the GUI.
>> > More
>> > > >> can
>> > > >> > > always be added later.
>> > > >> > >
>> > > >> > > More ideas for low-hanging fruit (as additional tasks if you
>> end
>> > up
>> > > >> with
>> > > >> > > spare time after the main objectives above):
>> > > >> > > - Allow the "Save As..." menu item not only save a modified
>> PDF,
>> > but
>> > > >> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
>> > > >> > extraction).
>> > > >> > > That would make a plug-in for each output format, so we could
>> add
>> > > >> others
>> > > >> > > later.
>> > > >> > > - Add a "Print..." menu item.
>> > > >> > > - Integrate preflight as a plug-in into the application.
>> > > >> > > - Split, concatenate, page reorder, page rotation (not just
>> > viewing
>> > > >> but
>> > > >> > > modifying the PDF), overlay/underlay.
>> > > >> > > - Menu item: "Create PDF from image(s)" and "Add page from
>> image".
>> > > >> > > - A dialog/view listing the fonts in the PDF.
>> > > >> > > - A dialog/view for the basic PDF properties like in Acrobat's
>> > > >> > > Properties Dialog.
>> > > >> > > - A dialog/view listing all images with their properties and
>> > > effective
>> > > >> > > resolution in the PDF.
>> > > >> > > - I could go on and on... :-) I'm seeing a serious itch going
>> to
>> > > catch
>> > > >> > > me once you've set up a good base for that GUI. I'm so looking
>> > > forward
>> > > >> > > to this!
>> > > >> > >
>> > > >> > > Anyway, points 1 and 2 will be the most important tasks because
>> > they
>> > > >> > > provide the basis for easily adding additional functionality.
>> > > >> > >
>> > > >> >
>> > > >> > Yeah, those look like good tasks, maybe we'll add the two you
>> > suggest
>> > > to
>> > > >> > the primary objectives and the rest into secondary or even
>> tertiary
>> > > >> > objectives. It's good to see you getting geared up for this
>> > Jeremias,
>> > > >> we're
>> > > >> > no doubt going to call on your expertise ;-).
>> > > >> >
>> > > >> > Thanks for the support
>> > > >> >
>> > > >> > Mehdi
>> > > >> >
>> > > >>
>> > > >>
>> > > >>
>> > > >> --
>> > > >> Thanks & Regards,
>> > > >> Tharaka Wijebandara,
>> > > >> Faculty of Information Technology,
>> > > >> University of Moratuwa.
>> > > >>
>> > > >
>> > > >
>> > >
>> >
>>
>>
>>
>> --
>> Thanks & Regards,
>> Tharaka Wijebandara,
>> Faculty of Information Technology,
>> University of Moratuwa.
>>
>
>

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

Really? I could have sworn there was a 16th April deadline! I haven't had
an email with your request (
https://issues.apache.org/jira/secure/IssueNavigator!executeAdvanced.jspa),
the dev-community gets spammed with everyone's projects proposals, and
there have been a lot. I just found yours at (for anyone else looking for
it):
http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/tharaka/1

The ASF protocol is to create an issue on JIRA (link above) and then do one
on google-melange, it allows a link for searching and keeping track of
things. I think there are other reasons, but I forget them, anywho, my
sincerest apologies, I'll review it in the morning.

Mehdi

On 17 April 2012 17:01, Tharaka Nayanajith Wijebandara <tharaka.nw@gmail.com
> wrote:

> Hi Mehdi,
>
>
> I can't understand what actually happened. I submitted my proposal before
> the deadline (April 6:19:00 UTC) and still it's available in google-melange
> site. Also according to timeline mentors are supposed to review the
> proposal before the 20th April. I may be wrong, but don't have any idea
> about why you said that we have missed the deadline. Anyway I'm still ready
> to continue with this project.
>
> On Tue, Apr 17, 2012 at 4:31 PM, Dexter Mishra <dexter.mishra@gmail.com
> >wrote:
>
> > Well Still who ever wants want to work can work on that. It will be their
> > credit. GSc is not the only way.
> >
> > On Tue, Apr 17, 2012 at 12:24 PM, mehdi houshmand <me...@gmail.com>
> > wrote:
> >
> > > Ok, well, disappointingly we've missed the deadline to submit a
> proposal
> > > and review said proposal. Thanks for all the support from everyone,
> sorry
> > > we couldn't arrange it this year.
> > >
> > > Mehdi
> > >
> > > On 10 April 2012 08:22, mehdi houshmand <me...@gmail.com> wrote:
> > >
> > > > Hi Tharaka,
> > > >
> > > > Sorry for the slow response, my bad, I don't have any worthwhile
> excuse
> > > > other than it's holiday season over here. As for the proposal, thats
> > > fine,
> > > > I'd be happy with that proposal. If no one else has any comments on
> > this,
> > > > I'd suggest moving writing this up on google-melange and getting the
> > ball
> > > > rolling.
> > > >
> > > > Keep up the good work,
> > > >
> > > > Mehdi
> > > >
> > > >
> > > > On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara <
> > > > tharaka.nw@gmail.com> wrote:
> > > >
> > > >> Hi,
> > > >>
> > > >> Thank you very much for your feedbacks and suggestions.
> > > >>
> > > >> After going through all of your feedbacks, I modified the proposal
> > with
> > > >> them and used some my own ideas also. Additionally, I included the
> > > >> proposed
> > > >> time line of the project and I would like to see your comments here.
> > > >>
> > > >>
> > > >> *Project Title*
> > > >>
> > > >> Implement PDF object inspector and enhance features of PDFReader
> GUI.
> > > >>
> > > >>
> > > >> *Abstract*
> > > >>
> > > >> PDFReader is small GUI application which comes with Apache PDFBox
> pure
> > > >> java
> > > >> PDF library. Currently it is only capable of rendering PDFs. This
> > > project
> > > >> is to redesign PDFReader GUI with a plugin framework and implement
> > some
> > > >> developer oriented features for it. Primary objectives of project
> are
> > as
> > > >> follows.
> > > >>
> > > >> (1). Design a plugin framework for the PDFReader.
> > > >>
> > > >> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> > > >> thumbnail viewer)
> > > >>
> > > >> (3). Implement a PDF object inspector to display PDF object
> > properties.
> > > >>
> > > >> (4). Implement Export command for converting PDFs into different
> file
> > > >> formats (JPEG, PNG, TXT).
> > > >>
> > > >>
> > > >>
> > > >> *Project Details*
> > > >>
> > > >> *
> > > >> *
> > > >>
> > > >> PDFBox already has so many features required, but currently most of
> > them
> > > >> only available for developers and from command line. Aim of this
> > project
> > > >> is
> > > >> extending PDFReader into well designed application which allows
> access
> > > to
> > > >> some PDFBox features through GUI and integrating some developer
> > oriented
> > > >> features. However, project itself is not going to consider GUI
> access
> > to
> > > >> all available PDFBox features. Instead of that it re-architects
> > > PDFReader
> > > >> into pluggable application which developer can easily integrate
> other
> > > >> features by developing plugins.
> > > >>
> > > >>
> > > >> In that case, most important primary objective of the project is
> > > designing
> > > >> a simple plugin framework for the PDFReader. In the implementation,
> we
> > > can
> > > >> use JAR Service Provider mechanism as the base of the plugin
> > framework.
> > > >> For
> > > >> that first, it's required to identify what are pluggable classes
> which
> > > is
> > > >> going to use in the application and define interfaces for those
> > classes
> > > >> with the basic methods. Then we can implement some mechanism for
> > plugin
> > > >> developer to define what are the classes to use, and where (which
> > > >> JAR/directory) to load them from. When application loads, it will
> > check
> > > >> specific locations for plugins and create instance of those plugins
> > > using
> > > >> class loader. Here after application can use those plugin instances
> as
> > > >> components of application itself.
> > > >>
> > > >>
> > > >> Rotating and zooming functions are basic features which is available
> > in
> > > >> every PDF viewer and it allow user to view PDF pages from different
> > > >> perspectives. However current PDFReader is unable to provide those
> > > >> features. This project will address to this problem as part of
> second
> > > >> primary objective by implementing some menu commands for those
> > features.
> > > >> In
> > > >> the implementation, it will be matter of changing some codes in
> > > PageDrawer
> > > >> and other related class which are responsible to render PDF pages
> > using
> > > >> Java2D.
> > > >>
> > > >>
> > > >> Other interesting feature which is going to archive as another part
> of
> > > >> second objective is developing page thumbnail viewer for the
> > PDFReader.
> > > >> This will be cool GUI component for user to navigate through the PDF
> > > >> Document very easily. It will be image list which has small image of
> > the
> > > >> every PDF page of the current document with page numbers. User can
> > > simply
> > > >> click on the image to jump the particular page. This component will
> be
> > > >> developed as plugin for PDFReader and user will be able to show and
> > hide
> > > >> using the menu command.
> > > >>
> > > >>
> > > >> Third objective is implementing an object inspector for PDFReader
> and
> > it
> > > >> will be very much useful to developer who wants to debug PDF
> > documents.
> > > >> User can enable click one the any object on the PDF viewer and then
> > in a
> > > >> small window, it will show the information about particular object
> > while
> > > >> highlighting the object inside the viewer. As an example if user
> click
> > > on
> > > >> image of the PDF it will show the information such as image size,
> > color
> > > >> space, resolution and bit depth. If object is text, it will show the
> > > >> details about font, color space and color values. In implementation
> > this
> > > >> also will be developed as another plugin for the PDFReader.
> > > >>
> > > >>
> > > >> Export feature is last primary objective which I'm going to
> implement
> > > >> within this project and it also will be developed as several
> plugins.
> > As
> > > >> most of the available software, it's good idea to add 'Export' menu
> > item
> > > >> into file menu. This menu item will have submenu which contains
> > > different
> > > >> commands for different file formats (JPEG, PNG, TXT) and each of
> those
> > > >> commands will popup small dialog box which allows user to change
> > export
> > > >> settings such as page range, password, encoding type, file name and
> > > >> location. After change those settings, user can simply click 'OK'
> > button
> > > >> of
> > > >> the settings dialog box to start the converting process. Every
> command
> > > for
> > > >> particular file format will be developed as a plugin and in future
> > > >> developers can easily integrate commands for new file formats to
> > convert
> > > >> PDFs.
> > > >>
> > > >>
> > > >> In addition to these primary objectives there are few secondary
> > > objectives
> > > >> which I'm going to consider with this project and major one will be
> > the
> > > >> 'Add Text' feature. It will allow user to draw textbox in any place
> of
> > > the
> > > >> PDF page and type new text. There will be Small window or view will
> > > appear
> > > >> in this mode and user can change the font size, color and other
> > related
> > > >> properties. These new text objects will added to PDF page by
> inserting
> > > new
> > > >> optional content group to the page and in case user wants to remove
> or
> > > >> edit
> > > >> this newly added text, it's possible before saving PDF permanently.
> > > >>
> > > >>
> > > >>
> > > >> Add 'print' menu item, develop plugin for mange (add, edit, delete)
> > > >> bookmarks and View for the basic PDF properties will be the set of
> > > another
> > > >> secondary objectives which I'm going to look at within this project.
> > > Since
> > > >> most of them are already available in command line, it will be easy
> to
> > > add
> > > >> those features as plugins for GUI if there will be enough time after
> > > >> achieving primary objectives.
> > > >>
> > > >>
> > > >>
> > > >> *Benefits to PDFBox*
> > > >>
> > > >> Most of the modern code libraries are delivered with GUI application
> > to
> > > >> test the features before studying API. However, features of PDFBox
> are
> > > >> mostly accessible through the command line interface and it
> > discourages
> > > >> users and new developers from using PDFBox. By having GUI for access
> > > these
> > > >> features any user can simply use them and developer can test the
> > > features
> > > >> without going through the code. Since designing plugin framework is
> a
> > > >> primary objective of the project, developers will be able to easily
> > > append
> > > >> new features for GUI in future.
> > > >>
> > > >> PDF object inspector will be great tool for developer and it will
> make
> > > >> easy
> > > >> debugging PDFs. Furthermore, it will help to new developers to study
> > the
> > > >> structure of PDF file and understanding PDFBox core functionalities.
> > > >>
> > > >>
> > > >>
> > > >> *Project Timeline and Deliverables*
> > > >>
> > > >> *May 21 - May 28*
> > > >>
> > > >> ·   Read and understand the current PDFReader Code
> > > >>
> > > >> ·   Create basic design for plugin framework
> > > >>
> > > >> *May 28 - Jun 08*
> > > >>
> > > >>                Discuss plugin framework design with community
> > > >>
> > > >>  *Jun 08 - Jun 18*
> > > >>
> > > >>                Implement plugin framework
> > > >>
> > > >>  *Jun 18 - Jun 25*
> > > >>
> > > >>                Code some basic examples and test cases for plugin
> > > >> framework
> > > >>
> > > >>  *Jun 25 - Jul 02*
> > > >>
> > > >>                Implement basic zooming and rotating features
> > > >>
> > > >>  *Jul 02 - Jul 09*
> > > >>
> > > >>                Implement page thumbnail viewer
> > > >>
> > > >>  *Jul 09 - Jul 11*
> > > >>
> > > >>                Study and Create design for Object Inspector
> > > >>
> > > >>  *Jul 12 - Jul 13*
> > > >>
> > > >>                Submit mid-term evaluation
> > > >>
> > > >>  *Jul 13 - Jul 23*
> > > >>
> > > >>                Implement PDF Object Inspector
> > > >>
> > > >>  *Jul 23 - Jul 30*
> > > >>
> > > >>                Implement export PDF feature
> > > >>
> > > >>  *Jul 30 - Aug 06*
> > > >>
> > > >>                Implement some secondary objectives
> > > >>
> > > >>  *Aug 06 - Aug 13*
> > > >>
> > > >>                Test, debug the code and complete documentation
> > > >>
> > > >>  *Aug 13 - Aug 20*
> > > >>
> > > >>                Submit the design and conclusion
> > > >>
> > > >>
> > > >>
> > > >> *Exams and other commitments*
> > > >>
> > > >> I have semester end examination of University from 28th May to 8th
> > June.
> > > >> After that I can work fulltime (40+ hours per week) on this project
> > > until
> > > >> end.
> > > >>
> > > >>
> > > >>
> > > >>
> > > >> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <med1985@gmail.com
> >
> > > >> wrote:
> > > >>
> > > >> > >
> > > >> > >
> > > >> > > <snip>
> > > >> > > How about just implementing the "add text" feature? That should
> be
> > > >> > > relatively easy. Add a new optional content group to the page
> and
> > > add
> > > >> > > all text as part of that. That makes this a cool tool to fill
> out
> > > >> forms
> > > >> > > that were scanned, or that are not AcroForms. I fully agree that
> > the
> > > >> > > replace feature is going to be tough, relatively speaking.
> > > >> > >
> > > >> >
> > > >> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to
> > > agree
> > > >> > with Andreas. If we implemented deleting text, it could get pretty
> > > >> tricksy,
> > > >> > especially when CID coded fonts are used. This is a perfect
> > candidate
> > > >> of a
> > > >> > secondary objective.
> > > >> >
> > > >> >
> > > >> > > I wouldn't put the bar too high for this project. Get some
> > > low-hanging
> > > >> > > fruit that adds some cool and useful functionality to the GUI.
> > More
> > > >> can
> > > >> > > always be added later.
> > > >> > >
> > > >> > > More ideas for low-hanging fruit (as additional tasks if you end
> > up
> > > >> with
> > > >> > > spare time after the main objectives above):
> > > >> > > - Allow the "Save As..." menu item not only save a modified PDF,
> > but
> > > >> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
> > > >> > extraction).
> > > >> > > That would make a plug-in for each output format, so we could
> add
> > > >> others
> > > >> > > later.
> > > >> > > - Add a "Print..." menu item.
> > > >> > > - Integrate preflight as a plug-in into the application.
> > > >> > > - Split, concatenate, page reorder, page rotation (not just
> > viewing
> > > >> but
> > > >> > > modifying the PDF), overlay/underlay.
> > > >> > > - Menu item: "Create PDF from image(s)" and "Add page from
> image".
> > > >> > > - A dialog/view listing the fonts in the PDF.
> > > >> > > - A dialog/view for the basic PDF properties like in Acrobat's
> > > >> > > Properties Dialog.
> > > >> > > - A dialog/view listing all images with their properties and
> > > effective
> > > >> > > resolution in the PDF.
> > > >> > > - I could go on and on... :-) I'm seeing a serious itch going to
> > > catch
> > > >> > > me once you've set up a good base for that GUI. I'm so looking
> > > forward
> > > >> > > to this!
> > > >> > >
> > > >> > > Anyway, points 1 and 2 will be the most important tasks because
> > they
> > > >> > > provide the basis for easily adding additional functionality.
> > > >> > >
> > > >> >
> > > >> > Yeah, those look like good tasks, maybe we'll add the two you
> > suggest
> > > to
> > > >> > the primary objectives and the rest into secondary or even
> tertiary
> > > >> > objectives. It's good to see you getting geared up for this
> > Jeremias,
> > > >> we're
> > > >> > no doubt going to call on your expertise ;-).
> > > >> >
> > > >> > Thanks for the support
> > > >> >
> > > >> > Mehdi
> > > >> >
> > > >>
> > > >>
> > > >>
> > > >> --
> > > >> Thanks & Regards,
> > > >> Tharaka Wijebandara,
> > > >> Faculty of Information Technology,
> > > >> University of Moratuwa.
> > > >>
> > > >
> > > >
> > >
> >
>
>
>
> --
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.
>

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi Mehdi,


I can't understand what actually happened. I submitted my proposal before
the deadline (April 6:19:00 UTC) and still it's available in google-melange
site. Also according to timeline mentors are supposed to review the
proposal before the 20th April. I may be wrong, but don't have any idea
about why you said that we have missed the deadline. Anyway I'm still ready
to continue with this project.

On Tue, Apr 17, 2012 at 4:31 PM, Dexter Mishra <de...@gmail.com>wrote:

> Well Still who ever wants want to work can work on that. It will be their
> credit. GSc is not the only way.
>
> On Tue, Apr 17, 2012 at 12:24 PM, mehdi houshmand <me...@gmail.com>
> wrote:
>
> > Ok, well, disappointingly we've missed the deadline to submit a proposal
> > and review said proposal. Thanks for all the support from everyone, sorry
> > we couldn't arrange it this year.
> >
> > Mehdi
> >
> > On 10 April 2012 08:22, mehdi houshmand <me...@gmail.com> wrote:
> >
> > > Hi Tharaka,
> > >
> > > Sorry for the slow response, my bad, I don't have any worthwhile excuse
> > > other than it's holiday season over here. As for the proposal, thats
> > fine,
> > > I'd be happy with that proposal. If no one else has any comments on
> this,
> > > I'd suggest moving writing this up on google-melange and getting the
> ball
> > > rolling.
> > >
> > > Keep up the good work,
> > >
> > > Mehdi
> > >
> > >
> > > On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara <
> > > tharaka.nw@gmail.com> wrote:
> > >
> > >> Hi,
> > >>
> > >> Thank you very much for your feedbacks and suggestions.
> > >>
> > >> After going through all of your feedbacks, I modified the proposal
> with
> > >> them and used some my own ideas also. Additionally, I included the
> > >> proposed
> > >> time line of the project and I would like to see your comments here.
> > >>
> > >>
> > >> *Project Title*
> > >>
> > >> Implement PDF object inspector and enhance features of PDFReader GUI.
> > >>
> > >>
> > >> *Abstract*
> > >>
> > >> PDFReader is small GUI application which comes with Apache PDFBox pure
> > >> java
> > >> PDF library. Currently it is only capable of rendering PDFs. This
> > project
> > >> is to redesign PDFReader GUI with a plugin framework and implement
> some
> > >> developer oriented features for it. Primary objectives of project are
> as
> > >> follows.
> > >>
> > >> (1). Design a plugin framework for the PDFReader.
> > >>
> > >> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> > >> thumbnail viewer)
> > >>
> > >> (3). Implement a PDF object inspector to display PDF object
> properties.
> > >>
> > >> (4). Implement Export command for converting PDFs into different file
> > >> formats (JPEG, PNG, TXT).
> > >>
> > >>
> > >>
> > >> *Project Details*
> > >>
> > >> *
> > >> *
> > >>
> > >> PDFBox already has so many features required, but currently most of
> them
> > >> only available for developers and from command line. Aim of this
> project
> > >> is
> > >> extending PDFReader into well designed application which allows access
> > to
> > >> some PDFBox features through GUI and integrating some developer
> oriented
> > >> features. However, project itself is not going to consider GUI access
> to
> > >> all available PDFBox features. Instead of that it re-architects
> > PDFReader
> > >> into pluggable application which developer can easily integrate other
> > >> features by developing plugins.
> > >>
> > >>
> > >> In that case, most important primary objective of the project is
> > designing
> > >> a simple plugin framework for the PDFReader. In the implementation, we
> > can
> > >> use JAR Service Provider mechanism as the base of the plugin
> framework.
> > >> For
> > >> that first, it's required to identify what are pluggable classes which
> > is
> > >> going to use in the application and define interfaces for those
> classes
> > >> with the basic methods. Then we can implement some mechanism for
> plugin
> > >> developer to define what are the classes to use, and where (which
> > >> JAR/directory) to load them from. When application loads, it will
> check
> > >> specific locations for plugins and create instance of those plugins
> > using
> > >> class loader. Here after application can use those plugin instances as
> > >> components of application itself.
> > >>
> > >>
> > >> Rotating and zooming functions are basic features which is available
> in
> > >> every PDF viewer and it allow user to view PDF pages from different
> > >> perspectives. However current PDFReader is unable to provide those
> > >> features. This project will address to this problem as part of second
> > >> primary objective by implementing some menu commands for those
> features.
> > >> In
> > >> the implementation, it will be matter of changing some codes in
> > PageDrawer
> > >> and other related class which are responsible to render PDF pages
> using
> > >> Java2D.
> > >>
> > >>
> > >> Other interesting feature which is going to archive as another part of
> > >> second objective is developing page thumbnail viewer for the
> PDFReader.
> > >> This will be cool GUI component for user to navigate through the PDF
> > >> Document very easily. It will be image list which has small image of
> the
> > >> every PDF page of the current document with page numbers. User can
> > simply
> > >> click on the image to jump the particular page. This component will be
> > >> developed as plugin for PDFReader and user will be able to show and
> hide
> > >> using the menu command.
> > >>
> > >>
> > >> Third objective is implementing an object inspector for PDFReader and
> it
> > >> will be very much useful to developer who wants to debug PDF
> documents.
> > >> User can enable click one the any object on the PDF viewer and then
> in a
> > >> small window, it will show the information about particular object
> while
> > >> highlighting the object inside the viewer. As an example if user click
> > on
> > >> image of the PDF it will show the information such as image size,
> color
> > >> space, resolution and bit depth. If object is text, it will show the
> > >> details about font, color space and color values. In implementation
> this
> > >> also will be developed as another plugin for the PDFReader.
> > >>
> > >>
> > >> Export feature is last primary objective which I'm going to implement
> > >> within this project and it also will be developed as several plugins.
> As
> > >> most of the available software, it's good idea to add 'Export' menu
> item
> > >> into file menu. This menu item will have submenu which contains
> > different
> > >> commands for different file formats (JPEG, PNG, TXT) and each of those
> > >> commands will popup small dialog box which allows user to change
> export
> > >> settings such as page range, password, encoding type, file name and
> > >> location. After change those settings, user can simply click 'OK'
> button
> > >> of
> > >> the settings dialog box to start the converting process. Every command
> > for
> > >> particular file format will be developed as a plugin and in future
> > >> developers can easily integrate commands for new file formats to
> convert
> > >> PDFs.
> > >>
> > >>
> > >> In addition to these primary objectives there are few secondary
> > objectives
> > >> which I'm going to consider with this project and major one will be
> the
> > >> 'Add Text' feature. It will allow user to draw textbox in any place of
> > the
> > >> PDF page and type new text. There will be Small window or view will
> > appear
> > >> in this mode and user can change the font size, color and other
> related
> > >> properties. These new text objects will added to PDF page by inserting
> > new
> > >> optional content group to the page and in case user wants to remove or
> > >> edit
> > >> this newly added text, it's possible before saving PDF permanently.
> > >>
> > >>
> > >>
> > >> Add 'print' menu item, develop plugin for mange (add, edit, delete)
> > >> bookmarks and View for the basic PDF properties will be the set of
> > another
> > >> secondary objectives which I'm going to look at within this project.
> > Since
> > >> most of them are already available in command line, it will be easy to
> > add
> > >> those features as plugins for GUI if there will be enough time after
> > >> achieving primary objectives.
> > >>
> > >>
> > >>
> > >> *Benefits to PDFBox*
> > >>
> > >> Most of the modern code libraries are delivered with GUI application
> to
> > >> test the features before studying API. However, features of PDFBox are
> > >> mostly accessible through the command line interface and it
> discourages
> > >> users and new developers from using PDFBox. By having GUI for access
> > these
> > >> features any user can simply use them and developer can test the
> > features
> > >> without going through the code. Since designing plugin framework is a
> > >> primary objective of the project, developers will be able to easily
> > append
> > >> new features for GUI in future.
> > >>
> > >> PDF object inspector will be great tool for developer and it will make
> > >> easy
> > >> debugging PDFs. Furthermore, it will help to new developers to study
> the
> > >> structure of PDF file and understanding PDFBox core functionalities.
> > >>
> > >>
> > >>
> > >> *Project Timeline and Deliverables*
> > >>
> > >> *May 21 - May 28*
> > >>
> > >> ·   Read and understand the current PDFReader Code
> > >>
> > >> ·   Create basic design for plugin framework
> > >>
> > >> *May 28 - Jun 08*
> > >>
> > >>                Discuss plugin framework design with community
> > >>
> > >>  *Jun 08 - Jun 18*
> > >>
> > >>                Implement plugin framework
> > >>
> > >>  *Jun 18 - Jun 25*
> > >>
> > >>                Code some basic examples and test cases for plugin
> > >> framework
> > >>
> > >>  *Jun 25 - Jul 02*
> > >>
> > >>                Implement basic zooming and rotating features
> > >>
> > >>  *Jul 02 - Jul 09*
> > >>
> > >>                Implement page thumbnail viewer
> > >>
> > >>  *Jul 09 - Jul 11*
> > >>
> > >>                Study and Create design for Object Inspector
> > >>
> > >>  *Jul 12 - Jul 13*
> > >>
> > >>                Submit mid-term evaluation
> > >>
> > >>  *Jul 13 - Jul 23*
> > >>
> > >>                Implement PDF Object Inspector
> > >>
> > >>  *Jul 23 - Jul 30*
> > >>
> > >>                Implement export PDF feature
> > >>
> > >>  *Jul 30 - Aug 06*
> > >>
> > >>                Implement some secondary objectives
> > >>
> > >>  *Aug 06 - Aug 13*
> > >>
> > >>                Test, debug the code and complete documentation
> > >>
> > >>  *Aug 13 - Aug 20*
> > >>
> > >>                Submit the design and conclusion
> > >>
> > >>
> > >>
> > >> *Exams and other commitments*
> > >>
> > >> I have semester end examination of University from 28th May to 8th
> June.
> > >> After that I can work fulltime (40+ hours per week) on this project
> > until
> > >> end.
> > >>
> > >>
> > >>
> > >>
> > >> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <me...@gmail.com>
> > >> wrote:
> > >>
> > >> > >
> > >> > >
> > >> > > <snip>
> > >> > > How about just implementing the "add text" feature? That should be
> > >> > > relatively easy. Add a new optional content group to the page and
> > add
> > >> > > all text as part of that. That makes this a cool tool to fill out
> > >> forms
> > >> > > that were scanned, or that are not AcroForms. I fully agree that
> the
> > >> > > replace feature is going to be tough, relatively speaking.
> > >> > >
> > >> >
> > >> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to
> > agree
> > >> > with Andreas. If we implemented deleting text, it could get pretty
> > >> tricksy,
> > >> > especially when CID coded fonts are used. This is a perfect
> candidate
> > >> of a
> > >> > secondary objective.
> > >> >
> > >> >
> > >> > > I wouldn't put the bar too high for this project. Get some
> > low-hanging
> > >> > > fruit that adds some cool and useful functionality to the GUI.
> More
> > >> can
> > >> > > always be added later.
> > >> > >
> > >> > > More ideas for low-hanging fruit (as additional tasks if you end
> up
> > >> with
> > >> > > spare time after the main objectives above):
> > >> > > - Allow the "Save As..." menu item not only save a modified PDF,
> but
> > >> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
> > >> > extraction).
> > >> > > That would make a plug-in for each output format, so we could add
> > >> others
> > >> > > later.
> > >> > > - Add a "Print..." menu item.
> > >> > > - Integrate preflight as a plug-in into the application.
> > >> > > - Split, concatenate, page reorder, page rotation (not just
> viewing
> > >> but
> > >> > > modifying the PDF), overlay/underlay.
> > >> > > - Menu item: "Create PDF from image(s)" and "Add page from image".
> > >> > > - A dialog/view listing the fonts in the PDF.
> > >> > > - A dialog/view for the basic PDF properties like in Acrobat's
> > >> > > Properties Dialog.
> > >> > > - A dialog/view listing all images with their properties and
> > effective
> > >> > > resolution in the PDF.
> > >> > > - I could go on and on... :-) I'm seeing a serious itch going to
> > catch
> > >> > > me once you've set up a good base for that GUI. I'm so looking
> > forward
> > >> > > to this!
> > >> > >
> > >> > > Anyway, points 1 and 2 will be the most important tasks because
> they
> > >> > > provide the basis for easily adding additional functionality.
> > >> > >
> > >> >
> > >> > Yeah, those look like good tasks, maybe we'll add the two you
> suggest
> > to
> > >> > the primary objectives and the rest into secondary or even tertiary
> > >> > objectives. It's good to see you getting geared up for this
> Jeremias,
> > >> we're
> > >> > no doubt going to call on your expertise ;-).
> > >> >
> > >> > Thanks for the support
> > >> >
> > >> > Mehdi
> > >> >
> > >>
> > >>
> > >>
> > >> --
> > >> Thanks & Regards,
> > >> Tharaka Wijebandara,
> > >> Faculty of Information Technology,
> > >> University of Moratuwa.
> > >>
> > >
> > >
> >
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by Dexter Mishra <de...@gmail.com>.
Well Still who ever wants want to work can work on that. It will be their
credit. GSc is not the only way.

On Tue, Apr 17, 2012 at 12:24 PM, mehdi houshmand <me...@gmail.com> wrote:

> Ok, well, disappointingly we've missed the deadline to submit a proposal
> and review said proposal. Thanks for all the support from everyone, sorry
> we couldn't arrange it this year.
>
> Mehdi
>
> On 10 April 2012 08:22, mehdi houshmand <me...@gmail.com> wrote:
>
> > Hi Tharaka,
> >
> > Sorry for the slow response, my bad, I don't have any worthwhile excuse
> > other than it's holiday season over here. As for the proposal, thats
> fine,
> > I'd be happy with that proposal. If no one else has any comments on this,
> > I'd suggest moving writing this up on google-melange and getting the ball
> > rolling.
> >
> > Keep up the good work,
> >
> > Mehdi
> >
> >
> > On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara <
> > tharaka.nw@gmail.com> wrote:
> >
> >> Hi,
> >>
> >> Thank you very much for your feedbacks and suggestions.
> >>
> >> After going through all of your feedbacks, I modified the proposal with
> >> them and used some my own ideas also. Additionally, I included the
> >> proposed
> >> time line of the project and I would like to see your comments here.
> >>
> >>
> >> *Project Title*
> >>
> >> Implement PDF object inspector and enhance features of PDFReader GUI.
> >>
> >>
> >> *Abstract*
> >>
> >> PDFReader is small GUI application which comes with Apache PDFBox pure
> >> java
> >> PDF library. Currently it is only capable of rendering PDFs. This
> project
> >> is to redesign PDFReader GUI with a plugin framework and implement some
> >> developer oriented features for it. Primary objectives of project are as
> >> follows.
> >>
> >> (1). Design a plugin framework for the PDFReader.
> >>
> >> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> >> thumbnail viewer)
> >>
> >> (3). Implement a PDF object inspector to display PDF object properties.
> >>
> >> (4). Implement Export command for converting PDFs into different file
> >> formats (JPEG, PNG, TXT).
> >>
> >>
> >>
> >> *Project Details*
> >>
> >> *
> >> *
> >>
> >> PDFBox already has so many features required, but currently most of them
> >> only available for developers and from command line. Aim of this project
> >> is
> >> extending PDFReader into well designed application which allows access
> to
> >> some PDFBox features through GUI and integrating some developer oriented
> >> features. However, project itself is not going to consider GUI access to
> >> all available PDFBox features. Instead of that it re-architects
> PDFReader
> >> into pluggable application which developer can easily integrate other
> >> features by developing plugins.
> >>
> >>
> >> In that case, most important primary objective of the project is
> designing
> >> a simple plugin framework for the PDFReader. In the implementation, we
> can
> >> use JAR Service Provider mechanism as the base of the plugin framework.
> >> For
> >> that first, it's required to identify what are pluggable classes which
> is
> >> going to use in the application and define interfaces for those classes
> >> with the basic methods. Then we can implement some mechanism for plugin
> >> developer to define what are the classes to use, and where (which
> >> JAR/directory) to load them from. When application loads, it will check
> >> specific locations for plugins and create instance of those plugins
> using
> >> class loader. Here after application can use those plugin instances as
> >> components of application itself.
> >>
> >>
> >> Rotating and zooming functions are basic features which is available in
> >> every PDF viewer and it allow user to view PDF pages from different
> >> perspectives. However current PDFReader is unable to provide those
> >> features. This project will address to this problem as part of second
> >> primary objective by implementing some menu commands for those features.
> >> In
> >> the implementation, it will be matter of changing some codes in
> PageDrawer
> >> and other related class which are responsible to render PDF pages using
> >> Java2D.
> >>
> >>
> >> Other interesting feature which is going to archive as another part of
> >> second objective is developing page thumbnail viewer for the PDFReader.
> >> This will be cool GUI component for user to navigate through the PDF
> >> Document very easily. It will be image list which has small image of the
> >> every PDF page of the current document with page numbers. User can
> simply
> >> click on the image to jump the particular page. This component will be
> >> developed as plugin for PDFReader and user will be able to show and hide
> >> using the menu command.
> >>
> >>
> >> Third objective is implementing an object inspector for PDFReader and it
> >> will be very much useful to developer who wants to debug PDF documents.
> >> User can enable click one the any object on the PDF viewer and then in a
> >> small window, it will show the information about particular object while
> >> highlighting the object inside the viewer. As an example if user click
> on
> >> image of the PDF it will show the information such as image size, color
> >> space, resolution and bit depth. If object is text, it will show the
> >> details about font, color space and color values. In implementation this
> >> also will be developed as another plugin for the PDFReader.
> >>
> >>
> >> Export feature is last primary objective which I'm going to implement
> >> within this project and it also will be developed as several plugins. As
> >> most of the available software, it's good idea to add 'Export' menu item
> >> into file menu. This menu item will have submenu which contains
> different
> >> commands for different file formats (JPEG, PNG, TXT) and each of those
> >> commands will popup small dialog box which allows user to change export
> >> settings such as page range, password, encoding type, file name and
> >> location. After change those settings, user can simply click 'OK' button
> >> of
> >> the settings dialog box to start the converting process. Every command
> for
> >> particular file format will be developed as a plugin and in future
> >> developers can easily integrate commands for new file formats to convert
> >> PDFs.
> >>
> >>
> >> In addition to these primary objectives there are few secondary
> objectives
> >> which I'm going to consider with this project and major one will be the
> >> 'Add Text' feature. It will allow user to draw textbox in any place of
> the
> >> PDF page and type new text. There will be Small window or view will
> appear
> >> in this mode and user can change the font size, color and other related
> >> properties. These new text objects will added to PDF page by inserting
> new
> >> optional content group to the page and in case user wants to remove or
> >> edit
> >> this newly added text, it's possible before saving PDF permanently.
> >>
> >>
> >>
> >> Add 'print' menu item, develop plugin for mange (add, edit, delete)
> >> bookmarks and View for the basic PDF properties will be the set of
> another
> >> secondary objectives which I'm going to look at within this project.
> Since
> >> most of them are already available in command line, it will be easy to
> add
> >> those features as plugins for GUI if there will be enough time after
> >> achieving primary objectives.
> >>
> >>
> >>
> >> *Benefits to PDFBox*
> >>
> >> Most of the modern code libraries are delivered with GUI application to
> >> test the features before studying API. However, features of PDFBox are
> >> mostly accessible through the command line interface and it discourages
> >> users and new developers from using PDFBox. By having GUI for access
> these
> >> features any user can simply use them and developer can test the
> features
> >> without going through the code. Since designing plugin framework is a
> >> primary objective of the project, developers will be able to easily
> append
> >> new features for GUI in future.
> >>
> >> PDF object inspector will be great tool for developer and it will make
> >> easy
> >> debugging PDFs. Furthermore, it will help to new developers to study the
> >> structure of PDF file and understanding PDFBox core functionalities.
> >>
> >>
> >>
> >> *Project Timeline and Deliverables*
> >>
> >> *May 21 - May 28*
> >>
> >> ·   Read and understand the current PDFReader Code
> >>
> >> ·   Create basic design for plugin framework
> >>
> >> *May 28 - Jun 08*
> >>
> >>                Discuss plugin framework design with community
> >>
> >>  *Jun 08 - Jun 18*
> >>
> >>                Implement plugin framework
> >>
> >>  *Jun 18 - Jun 25*
> >>
> >>                Code some basic examples and test cases for plugin
> >> framework
> >>
> >>  *Jun 25 - Jul 02*
> >>
> >>                Implement basic zooming and rotating features
> >>
> >>  *Jul 02 - Jul 09*
> >>
> >>                Implement page thumbnail viewer
> >>
> >>  *Jul 09 - Jul 11*
> >>
> >>                Study and Create design for Object Inspector
> >>
> >>  *Jul 12 - Jul 13*
> >>
> >>                Submit mid-term evaluation
> >>
> >>  *Jul 13 - Jul 23*
> >>
> >>                Implement PDF Object Inspector
> >>
> >>  *Jul 23 - Jul 30*
> >>
> >>                Implement export PDF feature
> >>
> >>  *Jul 30 - Aug 06*
> >>
> >>                Implement some secondary objectives
> >>
> >>  *Aug 06 - Aug 13*
> >>
> >>                Test, debug the code and complete documentation
> >>
> >>  *Aug 13 - Aug 20*
> >>
> >>                Submit the design and conclusion
> >>
> >>
> >>
> >> *Exams and other commitments*
> >>
> >> I have semester end examination of University from 28th May to 8th June.
> >> After that I can work fulltime (40+ hours per week) on this project
> until
> >> end.
> >>
> >>
> >>
> >>
> >> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <me...@gmail.com>
> >> wrote:
> >>
> >> > >
> >> > >
> >> > > <snip>
> >> > > How about just implementing the "add text" feature? That should be
> >> > > relatively easy. Add a new optional content group to the page and
> add
> >> > > all text as part of that. That makes this a cool tool to fill out
> >> forms
> >> > > that were scanned, or that are not AcroForms. I fully agree that the
> >> > > replace feature is going to be tough, relatively speaking.
> >> > >
> >> >
> >> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to
> agree
> >> > with Andreas. If we implemented deleting text, it could get pretty
> >> tricksy,
> >> > especially when CID coded fonts are used. This is a perfect candidate
> >> of a
> >> > secondary objective.
> >> >
> >> >
> >> > > I wouldn't put the bar too high for this project. Get some
> low-hanging
> >> > > fruit that adds some cool and useful functionality to the GUI. More
> >> can
> >> > > always be added later.
> >> > >
> >> > > More ideas for low-hanging fruit (as additional tasks if you end up
> >> with
> >> > > spare time after the main objectives above):
> >> > > - Allow the "Save As..." menu item not only save a modified PDF, but
> >> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
> >> > extraction).
> >> > > That would make a plug-in for each output format, so we could add
> >> others
> >> > > later.
> >> > > - Add a "Print..." menu item.
> >> > > - Integrate preflight as a plug-in into the application.
> >> > > - Split, concatenate, page reorder, page rotation (not just viewing
> >> but
> >> > > modifying the PDF), overlay/underlay.
> >> > > - Menu item: "Create PDF from image(s)" and "Add page from image".
> >> > > - A dialog/view listing the fonts in the PDF.
> >> > > - A dialog/view for the basic PDF properties like in Acrobat's
> >> > > Properties Dialog.
> >> > > - A dialog/view listing all images with their properties and
> effective
> >> > > resolution in the PDF.
> >> > > - I could go on and on... :-) I'm seeing a serious itch going to
> catch
> >> > > me once you've set up a good base for that GUI. I'm so looking
> forward
> >> > > to this!
> >> > >
> >> > > Anyway, points 1 and 2 will be the most important tasks because they
> >> > > provide the basis for easily adding additional functionality.
> >> > >
> >> >
> >> > Yeah, those look like good tasks, maybe we'll add the two you suggest
> to
> >> > the primary objectives and the rest into secondary or even tertiary
> >> > objectives. It's good to see you getting geared up for this Jeremias,
> >> we're
> >> > no doubt going to call on your expertise ;-).
> >> >
> >> > Thanks for the support
> >> >
> >> > Mehdi
> >> >
> >>
> >>
> >>
> >> --
> >> Thanks & Regards,
> >> Tharaka Wijebandara,
> >> Faculty of Information Technology,
> >> University of Moratuwa.
> >>
> >
> >
>

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Ok, well, disappointingly we've missed the deadline to submit a proposal
and review said proposal. Thanks for all the support from everyone, sorry
we couldn't arrange it this year.

Mehdi

On 10 April 2012 08:22, mehdi houshmand <me...@gmail.com> wrote:

> Hi Tharaka,
>
> Sorry for the slow response, my bad, I don't have any worthwhile excuse
> other than it's holiday season over here. As for the proposal, thats fine,
> I'd be happy with that proposal. If no one else has any comments on this,
> I'd suggest moving writing this up on google-melange and getting the ball
> rolling.
>
> Keep up the good work,
>
> Mehdi
>
>
> On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara <
> tharaka.nw@gmail.com> wrote:
>
>> Hi,
>>
>> Thank you very much for your feedbacks and suggestions.
>>
>> After going through all of your feedbacks, I modified the proposal with
>> them and used some my own ideas also. Additionally, I included the
>> proposed
>> time line of the project and I would like to see your comments here.
>>
>>
>> *Project Title*
>>
>> Implement PDF object inspector and enhance features of PDFReader GUI.
>>
>>
>> *Abstract*
>>
>> PDFReader is small GUI application which comes with Apache PDFBox pure
>> java
>> PDF library. Currently it is only capable of rendering PDFs. This project
>> is to redesign PDFReader GUI with a plugin framework and implement some
>> developer oriented features for it. Primary objectives of project are as
>> follows.
>>
>> (1). Design a plugin framework for the PDFReader.
>>
>> (2). Implement PDF basic viewing features. (Zooming, rotating, page
>> thumbnail viewer)
>>
>> (3). Implement a PDF object inspector to display PDF object properties.
>>
>> (4). Implement Export command for converting PDFs into different file
>> formats (JPEG, PNG, TXT).
>>
>>
>>
>> *Project Details*
>>
>> *
>> *
>>
>> PDFBox already has so many features required, but currently most of them
>> only available for developers and from command line. Aim of this project
>> is
>> extending PDFReader into well designed application which allows access to
>> some PDFBox features through GUI and integrating some developer oriented
>> features. However, project itself is not going to consider GUI access to
>> all available PDFBox features. Instead of that it re-architects PDFReader
>> into pluggable application which developer can easily integrate other
>> features by developing plugins.
>>
>>
>> In that case, most important primary objective of the project is designing
>> a simple plugin framework for the PDFReader. In the implementation, we can
>> use JAR Service Provider mechanism as the base of the plugin framework.
>> For
>> that first, it's required to identify what are pluggable classes which is
>> going to use in the application and define interfaces for those classes
>> with the basic methods. Then we can implement some mechanism for plugin
>> developer to define what are the classes to use, and where (which
>> JAR/directory) to load them from. When application loads, it will check
>> specific locations for plugins and create instance of those plugins using
>> class loader. Here after application can use those plugin instances as
>> components of application itself.
>>
>>
>> Rotating and zooming functions are basic features which is available in
>> every PDF viewer and it allow user to view PDF pages from different
>> perspectives. However current PDFReader is unable to provide those
>> features. This project will address to this problem as part of second
>> primary objective by implementing some menu commands for those features.
>> In
>> the implementation, it will be matter of changing some codes in PageDrawer
>> and other related class which are responsible to render PDF pages using
>> Java2D.
>>
>>
>> Other interesting feature which is going to archive as another part of
>> second objective is developing page thumbnail viewer for the PDFReader.
>> This will be cool GUI component for user to navigate through the PDF
>> Document very easily. It will be image list which has small image of the
>> every PDF page of the current document with page numbers. User can simply
>> click on the image to jump the particular page. This component will be
>> developed as plugin for PDFReader and user will be able to show and hide
>> using the menu command.
>>
>>
>> Third objective is implementing an object inspector for PDFReader and it
>> will be very much useful to developer who wants to debug PDF documents.
>> User can enable click one the any object on the PDF viewer and then in a
>> small window, it will show the information about particular object while
>> highlighting the object inside the viewer. As an example if user click on
>> image of the PDF it will show the information such as image size, color
>> space, resolution and bit depth. If object is text, it will show the
>> details about font, color space and color values. In implementation this
>> also will be developed as another plugin for the PDFReader.
>>
>>
>> Export feature is last primary objective which I'm going to implement
>> within this project and it also will be developed as several plugins. As
>> most of the available software, it's good idea to add 'Export' menu item
>> into file menu. This menu item will have submenu which contains different
>> commands for different file formats (JPEG, PNG, TXT) and each of those
>> commands will popup small dialog box which allows user to change export
>> settings such as page range, password, encoding type, file name and
>> location. After change those settings, user can simply click 'OK' button
>> of
>> the settings dialog box to start the converting process. Every command for
>> particular file format will be developed as a plugin and in future
>> developers can easily integrate commands for new file formats to convert
>> PDFs.
>>
>>
>> In addition to these primary objectives there are few secondary objectives
>> which I'm going to consider with this project and major one will be the
>> 'Add Text' feature. It will allow user to draw textbox in any place of the
>> PDF page and type new text. There will be Small window or view will appear
>> in this mode and user can change the font size, color and other related
>> properties. These new text objects will added to PDF page by inserting new
>> optional content group to the page and in case user wants to remove or
>> edit
>> this newly added text, it's possible before saving PDF permanently.
>>
>>
>>
>> Add 'print' menu item, develop plugin for mange (add, edit, delete)
>> bookmarks and View for the basic PDF properties will be the set of another
>> secondary objectives which I'm going to look at within this project. Since
>> most of them are already available in command line, it will be easy to add
>> those features as plugins for GUI if there will be enough time after
>> achieving primary objectives.
>>
>>
>>
>> *Benefits to PDFBox*
>>
>> Most of the modern code libraries are delivered with GUI application to
>> test the features before studying API. However, features of PDFBox are
>> mostly accessible through the command line interface and it discourages
>> users and new developers from using PDFBox. By having GUI for access these
>> features any user can simply use them and developer can test the features
>> without going through the code. Since designing plugin framework is a
>> primary objective of the project, developers will be able to easily append
>> new features for GUI in future.
>>
>> PDF object inspector will be great tool for developer and it will make
>> easy
>> debugging PDFs. Furthermore, it will help to new developers to study the
>> structure of PDF file and understanding PDFBox core functionalities.
>>
>>
>>
>> *Project Timeline and Deliverables*
>>
>> *May 21 - May 28*
>>
>> ·   Read and understand the current PDFReader Code
>>
>> ·   Create basic design for plugin framework
>>
>> *May 28 - Jun 08*
>>
>>                Discuss plugin framework design with community
>>
>>  *Jun 08 - Jun 18*
>>
>>                Implement plugin framework
>>
>>  *Jun 18 - Jun 25*
>>
>>                Code some basic examples and test cases for plugin
>> framework
>>
>>  *Jun 25 - Jul 02*
>>
>>                Implement basic zooming and rotating features
>>
>>  *Jul 02 - Jul 09*
>>
>>                Implement page thumbnail viewer
>>
>>  *Jul 09 - Jul 11*
>>
>>                Study and Create design for Object Inspector
>>
>>  *Jul 12 - Jul 13*
>>
>>                Submit mid-term evaluation
>>
>>  *Jul 13 - Jul 23*
>>
>>                Implement PDF Object Inspector
>>
>>  *Jul 23 - Jul 30*
>>
>>                Implement export PDF feature
>>
>>  *Jul 30 - Aug 06*
>>
>>                Implement some secondary objectives
>>
>>  *Aug 06 - Aug 13*
>>
>>                Test, debug the code and complete documentation
>>
>>  *Aug 13 - Aug 20*
>>
>>                Submit the design and conclusion
>>
>>
>>
>> *Exams and other commitments*
>>
>> I have semester end examination of University from 28th May to 8th June.
>> After that I can work fulltime (40+ hours per week) on this project until
>> end.
>>
>>
>>
>>
>> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <me...@gmail.com>
>> wrote:
>>
>> > >
>> > >
>> > > <snip>
>> > > How about just implementing the "add text" feature? That should be
>> > > relatively easy. Add a new optional content group to the page and add
>> > > all text as part of that. That makes this a cool tool to fill out
>> forms
>> > > that were scanned, or that are not AcroForms. I fully agree that the
>> > > replace feature is going to be tough, relatively speaking.
>> > >
>> >
>> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to agree
>> > with Andreas. If we implemented deleting text, it could get pretty
>> tricksy,
>> > especially when CID coded fonts are used. This is a perfect candidate
>> of a
>> > secondary objective.
>> >
>> >
>> > > I wouldn't put the bar too high for this project. Get some low-hanging
>> > > fruit that adds some cool and useful functionality to the GUI. More
>> can
>> > > always be added later.
>> > >
>> > > More ideas for low-hanging fruit (as additional tasks if you end up
>> with
>> > > spare time after the main objectives above):
>> > > - Allow the "Save As..." menu item not only save a modified PDF, but
>> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
>> > extraction).
>> > > That would make a plug-in for each output format, so we could add
>> others
>> > > later.
>> > > - Add a "Print..." menu item.
>> > > - Integrate preflight as a plug-in into the application.
>> > > - Split, concatenate, page reorder, page rotation (not just viewing
>> but
>> > > modifying the PDF), overlay/underlay.
>> > > - Menu item: "Create PDF from image(s)" and "Add page from image".
>> > > - A dialog/view listing the fonts in the PDF.
>> > > - A dialog/view for the basic PDF properties like in Acrobat's
>> > > Properties Dialog.
>> > > - A dialog/view listing all images with their properties and effective
>> > > resolution in the PDF.
>> > > - I could go on and on... :-) I'm seeing a serious itch going to catch
>> > > me once you've set up a good base for that GUI. I'm so looking forward
>> > > to this!
>> > >
>> > > Anyway, points 1 and 2 will be the most important tasks because they
>> > > provide the basis for easily adding additional functionality.
>> > >
>> >
>> > Yeah, those look like good tasks, maybe we'll add the two you suggest to
>> > the primary objectives and the rest into secondary or even tertiary
>> > objectives. It's good to see you getting geared up for this Jeremias,
>> we're
>> > no doubt going to call on your expertise ;-).
>> >
>> > Thanks for the support
>> >
>> > Mehdi
>> >
>>
>>
>>
>> --
>> Thanks & Regards,
>> Tharaka Wijebandara,
>> Faculty of Information Technology,
>> University of Moratuwa.
>>
>
>

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

Sorry for the slow response, my bad, I don't have any worthwhile excuse
other than it's holiday season over here. As for the proposal, thats fine,
I'd be happy with that proposal. If no one else has any comments on this,
I'd suggest moving writing this up on google-melange and getting the ball
rolling.

Keep up the good work,

Mehdi

On 5 April 2012 08:45, Tharaka Nayanajith Wijebandara
<th...@gmail.com>wrote:

> Hi,
>
> Thank you very much for your feedbacks and suggestions.
>
> After going through all of your feedbacks, I modified the proposal with
> them and used some my own ideas also. Additionally, I included the proposed
> time line of the project and I would like to see your comments here.
>
>
> *Project Title*
>
> Implement PDF object inspector and enhance features of PDFReader GUI.
>
>
> *Abstract*
>
> PDFReader is small GUI application which comes with Apache PDFBox pure java
> PDF library. Currently it is only capable of rendering PDFs. This project
> is to redesign PDFReader GUI with a plugin framework and implement some
> developer oriented features for it. Primary objectives of project are as
> follows.
>
> (1). Design a plugin framework for the PDFReader.
>
> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> thumbnail viewer)
>
> (3). Implement a PDF object inspector to display PDF object properties.
>
> (4). Implement Export command for converting PDFs into different file
> formats (JPEG, PNG, TXT).
>
>
>
> *Project Details*
>
> *
> *
>
> PDFBox already has so many features required, but currently most of them
> only available for developers and from command line. Aim of this project is
> extending PDFReader into well designed application which allows access to
> some PDFBox features through GUI and integrating some developer oriented
> features. However, project itself is not going to consider GUI access to
> all available PDFBox features. Instead of that it re-architects PDFReader
> into pluggable application which developer can easily integrate other
> features by developing plugins.
>
>
> In that case, most important primary objective of the project is designing
> a simple plugin framework for the PDFReader. In the implementation, we can
> use JAR Service Provider mechanism as the base of the plugin framework. For
> that first, it's required to identify what are pluggable classes which is
> going to use in the application and define interfaces for those classes
> with the basic methods. Then we can implement some mechanism for plugin
> developer to define what are the classes to use, and where (which
> JAR/directory) to load them from. When application loads, it will check
> specific locations for plugins and create instance of those plugins using
> class loader. Here after application can use those plugin instances as
> components of application itself.
>
>
> Rotating and zooming functions are basic features which is available in
> every PDF viewer and it allow user to view PDF pages from different
> perspectives. However current PDFReader is unable to provide those
> features. This project will address to this problem as part of second
> primary objective by implementing some menu commands for those features. In
> the implementation, it will be matter of changing some codes in PageDrawer
> and other related class which are responsible to render PDF pages using
> Java2D.
>
>
> Other interesting feature which is going to archive as another part of
> second objective is developing page thumbnail viewer for the PDFReader.
> This will be cool GUI component for user to navigate through the PDF
> Document very easily. It will be image list which has small image of the
> every PDF page of the current document with page numbers. User can simply
> click on the image to jump the particular page. This component will be
> developed as plugin for PDFReader and user will be able to show and hide
> using the menu command.
>
>
> Third objective is implementing an object inspector for PDFReader and it
> will be very much useful to developer who wants to debug PDF documents.
> User can enable click one the any object on the PDF viewer and then in a
> small window, it will show the information about particular object while
> highlighting the object inside the viewer. As an example if user click on
> image of the PDF it will show the information such as image size, color
> space, resolution and bit depth. If object is text, it will show the
> details about font, color space and color values. In implementation this
> also will be developed as another plugin for the PDFReader.
>
>
> Export feature is last primary objective which I'm going to implement
> within this project and it also will be developed as several plugins. As
> most of the available software, it's good idea to add 'Export' menu item
> into file menu. This menu item will have submenu which contains different
> commands for different file formats (JPEG, PNG, TXT) and each of those
> commands will popup small dialog box which allows user to change export
> settings such as page range, password, encoding type, file name and
> location. After change those settings, user can simply click 'OK' button of
> the settings dialog box to start the converting process. Every command for
> particular file format will be developed as a plugin and in future
> developers can easily integrate commands for new file formats to convert
> PDFs.
>
>
> In addition to these primary objectives there are few secondary objectives
> which I'm going to consider with this project and major one will be the
> 'Add Text' feature. It will allow user to draw textbox in any place of the
> PDF page and type new text. There will be Small window or view will appear
> in this mode and user can change the font size, color and other related
> properties. These new text objects will added to PDF page by inserting new
> optional content group to the page and in case user wants to remove or edit
> this newly added text, it's possible before saving PDF permanently.
>
>
>
> Add 'print' menu item, develop plugin for mange (add, edit, delete)
> bookmarks and View for the basic PDF properties will be the set of another
> secondary objectives which I'm going to look at within this project. Since
> most of them are already available in command line, it will be easy to add
> those features as plugins for GUI if there will be enough time after
> achieving primary objectives.
>
>
>
> *Benefits to PDFBox*
>
> Most of the modern code libraries are delivered with GUI application to
> test the features before studying API. However, features of PDFBox are
> mostly accessible through the command line interface and it discourages
> users and new developers from using PDFBox. By having GUI for access these
> features any user can simply use them and developer can test the features
> without going through the code. Since designing plugin framework is a
> primary objective of the project, developers will be able to easily append
> new features for GUI in future.
>
> PDF object inspector will be great tool for developer and it will make easy
> debugging PDFs. Furthermore, it will help to new developers to study the
> structure of PDF file and understanding PDFBox core functionalities.
>
>
>
> *Project Timeline and Deliverables*
>
> *May 21 - May 28*
>
> ·   Read and understand the current PDFReader Code
>
> ·   Create basic design for plugin framework
>
> *May 28 - Jun 08*
>
>                Discuss plugin framework design with community
>
>  *Jun 08 - Jun 18*
>
>                Implement plugin framework
>
>  *Jun 18 - Jun 25*
>
>                Code some basic examples and test cases for plugin framework
>
>  *Jun 25 - Jul 02*
>
>                Implement basic zooming and rotating features
>
>  *Jul 02 - Jul 09*
>
>                Implement page thumbnail viewer
>
>  *Jul 09 - Jul 11*
>
>                Study and Create design for Object Inspector
>
>  *Jul 12 - Jul 13*
>
>                Submit mid-term evaluation
>
>  *Jul 13 - Jul 23*
>
>                Implement PDF Object Inspector
>
>  *Jul 23 - Jul 30*
>
>                Implement export PDF feature
>
>  *Jul 30 - Aug 06*
>
>                Implement some secondary objectives
>
>  *Aug 06 - Aug 13*
>
>                Test, debug the code and complete documentation
>
>  *Aug 13 - Aug 20*
>
>                Submit the design and conclusion
>
>
>
> *Exams and other commitments*
>
> I have semester end examination of University from 28th May to 8th June.
> After that I can work fulltime (40+ hours per week) on this project until
> end.
>
>
>
>
> On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <me...@gmail.com>
> wrote:
>
> > >
> > >
> > > <snip>
> > > How about just implementing the "add text" feature? That should be
> > > relatively easy. Add a new optional content group to the page and add
> > > all text as part of that. That makes this a cool tool to fill out forms
> > > that were scanned, or that are not AcroForms. I fully agree that the
> > > replace feature is going to be tough, relatively speaking.
> > >
> >
> > Yeah, my bad here, I shouldn't have missed it but I'm inclined to agree
> > with Andreas. If we implemented deleting text, it could get pretty
> tricksy,
> > especially when CID coded fonts are used. This is a perfect candidate of
> a
> > secondary objective.
> >
> >
> > > I wouldn't put the bar too high for this project. Get some low-hanging
> > > fruit that adds some cool and useful functionality to the GUI. More can
> > > always be added later.
> > >
> > > More ideas for low-hanging fruit (as additional tasks if you end up
> with
> > > spare time after the main objectives above):
> > > - Allow the "Save As..." menu item not only save a modified PDF, but
> > > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
> > extraction).
> > > That would make a plug-in for each output format, so we could add
> others
> > > later.
> > > - Add a "Print..." menu item.
> > > - Integrate preflight as a plug-in into the application.
> > > - Split, concatenate, page reorder, page rotation (not just viewing but
> > > modifying the PDF), overlay/underlay.
> > > - Menu item: "Create PDF from image(s)" and "Add page from image".
> > > - A dialog/view listing the fonts in the PDF.
> > > - A dialog/view for the basic PDF properties like in Acrobat's
> > > Properties Dialog.
> > > - A dialog/view listing all images with their properties and effective
> > > resolution in the PDF.
> > > - I could go on and on... :-) I'm seeing a serious itch going to catch
> > > me once you've set up a good base for that GUI. I'm so looking forward
> > > to this!
> > >
> > > Anyway, points 1 and 2 will be the most important tasks because they
> > > provide the basis for easily adding additional functionality.
> > >
> >
> > Yeah, those look like good tasks, maybe we'll add the two you suggest to
> > the primary objectives and the rest into secondary or even tertiary
> > objectives. It's good to see you getting geared up for this Jeremias,
> we're
> > no doubt going to call on your expertise ;-).
> >
> > Thanks for the support
> >
> > Mehdi
> >
>
>
>
> --
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.
>

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,

Thank you very much for your feedbacks and suggestions.

After going through all of your feedbacks, I modified the proposal with
them and used some my own ideas also. Additionally, I included the proposed
time line of the project and I would like to see your comments here.


*Project Title*

Implement PDF object inspector and enhance features of PDFReader GUI.


*Abstract*

PDFReader is small GUI application which comes with Apache PDFBox pure java
PDF library. Currently it is only capable of rendering PDFs. This project
is to redesign PDFReader GUI with a plugin framework and implement some
developer oriented features for it. Primary objectives of project are as
follows.

(1). Design a plugin framework for the PDFReader.

(2). Implement PDF basic viewing features. (Zooming, rotating, page
thumbnail viewer)

(3). Implement a PDF object inspector to display PDF object properties.

(4). Implement Export command for converting PDFs into different file
formats (JPEG, PNG, TXT).



*Project Details*

*
*

PDFBox already has so many features required, but currently most of them
only available for developers and from command line. Aim of this project is
extending PDFReader into well designed application which allows access to
some PDFBox features through GUI and integrating some developer oriented
features. However, project itself is not going to consider GUI access to
all available PDFBox features. Instead of that it re-architects PDFReader
into pluggable application which developer can easily integrate other
features by developing plugins.


In that case, most important primary objective of the project is designing
a simple plugin framework for the PDFReader. In the implementation, we can
use JAR Service Provider mechanism as the base of the plugin framework. For
that first, it's required to identify what are pluggable classes which is
going to use in the application and define interfaces for those classes
with the basic methods. Then we can implement some mechanism for plugin
developer to define what are the classes to use, and where (which
JAR/directory) to load them from. When application loads, it will check
specific locations for plugins and create instance of those plugins using
class loader. Here after application can use those plugin instances as
components of application itself.


Rotating and zooming functions are basic features which is available in
every PDF viewer and it allow user to view PDF pages from different
perspectives. However current PDFReader is unable to provide those
features. This project will address to this problem as part of second
primary objective by implementing some menu commands for those features. In
the implementation, it will be matter of changing some codes in PageDrawer
and other related class which are responsible to render PDF pages using
Java2D.


Other interesting feature which is going to archive as another part of
second objective is developing page thumbnail viewer for the PDFReader.
This will be cool GUI component for user to navigate through the PDF
Document very easily. It will be image list which has small image of the
every PDF page of the current document with page numbers. User can simply
click on the image to jump the particular page. This component will be
developed as plugin for PDFReader and user will be able to show and hide
using the menu command.


Third objective is implementing an object inspector for PDFReader and it
will be very much useful to developer who wants to debug PDF documents.
User can enable click one the any object on the PDF viewer and then in a
small window, it will show the information about particular object while
highlighting the object inside the viewer. As an example if user click on
image of the PDF it will show the information such as image size, color
space, resolution and bit depth. If object is text, it will show the
details about font, color space and color values. In implementation this
also will be developed as another plugin for the PDFReader.


Export feature is last primary objective which I'm going to implement
within this project and it also will be developed as several plugins. As
most of the available software, it's good idea to add 'Export' menu item
into file menu. This menu item will have submenu which contains different
commands for different file formats (JPEG, PNG, TXT) and each of those
commands will popup small dialog box which allows user to change export
settings such as page range, password, encoding type, file name and
location. After change those settings, user can simply click 'OK' button of
the settings dialog box to start the converting process. Every command for
particular file format will be developed as a plugin and in future
developers can easily integrate commands for new file formats to convert
PDFs.


In addition to these primary objectives there are few secondary objectives
which I'm going to consider with this project and major one will be the
'Add Text' feature. It will allow user to draw textbox in any place of the
PDF page and type new text. There will be Small window or view will appear
in this mode and user can change the font size, color and other related
properties. These new text objects will added to PDF page by inserting new
optional content group to the page and in case user wants to remove or edit
this newly added text, it's possible before saving PDF permanently.



Add 'print' menu item, develop plugin for mange (add, edit, delete)
bookmarks and View for the basic PDF properties will be the set of another
secondary objectives which I'm going to look at within this project. Since
most of them are already available in command line, it will be easy to add
those features as plugins for GUI if there will be enough time after
achieving primary objectives.



*Benefits to PDFBox*

Most of the modern code libraries are delivered with GUI application to
test the features before studying API. However, features of PDFBox are
mostly accessible through the command line interface and it discourages
users and new developers from using PDFBox. By having GUI for access these
features any user can simply use them and developer can test the features
without going through the code. Since designing plugin framework is a
primary objective of the project, developers will be able to easily append
new features for GUI in future.

PDF object inspector will be great tool for developer and it will make easy
debugging PDFs. Furthermore, it will help to new developers to study the
structure of PDF file and understanding PDFBox core functionalities.



*Project Timeline and Deliverables*

*May 21 - May 28*

·   Read and understand the current PDFReader Code

·   Create basic design for plugin framework

*May 28 - Jun 08*

                Discuss plugin framework design with community

 *Jun 08 - Jun 18*

                Implement plugin framework

 *Jun 18 - Jun 25*

                Code some basic examples and test cases for plugin framework

 *Jun 25 - Jul 02*

                Implement basic zooming and rotating features

 *Jul 02 - Jul 09*

                Implement page thumbnail viewer

 *Jul 09 - Jul 11*

                Study and Create design for Object Inspector

 *Jul 12 - Jul 13*

                Submit mid-term evaluation

 *Jul 13 - Jul 23*

                Implement PDF Object Inspector

 *Jul 23 - Jul 30*

                Implement export PDF feature

 *Jul 30 - Aug 06*

                Implement some secondary objectives

 *Aug 06 - Aug 13*

                Test, debug the code and complete documentation

 *Aug 13 - Aug 20*

                Submit the design and conclusion



*Exams and other commitments*

I have semester end examination of University from 28th May to 8th June.
After that I can work fulltime (40+ hours per week) on this project until
end.




On Wed, Apr 4, 2012 at 12:44 PM, mehdi houshmand <me...@gmail.com> wrote:

> >
> >
> > <snip>
> > How about just implementing the "add text" feature? That should be
> > relatively easy. Add a new optional content group to the page and add
> > all text as part of that. That makes this a cool tool to fill out forms
> > that were scanned, or that are not AcroForms. I fully agree that the
> > replace feature is going to be tough, relatively speaking.
> >
>
> Yeah, my bad here, I shouldn't have missed it but I'm inclined to agree
> with Andreas. If we implemented deleting text, it could get pretty tricksy,
> especially when CID coded fonts are used. This is a perfect candidate of a
> secondary objective.
>
>
> > I wouldn't put the bar too high for this project. Get some low-hanging
> > fruit that adds some cool and useful functionality to the GUI. More can
> > always be added later.
> >
> > More ideas for low-hanging fruit (as additional tasks if you end up with
> > spare time after the main objectives above):
> > - Allow the "Save As..." menu item not only save a modified PDF, but
> > also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text
> extraction).
> > That would make a plug-in for each output format, so we could add others
> > later.
> > - Add a "Print..." menu item.
> > - Integrate preflight as a plug-in into the application.
> > - Split, concatenate, page reorder, page rotation (not just viewing but
> > modifying the PDF), overlay/underlay.
> > - Menu item: "Create PDF from image(s)" and "Add page from image".
> > - A dialog/view listing the fonts in the PDF.
> > - A dialog/view for the basic PDF properties like in Acrobat's
> > Properties Dialog.
> > - A dialog/view listing all images with their properties and effective
> > resolution in the PDF.
> > - I could go on and on... :-) I'm seeing a serious itch going to catch
> > me once you've set up a good base for that GUI. I'm so looking forward
> > to this!
> >
> > Anyway, points 1 and 2 will be the most important tasks because they
> > provide the basis for easily adding additional functionality.
> >
>
> Yeah, those look like good tasks, maybe we'll add the two you suggest to
> the primary objectives and the rest into secondary or even tertiary
> objectives. It's good to see you getting geared up for this Jeremias, we're
> no doubt going to call on your expertise ;-).
>
> Thanks for the support
>
> Mehdi
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
>
>
> <snip>
> How about just implementing the "add text" feature? That should be
> relatively easy. Add a new optional content group to the page and add
> all text as part of that. That makes this a cool tool to fill out forms
> that were scanned, or that are not AcroForms. I fully agree that the
> replace feature is going to be tough, relatively speaking.
>

Yeah, my bad here, I shouldn't have missed it but I'm inclined to agree
with Andreas. If we implemented deleting text, it could get pretty tricksy,
especially when CID coded fonts are used. This is a perfect candidate of a
secondary objective.


> I wouldn't put the bar too high for this project. Get some low-hanging
> fruit that adds some cool and useful functionality to the GUI. More can
> always be added later.
>
> More ideas for low-hanging fruit (as additional tasks if you end up with
> spare time after the main objectives above):
> - Allow the "Save As..." menu item not only save a modified PDF, but
> also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text extraction).
> That would make a plug-in for each output format, so we could add others
> later.
> - Add a "Print..." menu item.
> - Integrate preflight as a plug-in into the application.
> - Split, concatenate, page reorder, page rotation (not just viewing but
> modifying the PDF), overlay/underlay.
> - Menu item: "Create PDF from image(s)" and "Add page from image".
> - A dialog/view listing the fonts in the PDF.
> - A dialog/view for the basic PDF properties like in Acrobat's
> Properties Dialog.
> - A dialog/view listing all images with their properties and effective
> resolution in the PDF.
> - I could go on and on... :-) I'm seeing a serious itch going to catch
> me once you've set up a good base for that GUI. I'm so looking forward
> to this!
>
> Anyway, points 1 and 2 will be the most important tasks because they
> provide the basis for easily adding additional functionality.
>

Yeah, those look like good tasks, maybe we'll add the two you suggest to
the primary objectives and the rest into secondary or even tertiary
objectives. It's good to see you getting geared up for this Jeremias, we're
no doubt going to call on your expertise ;-).

Thanks for the support

Mehdi

Re: PDFBox Project for GSoC 2012

Posted by Jeremias Maerki <de...@jeremias-maerki.ch>.
I like it, too. More below...

On 03.04.2012 19:55:40 Andreas Lehmkuehler wrote:
> Hi,
> 
> I like your proposal, comments see below ...
> 
> Am 03.04.2012 02:34, schrieb Tharaka Nayanajith Wijebandara:
> > Hi,
> >
> > I have completed some parts of the the GSoC proposal and I need some
> > feedback from the community about the that. Here a I used swing approach
> > since I have some doubts about RCP. Anyway I also understand RCP will
> > easy approach rather than this and if someone going to help me on that I'm
> > still ready to go in that way. Still I have to prepare 'Project Timeline
> > and Deliverables' part and I will send those part soon.
> >
> > *Project Title*
> >
> > *=============*
> >
> >
> >
> > Implement PDF object inspector and enhance features of PDFReader GUI
> >
> >
> >
> >
> >
> > *Abstract*
> >
> > *========*
> >
> >
> >
> > PDFReader is small GUI application which comes with Apache PDFBox pure java
> > PDF library. Currently it is only capable of rendering PDFs. This project
> > is to redesign PDFReader GUI with plugin framework and implement some
> > developer oriented features for it. Primary objectives of project are as
> > follows.
> >
> >
> >
> > (1). Design a plugin framework for the PDFReader.
> +1

+1

> > (2). Implement PDF basic viewing features. (Zooming, rotating, page
> > thumbnail viewer)
> +1

+1

> > (3). Implement a PDF object inspector to display PDF object properties.
> Could be complicated ...

Possible, so I suggest to stick to the easy targets first. Prioritize.
Getting information about an XObject should be fairly easy and allows to
create the basic infrastructure (finding the bounding box, painting a
selection rectangle etc.) to support additional objects such as text
passages. However, it may be a good idea to switch order/priority between
point 3 and 4.

> > (4). Implement a type-writer feature to add/remove text from a content
> > stream.
> IMO, you should consider to drop this idea. The search/replace feature was 
> discussed many times on the mailing list. As PDF is not a text oriented format, 
> it is even difficult to find text but much more complicated to replace it.

How about just implementing the "add text" feature? That should be
relatively easy. Add a new optional content group to the page and add
all text as part of that. That makes this a cool tool to fill out forms
that were scanned, or that are not AcroForms. I fully agree that the
replace feature is going to be tough, relatively speaking.

I wouldn't put the bar too high for this project. Get some low-hanging
fruit that adds some cool and useful functionality to the GUI. More can
always be added later.

More ideas for low-hanging fruit (as additional tasks if you end up with
spare time after the main objectives above):
- Allow the "Save As..." menu item not only save a modified PDF, but
also bitmaps (PNG, JPEG), plain text and HTML (i.e. from text extraction).
That would make a plug-in for each output format, so we could add others
later.
- Add a "Print..." menu item.
- Integrate preflight as a plug-in into the application.
- Split, concatenate, page reorder, page rotation (not just viewing but
modifying the PDF), overlay/underlay.
- Menu item: "Create PDF from image(s)" and "Add page from image".
- A dialog/view listing the fonts in the PDF.
- A dialog/view for the basic PDF properties like in Acrobat's
Properties Dialog.
- A dialog/view listing all images with their properties and effective
resolution in the PDF.
- I could go on and on... :-) I'm seeing a serious itch going to catch
me once you've set up a good base for that GUI. I'm so looking forward
to this!

Anyway, points 1 and 2 will be the most important tasks because they
provide the basis for easily adding additional functionality.

> <SNIP>
> 
> 
> BR
> Andreas Lehmkühler




Jeremias Maerki


Re: PDFBox Project for GSoC 2012

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

I like your proposal, comments see below ...

Am 03.04.2012 02:34, schrieb Tharaka Nayanajith Wijebandara:
> Hi,
>
> I have completed some parts of the the GSoC proposal and I need some
> feedback from the community about the that. Here a I used swing approach
> since I have some doubts about RCP. Anyway I also understand RCP will
> easy approach rather than this and if someone going to help me on that I'm
> still ready to go in that way. Still I have to prepare 'Project Timeline
> and Deliverables' part and I will send those part soon.
>
> *Project Title*
>
> *=============*
>
>
>
> Implement PDF object inspector and enhance features of PDFReader GUI
>
>
>
>
>
> *Abstract*
>
> *========*
>
>
>
> PDFReader is small GUI application which comes with Apache PDFBox pure java
> PDF library. Currently it is only capable of rendering PDFs. This project
> is to redesign PDFReader GUI with plugin framework and implement some
> developer oriented features for it. Primary objectives of project are as
> follows.
>
>
>
> (1). Design a plugin framework for the PDFReader.
+1

> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> thumbnail viewer)
+1

> (3). Implement a PDF object inspector to display PDF object properties.
Could be complicated ...

> (4). Implement a type-writer feature to add/remove text from a content
> stream.
IMO, you should consider to drop this idea. The search/replace feature was 
discussed many times on the mailing list. As PDF is not a text oriented format, 
it is even difficult to find text but much more complicated to replace it.

<SNIP>


BR
Andreas Lehmkühler

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

I'd say it's looking pretty good, you've got a lot of information there, so
great job!!

I would however say maybe adding some secondary objectives would be a good
idea, just so we explicitly display the intentions of the project. Having
secondary objectives are also good to give you focus if all the primary
objectives are completed. Also, there are a few grammatical errors, this
isn't a big deal at all and I appreciate English may not be your first
language, maybe before you do the final version I'll help you just make a
few corrections.

Good work though, I'm looking forward to commencing this project.

Mehdi

On 3 April 2012 01:34, Tharaka Nayanajith Wijebandara
<th...@gmail.com>wrote:

> Hi,
>
> I have completed some parts of the the GSoC proposal and I need some
> feedback from the community about the that. Here a I used swing approach
> since I have some doubts about RCP. Anyway I also understand RCP will
> easy approach rather than this and if someone going to help me on that I'm
> still ready to go in that way. Still I have to prepare 'Project Timeline
> and Deliverables' part and I will send those part soon.
>
> *Project Title*
>
> *=============*
>
>
>
> Implement PDF object inspector and enhance features of PDFReader GUI
>
>
>
>
>
> *Abstract*
>
> *========*
>
>
>
> PDFReader is small GUI application which comes with Apache PDFBox pure java
> PDF library. Currently it is only capable of rendering PDFs. This project
> is to redesign PDFReader GUI with plugin framework and implement some
> developer oriented features for it. Primary objectives of project are as
> follows.
>
>
>
> (1). Design a plugin framework for the PDFReader.
>
>
>
> (2). Implement PDF basic viewing features. (Zooming, rotating, page
> thumbnail viewer)
>
>
>
> (3). Implement a PDF object inspector to display PDF object properties.
>
>
>
> (4). Implement a type-writer feature to add/remove text from a content
> stream.
>
>
>
>
>
> *Project Details*
>
> *===============*
>
>
>
> PDFBox already has so many features required, but currently most of them
> only available for developers and from the command line. Aim of this
> project is extending PDFReader into well designed application which allows
> access to some PDFBox features through GUI and integrating some developer
> oriented features. However, project itself is not going to consider GUI
> access to all available PDFBox features. Instead of that it re-architects
> PDFReader into pluggable application which developer can easily integrate
> other features by developing plugins.
>
> In that case, most important primary objective of the project is designing
> a simple plugin framework for the PDFReader. In the implementation, we can
> use JAR Service Provider mechanism as the base of the plugin framework. For
> that first, it's required to identify what are pluggable classes which is
> going to use in the application and define interfaces for those classes
> with the basic methods. Then we can implement some mechanism for plugin
> developer to define what are the classes to use, and where (which
> JAR/directory) to load them from. When application loads, it will check
> specific locations for plugins and create instance of those plugins using
> class loader. Here after application can use those plugin instances as
> components of application itself.
>
>
>
> Rotating and zooming functions are basic features which is available in
> every PDF viewer and it allow user to view PDF pages from different
> perspectives. However current PDFReader is unable to provide those
> features. This project will address to this problem as part of second
> primary objective by implementing some menu commands for those features. In
> the implementation, it will be matter of changing some codes in PageDrawer
> and other related class which are responsible to render PDF pages using
> Java2D.
>
>
>
> Other interesting feature which is going to archive as another part of
> second objective is developing page thumbnail viewer for the PDFReader.
> This will be cool GUI component for user to navigate through the PDF
> Document very easily. It will be image list which has small image of the
> every PDF page of the current document with page numbers. User can simply
> click on the image to jump the particular page. This component will be
> developed as plugin for PDFReader and user will be able to show and hide
> using the menu command.
>
>
> Third objective is implementing a object inspector for PDFReader and it
> will be very much useful to developer who want to debug PDF documents. User
> can enable click one the any object on the PDF viewer and then in a small
> window, it will show the information about particular object while
> highlighting the object inside the viewer. As a example if user click on
> image of the PDF it will show the information such as image size, color
> space, resolution and bit depth. If object is text, it will show the
> details about font, color space and color values. In implementation this
> also will be developed as another plugin for the PDFReader.
>
>
> Typewriter feature is last objective which we going to implement within
> this project and it also will be developed as plugin. It will allow user to
> click any place in the PDF and type new text for the PDF page. Small window
> or tool box will appear in this mode and user can change the font size,
> color and other related properties. Furthermore, if user wants to remove
> this newly added text, he can simply select and press 'Delete'.
>
>
>
> *Benefits to PDFBox*
>
> *==================*
>
> Most of the modern code libraries are delivered with GUI application to
> test the features before studying API. However, features of PDFBox are
> mostly accessible through the command line interface and it discourages
> users and new developers from using PDFBox. By having GUI for access these
> features any user can simply use them and developer can test the features
> without going through the code. Since designing plugin framework is a
> primary objective of the project, developers will be able to easily append
> new features for GUI in future.
>
> PDF object inspector will be great tool for developer and it will make easy
> debugging PDFs. Futhermore, it will help to new developers to study the
> structure of PDF file and understanding PDFBox core functionalities.
>
>
> On Sat, Mar 31, 2012 at 11:12 AM, Tharaka Nayanajith Wijebandara <
> tharaka.nw@gmail.com> wrote:
>
> > Hi,
> >
> >
> > Idea about object inspector is clear and I think I can make that dream
> > into reality. ;)
> >
> >
> > I have some doubt regarding RCP. Actually I didn't have any previous
> > experience with RPC and read some articles to get idea about it. As per
> my
> > understanding, although there are several RCPs in our case eclipse RCP is
> > most suitable one. However since RCP Application has several dependencies
> > and they are not in maven structure, I still can't understand how them
> > organize in pdfbox project. Is there any example project for
> understanding
> > it? Maybe I'm misunderstood, need some help from you to clear it.
> >
> >
> > Mehdi, I started to create proposal and I'll put it here within this
> > weekend.
> >
> > On Thu, Mar 29, 2012 at 2:35 PM, mehdi houshmand <med1985@gmail.com
> >wrote:
> >
> >> I think my hopes and dreams are pretty exactly the same as Jeremias'.
> >> These
> >> features would give PDFBox differentiation from Adobe products which
> focus
> >> much more design of PDF rather than nuts and bots debugging. As for the
> >> GUI, I'm really not fussy regarding the UI framework you use, but RCP
> does
> >> look pretty interesting...
> >>
> >> Just as an aside, in terms of structure, I was thinking of this being a
> >> maven sub-project in the same way fontbox and jempbox etc are. I just
> >> wanted to make sure we're all on the same wavelength here...
> >>
> >> Tharaka, I think you've probably got enough information here to create a
> >> proposal, if you want to put a draft up here for us (the community) to
> >> check over feel free to do so. But you're going to need to publish the
> >> proposal on Google melange (
> >> http://www.google-melange.com/gsoc/homepage/google/gsoc2012) so that I
> >> can
> >> agree to mentor it and rate the application.
> >>
> >> Just a bit of admin:
> >> Debian has a good template for student applications (
> >> http://wiki.debian.org/SummerOfCode/StudentApplicationTemplate). Once
> you
> >> fill out the necessary details I will assign myself as the mentor and
> have
> >> to rank your application (
> >> http://community.apache.org/mentee-ranking-process.html). If you want
> to
> >> fair better in the ranking process, involvement in the community is
> highly
> >> regarded by either a) answering user questions on the mailing list
> and/or
> >> b) going through PDFBox bugs and fixing some bugs; this isn't mandatory
> >> but
> >> will improve the ranking on your application.
> >>
> >> Mehdi
> >>
> >>
> >> On 29 March 2012 07:36, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:
> >>
> >> > Hi Tharaka,
> >> >
> >> > Object Inspector: I'm not sure you need a tree for inspecting a single
> >> > object. The tree is useful when the PDF debugger is integrated as an
> >> > alternative view of a PDF. I'd expect to have a customized window (a
> >> > plug-in ;-) ) for each kind of object when inspecting it. A bitmap
> image
> >> > would display information about the size, the color space and profile,
> >> > effective resolution. Clicking on a word would display its font and
> >> > color. I guess it would be quite cool to have a button that says "jump
> >> > to object tree" so it would switch to the debugger view and focus on
> the
> >> > selected object. Imagine having viewer and debugger side-by-side in
> >> > separate views and the object that you click on in the debugger is
> >> > highlighted in the viewer. Wow. But I'm dreaming. ;-)
> >> >
> >> > Unfortunately, JPF is off-limits for PDFBox if it is used for core
> >> > functionality because it's under the LGPL license. Only optional
> >> > components/plug-ins may depend on LGPL libraries. But I think the
> normal
> >> > JAR Service Provider mechanism would already be good enough as a
> plug-in
> >> > framework:
> >> >
> >> >
> >>
> http://docs.oracle.com/javase/6/docs/technotes/guides/jar/jar.html#Service%20Provider
> >> > We use that extensively over in Apache XML Graphics land. I would
> >> > suggest to base the GUI on Java 6 where you have the service lookup
> >> > built in:
> >> > http://docs.oracle.com/javase/6/docs/api/java/util/ServiceLoader.html
> >> >
> >> > Swing with Nimbus generally sounds good to me.
> >> >
> >> > OTOH, an RCP application would actually offer the whole pluggability
> >> > (even at runtime) and an Eclipse-like workbench which would be great
> for
> >> > such an application. But it would also mean practically re-writing
> both
> >> > the PDF viewer (although the PDF painting via Java2D should still be
> >> > usable) and debugger functionality.
> >> >
> >> > Both Swing and RCP are non-trivial to learn. On the technical side I'd
> >> > probably favor RCP just a little bit, mostly because it's OSGi-based
> >> > (I'm biased there) even if the Eclipse world does ugly things to OSGi.
> >> > Furthermore, I know few good Swing applications but many good RCP
> >> > applications. Maybe that's an indicator. But I'd have to learn RCP
> >> > myself if I was to help out with this.
> >> >
> >> > Optional Content Groups (since PDF 1.5) are groups of objects (think
> >> > "layer") that can be enabled and disabled. I think that's advanced
> >> > functionality that the mainstream doesn't really need. But I'm also
> not
> >> > sure what Medhi means with transparency layers.
> >> >
> >> > On 29.03.2012 03:55:28 Tharaka Nayanajith Wijebandara wrote:
> >> > > Hi,
> >> > >
> >> > >
> >> > > Thanx mehdi, you have summarized all primary objective and we can
> >> > continue
> >> > > discussion based on it.
> >> > >
> >> > > For the GUI, I'm going to use Swing Framework and to modernize we
> can
> >> go
> >> > > for some available look and Feel such as nimbus. Is there any
> >> suggestion?
> >> > >
> >> > >
> >> > > I need some ideas about PDF object inspector and since it tool for
> >> > > developer all you can give some suggestions. In my opinion we can
> use
> >> > swing
> >> > > tree for this. User can right click on object and click ‘inspect’
> >> command
> >> > > in the menu, then it will show the object and it properties in the
> >> tree
> >> > and
> >> > > highlight object margin in the PDF view.
> >> > >
> >> > >
> >> > > Plugin frame is another feature to consider. We can use and adapt
> some
> >> > > available plugin frame for PDFReader rather than going for new our
> own
> >> > one.
> >> > > As per my understand Java Plug-in Framework (
> >> http://jpf.sourceforge.net/
> >> > )
> >> > > is good one we can use and it is XML based. Is there any apache
> >> framework
> >> > > for this? If I'm correct, maven project is based on plugin
> framework.
> >> > >
> >> > >
> >> > > Mehdi, I would like to add bookmark feature also in to primary
> >> objective
> >> > > list. It will allow user to view, add, edit and delete book marks.
> And
> >> > also
> >> > > I'm confused about what you mean by supporting optional content and
> >> > > transparency layers.
> >> > >
> >> > > On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <med1985@gmail.com
> >
> >> > wrote:
> >> > >
> >> > > > Hi Tharaka,
> >> > > >
> >> > > > Ok, well let's start with thinking about how this GUI is going to
> >> > look. In
> >> > > > its current form, it's looking a little dated, it might be worth
> >> using
> >> > some
> >> > > > of the GUI frameworks out there to spruce it up a little and
> making
> >> it
> >> > a
> >> > > > bit more modern. So if we get a list of TODOs (starting with
> >> Jeremias'
> >> > > > suggestions):
> >> > > >
> >> > > > - Investigate and design a proposal for upgrading the PDFReader
> GUI
> >> > (i.e.
> >> > > > modernizing it)
> >> > > > - Implement a PDF object inspector to display PDF object
> properties
> >> > when
> >> > > > objects are "clicked-on" in the viewer
> >> > > > - Implement a type-writer feature to add/remove text from a
> content
> >> > stream
> >> > > > from within the PDFViewer
> >> > > > - Design a plugin framework for the viewer
> >> > > >
> >> > > > Maybe those are a good list of primary objectives? We should think
> >> > about
> >> > > > some secondary objectives, like possibly supporting Optional
> Content
> >> > in the
> >> > > > view (not sure if it already is) and better handling of
> transparency
> >> > > > layers?
> >> > > >
> >> > > > Just some thoughts, let's keep this discussions fluid for now, we
> >> > haven't
> >> > > > actually got that long before we have to submit a proposal though.
> >> > > >
> >> > > > Mehdi
> >> > > >
> >> > > > On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
> >> > > > tharaka.nw@gmail.com
> >> > > > > wrote:
> >> > > >
> >> > > > > Hi,
> >> > > > >
> >> > > > >
> >> > > > > Thanks everyone for your valuable ideas and comments.
> >> > > > >
> >> > > > >
> >> > > > > It seems most of you like to enhancing PDFReader project.
> >> Specially I
> >> > > > > prefer Jeremias' idea, develop application which allow user to
> >> access
> >> > > > > PDFBox features through the GUI and PDF viewer with integrated
> PDF
> >> > > > > Debugger. In my opinion it will be very much useful tool for
> >> users as
> >> > > > well
> >> > > > > as developers rather than just another PDF reader. So I would
> >> like to
> >> > > > > continue with this project. However, since there are several
> ways
> >> to
> >> > > > > enhance PDFReader, we have to define the scope of the project
> >> > according
> >> > > > to
> >> > > > > priority in next few days.
> >> > > > >
> >> > > > >
> >> > > > > Additionally I want to mention another thing here. As my
> >> experience
> >> > > > current
> >> > > > > PDFReader is also not very much reliable and not working
> >> smoothly. I
> >> > > > still
> >> > > > > can't understand that whether it is problem of code of PDFReader
> >> or
> >> > > > PDFBox
> >> > > > > core. Anyway I think that it's better if we can consider also
> this
> >> > in the
> >> > > > > project, if it's not some issue in PDFBox core.
> >> > > > >
> >> > > > > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <
> >> med1985@gmail.com
> >> > >
> >> > > > > wrote:
> >> > > > >
> >> > > > > > Hi Tharaka,
> >> > > > > >
> >> > > > > > You have plenty of options here, the student application
> >> deadline
> >> > is on
> >> > > > > the
> >> > > > > > 6th April (see the calendar
> >> > > > > > http://www.google-melange.com/gsoc/events/google/gsoc2012),
> so
> >> > you've
> >> > > > > got
> >> > > > > > plenty of time.
> >> > > > > >
> >> > > > > > Let me and/or the community know if you have any questions
> about
> >> > the
> >> > > > > > projects. If you have an idea of which project you'd prefer,
> >> maybe
> >> > we
> >> > > > can
> >> > > > > > start drawing up some goals and a schedule and discuss with
> the
> >> > > > community
> >> > > > > > to get some feedback on which facets are a) interesting b)
> >> useful
> >> > or c)
> >> > > > > > both!
> >> > > > > >
> >> > > > > > Mehdi
> >> > > > > >
> >> > > > > > On 20 March 2012 15:58, Jeremias Maerki <
> dev@jeremias-maerki.ch
> >> >
> >> > > > wrote:
> >> > > > > >
> >> > > > > > > There are a number of HTML to XSL-FO converters. I've never
> >> > checked
> >> > > > how
> >> > > > > > > good the results are but ultimately a native HTML engine is
> >> > likely to
> >> > > > > > > produce better results especially since HTML was not really
> >> > designed
> >> > > > > for
> >> > > > > > > print and has to be retrofitted in CSS3. But again, this is
> a
> >> > large
> >> > > > > > > project and, IMO, out of scope for PDFBox.
> >> > > > > > >
> >> > > > > > > http://html2fo.sourceforge.net/
> >> > > > > > > http://denature.sourceforge.net/
> >> > > > > > > ...and probably others...
> >> > > > > > >
> >> > > > > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> >> > > > > > > > I vote for a PDS editor kind of stuff.
> >> > > > > > > > Another thing regarding the HTML to PDS, cant it be done
> >> with
> >> > > > XSL-FO
> >> > > > > > > > feature?
> >> > > > > > > > Thanks
> >> > > > > > > >
> >> > > > > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> >> > > > > > sahyoun@fileaffairs.de
> >> > > > > > > >wrote:
> >> > > > > > > >
> >> > > > > > > > >
> >> > > > > > > > > > <snip/>
> >> > > > > > > > > >
> >> > > > > > > > > >>
> >> > > > > > > > > >> Although I think that the current PDF Reader can be
> >> > enhanced
> >> > > > in
> >> > > > > > many
> >> > > > > > > > > ways
> >> > > > > > > > > >> there are already so many Readers out there as well
> as
> >> PDF
> >> > > > > support
> >> > > > > > > > > within
> >> > > > > > > > > >> web browsers my personal opinion is that enhancing
> >> PDFBox
> >> > core
> >> > > > > > > > > capabilities
> >> > > > > > > > > >> would be more beneficial.
> >> > > > > > > > > >>
> >> > > > > > > > > >> With kind regards
> >> > > > > > > > > >>
> >> > > > > > > > > >> Maruan Sahyoun
> >> > > > > > > > > >>
> >> > > > > > > > > >>
> >> > > > > > > > > > Check out Jeremias' suggestions of the viewer, it's
> less
> >> > of a
> >> > > > > > viewer
> >> > > > > > > and
> >> > > > > > > > > > more of a front-end for a lot of the tools PDFBox has
> to
> >> > > > offer, a
> >> > > > > > > PDFBox
> >> > > > > > > > > > GUI so to speak rather than a PDF viewer.
> >> > > > > > > > >
> >> > > > > > > > > I'd still look into enhancing PDFBox core as this will
> >> > benefit
> >> > > > most
> >> > > > > > > users.
> >> > > > > > > > > Looking at the bugs and issues most come from core
> >> > capabilities.
> >> > > > > > >
> >> > > > > > >
> >> > > > > > >
> >> > > > > > >
> >> > > > > > > Jeremias Maerki
> >> > > > > > >
> >> > > > > > >
> >> > > > > >
> >> > > > >
> >> > > > >
> >> > > > >
> >> > > > > --
> >> > > > > Thanks & Regards,
> >> > > > > Tharaka Wijebandara,
> >> > > > > Faculty of Information Technology,
> >> > > > > University of Moratuwa.
> >> > > > >
> >> > > >
> >> > >
> >> > >
> >> > >
> >> > > --
> >> > > Thanks & Regards,
> >> > > Tharaka Wijebandara,
> >> > > Faculty of Information Technology,
> >> > > University of Moratuwa.
> >> >
> >> >
> >> >
> >> >
> >> > Jeremias Maerki
> >> >
> >> >
> >>
> >
> >
> >
> > --
> > Thanks & Regards,
> > Tharaka Wijebandara,
> > Faculty of Information Technology,
> > University of Moratuwa.
> >
> >
>
>
> --
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.
>

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,

I have completed some parts of the the GSoC proposal and I need some
feedback from the community about the that. Here a I used swing approach
since I have some doubts about RCP. Anyway I also understand RCP will
easy approach rather than this and if someone going to help me on that I'm
still ready to go in that way. Still I have to prepare 'Project Timeline
and Deliverables' part and I will send those part soon.

*Project Title*

*=============*



Implement PDF object inspector and enhance features of PDFReader GUI





*Abstract*

*========*



PDFReader is small GUI application which comes with Apache PDFBox pure java
PDF library. Currently it is only capable of rendering PDFs. This project
is to redesign PDFReader GUI with plugin framework and implement some
developer oriented features for it. Primary objectives of project are as
follows.



(1). Design a plugin framework for the PDFReader.



(2). Implement PDF basic viewing features. (Zooming, rotating, page
thumbnail viewer)



(3). Implement a PDF object inspector to display PDF object properties.



(4). Implement a type-writer feature to add/remove text from a content
stream.





*Project Details*

*===============*



PDFBox already has so many features required, but currently most of them
only available for developers and from the command line. Aim of this
project is extending PDFReader into well designed application which allows
access to some PDFBox features through GUI and integrating some developer
oriented features. However, project itself is not going to consider GUI
access to all available PDFBox features. Instead of that it re-architects
PDFReader into pluggable application which developer can easily integrate
other features by developing plugins.

In that case, most important primary objective of the project is designing
a simple plugin framework for the PDFReader. In the implementation, we can
use JAR Service Provider mechanism as the base of the plugin framework. For
that first, it's required to identify what are pluggable classes which is
going to use in the application and define interfaces for those classes
with the basic methods. Then we can implement some mechanism for plugin
developer to define what are the classes to use, and where (which
JAR/directory) to load them from. When application loads, it will check
specific locations for plugins and create instance of those plugins using
class loader. Here after application can use those plugin instances as
components of application itself.



Rotating and zooming functions are basic features which is available in
every PDF viewer and it allow user to view PDF pages from different
perspectives. However current PDFReader is unable to provide those
features. This project will address to this problem as part of second
primary objective by implementing some menu commands for those features. In
the implementation, it will be matter of changing some codes in PageDrawer
and other related class which are responsible to render PDF pages using
Java2D.



Other interesting feature which is going to archive as another part of
second objective is developing page thumbnail viewer for the PDFReader.
This will be cool GUI component for user to navigate through the PDF
Document very easily. It will be image list which has small image of the
every PDF page of the current document with page numbers. User can simply
click on the image to jump the particular page. This component will be
developed as plugin for PDFReader and user will be able to show and hide
using the menu command.


Third objective is implementing a object inspector for PDFReader and it
will be very much useful to developer who want to debug PDF documents. User
can enable click one the any object on the PDF viewer and then in a small
window, it will show the information about particular object while
highlighting the object inside the viewer. As a example if user click on
image of the PDF it will show the information such as image size, color
space, resolution and bit depth. If object is text, it will show the
details about font, color space and color values. In implementation this
also will be developed as another plugin for the PDFReader.


Typewriter feature is last objective which we going to implement within
this project and it also will be developed as plugin. It will allow user to
click any place in the PDF and type new text for the PDF page. Small window
or tool box will appear in this mode and user can change the font size,
color and other related properties. Furthermore, if user wants to remove
this newly added text, he can simply select and press 'Delete'.



*Benefits to PDFBox*

*==================*

Most of the modern code libraries are delivered with GUI application to
test the features before studying API. However, features of PDFBox are
mostly accessible through the command line interface and it discourages
users and new developers from using PDFBox. By having GUI for access these
features any user can simply use them and developer can test the features
without going through the code. Since designing plugin framework is a
primary objective of the project, developers will be able to easily append
new features for GUI in future.

PDF object inspector will be great tool for developer and it will make easy
debugging PDFs. Futhermore, it will help to new developers to study the
structure of PDF file and understanding PDFBox core functionalities.


On Sat, Mar 31, 2012 at 11:12 AM, Tharaka Nayanajith Wijebandara <
tharaka.nw@gmail.com> wrote:

> Hi,
>
>
> Idea about object inspector is clear and I think I can make that dream
> into reality. ;)
>
>
> I have some doubt regarding RCP. Actually I didn't have any previous
> experience with RPC and read some articles to get idea about it. As per my
> understanding, although there are several RCPs in our case eclipse RCP is
> most suitable one. However since RCP Application has several dependencies
> and they are not in maven structure, I still can't understand how them
> organize in pdfbox project. Is there any example project for understanding
> it? Maybe I'm misunderstood, need some help from you to clear it.
>
>
> Mehdi, I started to create proposal and I'll put it here within this
> weekend.
>
> On Thu, Mar 29, 2012 at 2:35 PM, mehdi houshmand <me...@gmail.com>wrote:
>
>> I think my hopes and dreams are pretty exactly the same as Jeremias'.
>> These
>> features would give PDFBox differentiation from Adobe products which focus
>> much more design of PDF rather than nuts and bots debugging. As for the
>> GUI, I'm really not fussy regarding the UI framework you use, but RCP does
>> look pretty interesting...
>>
>> Just as an aside, in terms of structure, I was thinking of this being a
>> maven sub-project in the same way fontbox and jempbox etc are. I just
>> wanted to make sure we're all on the same wavelength here...
>>
>> Tharaka, I think you've probably got enough information here to create a
>> proposal, if you want to put a draft up here for us (the community) to
>> check over feel free to do so. But you're going to need to publish the
>> proposal on Google melange (
>> http://www.google-melange.com/gsoc/homepage/google/gsoc2012) so that I
>> can
>> agree to mentor it and rate the application.
>>
>> Just a bit of admin:
>> Debian has a good template for student applications (
>> http://wiki.debian.org/SummerOfCode/StudentApplicationTemplate). Once you
>> fill out the necessary details I will assign myself as the mentor and have
>> to rank your application (
>> http://community.apache.org/mentee-ranking-process.html). If you want to
>> fair better in the ranking process, involvement in the community is highly
>> regarded by either a) answering user questions on the mailing list and/or
>> b) going through PDFBox bugs and fixing some bugs; this isn't mandatory
>> but
>> will improve the ranking on your application.
>>
>> Mehdi
>>
>>
>> On 29 March 2012 07:36, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:
>>
>> > Hi Tharaka,
>> >
>> > Object Inspector: I'm not sure you need a tree for inspecting a single
>> > object. The tree is useful when the PDF debugger is integrated as an
>> > alternative view of a PDF. I'd expect to have a customized window (a
>> > plug-in ;-) ) for each kind of object when inspecting it. A bitmap image
>> > would display information about the size, the color space and profile,
>> > effective resolution. Clicking on a word would display its font and
>> > color. I guess it would be quite cool to have a button that says "jump
>> > to object tree" so it would switch to the debugger view and focus on the
>> > selected object. Imagine having viewer and debugger side-by-side in
>> > separate views and the object that you click on in the debugger is
>> > highlighted in the viewer. Wow. But I'm dreaming. ;-)
>> >
>> > Unfortunately, JPF is off-limits for PDFBox if it is used for core
>> > functionality because it's under the LGPL license. Only optional
>> > components/plug-ins may depend on LGPL libraries. But I think the normal
>> > JAR Service Provider mechanism would already be good enough as a plug-in
>> > framework:
>> >
>> >
>> http://docs.oracle.com/javase/6/docs/technotes/guides/jar/jar.html#Service%20Provider
>> > We use that extensively over in Apache XML Graphics land. I would
>> > suggest to base the GUI on Java 6 where you have the service lookup
>> > built in:
>> > http://docs.oracle.com/javase/6/docs/api/java/util/ServiceLoader.html
>> >
>> > Swing with Nimbus generally sounds good to me.
>> >
>> > OTOH, an RCP application would actually offer the whole pluggability
>> > (even at runtime) and an Eclipse-like workbench which would be great for
>> > such an application. But it would also mean practically re-writing both
>> > the PDF viewer (although the PDF painting via Java2D should still be
>> > usable) and debugger functionality.
>> >
>> > Both Swing and RCP are non-trivial to learn. On the technical side I'd
>> > probably favor RCP just a little bit, mostly because it's OSGi-based
>> > (I'm biased there) even if the Eclipse world does ugly things to OSGi.
>> > Furthermore, I know few good Swing applications but many good RCP
>> > applications. Maybe that's an indicator. But I'd have to learn RCP
>> > myself if I was to help out with this.
>> >
>> > Optional Content Groups (since PDF 1.5) are groups of objects (think
>> > "layer") that can be enabled and disabled. I think that's advanced
>> > functionality that the mainstream doesn't really need. But I'm also not
>> > sure what Medhi means with transparency layers.
>> >
>> > On 29.03.2012 03:55:28 Tharaka Nayanajith Wijebandara wrote:
>> > > Hi,
>> > >
>> > >
>> > > Thanx mehdi, you have summarized all primary objective and we can
>> > continue
>> > > discussion based on it.
>> > >
>> > > For the GUI, I'm going to use Swing Framework and to modernize we can
>> go
>> > > for some available look and Feel such as nimbus. Is there any
>> suggestion?
>> > >
>> > >
>> > > I need some ideas about PDF object inspector and since it tool for
>> > > developer all you can give some suggestions. In my opinion we can use
>> > swing
>> > > tree for this. User can right click on object and click ‘inspect’
>> command
>> > > in the menu, then it will show the object and it properties in the
>> tree
>> > and
>> > > highlight object margin in the PDF view.
>> > >
>> > >
>> > > Plugin frame is another feature to consider. We can use and adapt some
>> > > available plugin frame for PDFReader rather than going for new our own
>> > one.
>> > > As per my understand Java Plug-in Framework (
>> http://jpf.sourceforge.net/
>> > )
>> > > is good one we can use and it is XML based. Is there any apache
>> framework
>> > > for this? If I'm correct, maven project is based on plugin framework.
>> > >
>> > >
>> > > Mehdi, I would like to add bookmark feature also in to primary
>> objective
>> > > list. It will allow user to view, add, edit and delete book marks. And
>> > also
>> > > I'm confused about what you mean by supporting optional content and
>> > > transparency layers.
>> > >
>> > > On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <me...@gmail.com>
>> > wrote:
>> > >
>> > > > Hi Tharaka,
>> > > >
>> > > > Ok, well let's start with thinking about how this GUI is going to
>> > look. In
>> > > > its current form, it's looking a little dated, it might be worth
>> using
>> > some
>> > > > of the GUI frameworks out there to spruce it up a little and making
>> it
>> > a
>> > > > bit more modern. So if we get a list of TODOs (starting with
>> Jeremias'
>> > > > suggestions):
>> > > >
>> > > > - Investigate and design a proposal for upgrading the PDFReader GUI
>> > (i.e.
>> > > > modernizing it)
>> > > > - Implement a PDF object inspector to display PDF object properties
>> > when
>> > > > objects are "clicked-on" in the viewer
>> > > > - Implement a type-writer feature to add/remove text from a content
>> > stream
>> > > > from within the PDFViewer
>> > > > - Design a plugin framework for the viewer
>> > > >
>> > > > Maybe those are a good list of primary objectives? We should think
>> > about
>> > > > some secondary objectives, like possibly supporting Optional Content
>> > in the
>> > > > view (not sure if it already is) and better handling of transparency
>> > > > layers?
>> > > >
>> > > > Just some thoughts, let's keep this discussions fluid for now, we
>> > haven't
>> > > > actually got that long before we have to submit a proposal though.
>> > > >
>> > > > Mehdi
>> > > >
>> > > > On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
>> > > > tharaka.nw@gmail.com
>> > > > > wrote:
>> > > >
>> > > > > Hi,
>> > > > >
>> > > > >
>> > > > > Thanks everyone for your valuable ideas and comments.
>> > > > >
>> > > > >
>> > > > > It seems most of you like to enhancing PDFReader project.
>> Specially I
>> > > > > prefer Jeremias' idea, develop application which allow user to
>> access
>> > > > > PDFBox features through the GUI and PDF viewer with integrated PDF
>> > > > > Debugger. In my opinion it will be very much useful tool for
>> users as
>> > > > well
>> > > > > as developers rather than just another PDF reader. So I would
>> like to
>> > > > > continue with this project. However, since there are several ways
>> to
>> > > > > enhance PDFReader, we have to define the scope of the project
>> > according
>> > > > to
>> > > > > priority in next few days.
>> > > > >
>> > > > >
>> > > > > Additionally I want to mention another thing here. As my
>> experience
>> > > > current
>> > > > > PDFReader is also not very much reliable and not working
>> smoothly. I
>> > > > still
>> > > > > can't understand that whether it is problem of code of PDFReader
>> or
>> > > > PDFBox
>> > > > > core. Anyway I think that it's better if we can consider also this
>> > in the
>> > > > > project, if it's not some issue in PDFBox core.
>> > > > >
>> > > > > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <
>> med1985@gmail.com
>> > >
>> > > > > wrote:
>> > > > >
>> > > > > > Hi Tharaka,
>> > > > > >
>> > > > > > You have plenty of options here, the student application
>> deadline
>> > is on
>> > > > > the
>> > > > > > 6th April (see the calendar
>> > > > > > http://www.google-melange.com/gsoc/events/google/gsoc2012), so
>> > you've
>> > > > > got
>> > > > > > plenty of time.
>> > > > > >
>> > > > > > Let me and/or the community know if you have any questions about
>> > the
>> > > > > > projects. If you have an idea of which project you'd prefer,
>> maybe
>> > we
>> > > > can
>> > > > > > start drawing up some goals and a schedule and discuss with the
>> > > > community
>> > > > > > to get some feedback on which facets are a) interesting b)
>> useful
>> > or c)
>> > > > > > both!
>> > > > > >
>> > > > > > Mehdi
>> > > > > >
>> > > > > > On 20 March 2012 15:58, Jeremias Maerki <dev@jeremias-maerki.ch
>> >
>> > > > wrote:
>> > > > > >
>> > > > > > > There are a number of HTML to XSL-FO converters. I've never
>> > checked
>> > > > how
>> > > > > > > good the results are but ultimately a native HTML engine is
>> > likely to
>> > > > > > > produce better results especially since HTML was not really
>> > designed
>> > > > > for
>> > > > > > > print and has to be retrofitted in CSS3. But again, this is a
>> > large
>> > > > > > > project and, IMO, out of scope for PDFBox.
>> > > > > > >
>> > > > > > > http://html2fo.sourceforge.net/
>> > > > > > > http://denature.sourceforge.net/
>> > > > > > > ...and probably others...
>> > > > > > >
>> > > > > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
>> > > > > > > > I vote for a PDS editor kind of stuff.
>> > > > > > > > Another thing regarding the HTML to PDS, cant it be done
>> with
>> > > > XSL-FO
>> > > > > > > > feature?
>> > > > > > > > Thanks
>> > > > > > > >
>> > > > > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
>> > > > > > sahyoun@fileaffairs.de
>> > > > > > > >wrote:
>> > > > > > > >
>> > > > > > > > >
>> > > > > > > > > > <snip/>
>> > > > > > > > > >
>> > > > > > > > > >>
>> > > > > > > > > >> Although I think that the current PDF Reader can be
>> > enhanced
>> > > > in
>> > > > > > many
>> > > > > > > > > ways
>> > > > > > > > > >> there are already so many Readers out there as well as
>> PDF
>> > > > > support
>> > > > > > > > > within
>> > > > > > > > > >> web browsers my personal opinion is that enhancing
>> PDFBox
>> > core
>> > > > > > > > > capabilities
>> > > > > > > > > >> would be more beneficial.
>> > > > > > > > > >>
>> > > > > > > > > >> With kind regards
>> > > > > > > > > >>
>> > > > > > > > > >> Maruan Sahyoun
>> > > > > > > > > >>
>> > > > > > > > > >>
>> > > > > > > > > > Check out Jeremias' suggestions of the viewer, it's less
>> > of a
>> > > > > > viewer
>> > > > > > > and
>> > > > > > > > > > more of a front-end for a lot of the tools PDFBox has to
>> > > > offer, a
>> > > > > > > PDFBox
>> > > > > > > > > > GUI so to speak rather than a PDF viewer.
>> > > > > > > > >
>> > > > > > > > > I'd still look into enhancing PDFBox core as this will
>> > benefit
>> > > > most
>> > > > > > > users.
>> > > > > > > > > Looking at the bugs and issues most come from core
>> > capabilities.
>> > > > > > >
>> > > > > > >
>> > > > > > >
>> > > > > > >
>> > > > > > > Jeremias Maerki
>> > > > > > >
>> > > > > > >
>> > > > > >
>> > > > >
>> > > > >
>> > > > >
>> > > > > --
>> > > > > Thanks & Regards,
>> > > > > Tharaka Wijebandara,
>> > > > > Faculty of Information Technology,
>> > > > > University of Moratuwa.
>> > > > >
>> > > >
>> > >
>> > >
>> > >
>> > > --
>> > > Thanks & Regards,
>> > > Tharaka Wijebandara,
>> > > Faculty of Information Technology,
>> > > University of Moratuwa.
>> >
>> >
>> >
>> >
>> > Jeremias Maerki
>> >
>> >
>>
>
>
>
> --
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.
>
>


-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,


Idea about object inspector is clear and I think I can make that dream into
reality. ;)


I have some doubt regarding RCP. Actually I didn't have any previous
experience with RPC and read some articles to get idea about it. As per my
understanding, although there are several RCPs in our case eclipse RCP is
most suitable one. However since RCP Application has several dependencies
and they are not in maven structure, I still can't understand how them
organize in pdfbox project. Is there any example project for understanding
it? Maybe I'm misunderstood, need some help from you to clear it.


Mehdi, I started to create proposal and I'll put it here within this
weekend.

On Thu, Mar 29, 2012 at 2:35 PM, mehdi houshmand <me...@gmail.com> wrote:

> I think my hopes and dreams are pretty exactly the same as Jeremias'. These
> features would give PDFBox differentiation from Adobe products which focus
> much more design of PDF rather than nuts and bots debugging. As for the
> GUI, I'm really not fussy regarding the UI framework you use, but RCP does
> look pretty interesting...
>
> Just as an aside, in terms of structure, I was thinking of this being a
> maven sub-project in the same way fontbox and jempbox etc are. I just
> wanted to make sure we're all on the same wavelength here...
>
> Tharaka, I think you've probably got enough information here to create a
> proposal, if you want to put a draft up here for us (the community) to
> check over feel free to do so. But you're going to need to publish the
> proposal on Google melange (
> http://www.google-melange.com/gsoc/homepage/google/gsoc2012) so that I can
> agree to mentor it and rate the application.
>
> Just a bit of admin:
> Debian has a good template for student applications (
> http://wiki.debian.org/SummerOfCode/StudentApplicationTemplate). Once you
> fill out the necessary details I will assign myself as the mentor and have
> to rank your application (
> http://community.apache.org/mentee-ranking-process.html). If you want to
> fair better in the ranking process, involvement in the community is highly
> regarded by either a) answering user questions on the mailing list and/or
> b) going through PDFBox bugs and fixing some bugs; this isn't mandatory but
> will improve the ranking on your application.
>
> Mehdi
>
>
> On 29 March 2012 07:36, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:
>
> > Hi Tharaka,
> >
> > Object Inspector: I'm not sure you need a tree for inspecting a single
> > object. The tree is useful when the PDF debugger is integrated as an
> > alternative view of a PDF. I'd expect to have a customized window (a
> > plug-in ;-) ) for each kind of object when inspecting it. A bitmap image
> > would display information about the size, the color space and profile,
> > effective resolution. Clicking on a word would display its font and
> > color. I guess it would be quite cool to have a button that says "jump
> > to object tree" so it would switch to the debugger view and focus on the
> > selected object. Imagine having viewer and debugger side-by-side in
> > separate views and the object that you click on in the debugger is
> > highlighted in the viewer. Wow. But I'm dreaming. ;-)
> >
> > Unfortunately, JPF is off-limits for PDFBox if it is used for core
> > functionality because it's under the LGPL license. Only optional
> > components/plug-ins may depend on LGPL libraries. But I think the normal
> > JAR Service Provider mechanism would already be good enough as a plug-in
> > framework:
> >
> >
> http://docs.oracle.com/javase/6/docs/technotes/guides/jar/jar.html#Service%20Provider
> > We use that extensively over in Apache XML Graphics land. I would
> > suggest to base the GUI on Java 6 where you have the service lookup
> > built in:
> > http://docs.oracle.com/javase/6/docs/api/java/util/ServiceLoader.html
> >
> > Swing with Nimbus generally sounds good to me.
> >
> > OTOH, an RCP application would actually offer the whole pluggability
> > (even at runtime) and an Eclipse-like workbench which would be great for
> > such an application. But it would also mean practically re-writing both
> > the PDF viewer (although the PDF painting via Java2D should still be
> > usable) and debugger functionality.
> >
> > Both Swing and RCP are non-trivial to learn. On the technical side I'd
> > probably favor RCP just a little bit, mostly because it's OSGi-based
> > (I'm biased there) even if the Eclipse world does ugly things to OSGi.
> > Furthermore, I know few good Swing applications but many good RCP
> > applications. Maybe that's an indicator. But I'd have to learn RCP
> > myself if I was to help out with this.
> >
> > Optional Content Groups (since PDF 1.5) are groups of objects (think
> > "layer") that can be enabled and disabled. I think that's advanced
> > functionality that the mainstream doesn't really need. But I'm also not
> > sure what Medhi means with transparency layers.
> >
> > On 29.03.2012 03:55:28 Tharaka Nayanajith Wijebandara wrote:
> > > Hi,
> > >
> > >
> > > Thanx mehdi, you have summarized all primary objective and we can
> > continue
> > > discussion based on it.
> > >
> > > For the GUI, I'm going to use Swing Framework and to modernize we can
> go
> > > for some available look and Feel such as nimbus. Is there any
> suggestion?
> > >
> > >
> > > I need some ideas about PDF object inspector and since it tool for
> > > developer all you can give some suggestions. In my opinion we can use
> > swing
> > > tree for this. User can right click on object and click ‘inspect’
> command
> > > in the menu, then it will show the object and it properties in the tree
> > and
> > > highlight object margin in the PDF view.
> > >
> > >
> > > Plugin frame is another feature to consider. We can use and adapt some
> > > available plugin frame for PDFReader rather than going for new our own
> > one.
> > > As per my understand Java Plug-in Framework (
> http://jpf.sourceforge.net/
> > )
> > > is good one we can use and it is XML based. Is there any apache
> framework
> > > for this? If I'm correct, maven project is based on plugin framework.
> > >
> > >
> > > Mehdi, I would like to add bookmark feature also in to primary
> objective
> > > list. It will allow user to view, add, edit and delete book marks. And
> > also
> > > I'm confused about what you mean by supporting optional content and
> > > transparency layers.
> > >
> > > On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <me...@gmail.com>
> > wrote:
> > >
> > > > Hi Tharaka,
> > > >
> > > > Ok, well let's start with thinking about how this GUI is going to
> > look. In
> > > > its current form, it's looking a little dated, it might be worth
> using
> > some
> > > > of the GUI frameworks out there to spruce it up a little and making
> it
> > a
> > > > bit more modern. So if we get a list of TODOs (starting with
> Jeremias'
> > > > suggestions):
> > > >
> > > > - Investigate and design a proposal for upgrading the PDFReader GUI
> > (i.e.
> > > > modernizing it)
> > > > - Implement a PDF object inspector to display PDF object properties
> > when
> > > > objects are "clicked-on" in the viewer
> > > > - Implement a type-writer feature to add/remove text from a content
> > stream
> > > > from within the PDFViewer
> > > > - Design a plugin framework for the viewer
> > > >
> > > > Maybe those are a good list of primary objectives? We should think
> > about
> > > > some secondary objectives, like possibly supporting Optional Content
> > in the
> > > > view (not sure if it already is) and better handling of transparency
> > > > layers?
> > > >
> > > > Just some thoughts, let's keep this discussions fluid for now, we
> > haven't
> > > > actually got that long before we have to submit a proposal though.
> > > >
> > > > Mehdi
> > > >
> > > > On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
> > > > tharaka.nw@gmail.com
> > > > > wrote:
> > > >
> > > > > Hi,
> > > > >
> > > > >
> > > > > Thanks everyone for your valuable ideas and comments.
> > > > >
> > > > >
> > > > > It seems most of you like to enhancing PDFReader project.
> Specially I
> > > > > prefer Jeremias' idea, develop application which allow user to
> access
> > > > > PDFBox features through the GUI and PDF viewer with integrated PDF
> > > > > Debugger. In my opinion it will be very much useful tool for users
> as
> > > > well
> > > > > as developers rather than just another PDF reader. So I would like
> to
> > > > > continue with this project. However, since there are several ways
> to
> > > > > enhance PDFReader, we have to define the scope of the project
> > according
> > > > to
> > > > > priority in next few days.
> > > > >
> > > > >
> > > > > Additionally I want to mention another thing here. As my experience
> > > > current
> > > > > PDFReader is also not very much reliable and not working smoothly.
> I
> > > > still
> > > > > can't understand that whether it is problem of code of PDFReader or
> > > > PDFBox
> > > > > core. Anyway I think that it's better if we can consider also this
> > in the
> > > > > project, if it's not some issue in PDFBox core.
> > > > >
> > > > > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <
> med1985@gmail.com
> > >
> > > > > wrote:
> > > > >
> > > > > > Hi Tharaka,
> > > > > >
> > > > > > You have plenty of options here, the student application deadline
> > is on
> > > > > the
> > > > > > 6th April (see the calendar
> > > > > > http://www.google-melange.com/gsoc/events/google/gsoc2012), so
> > you've
> > > > > got
> > > > > > plenty of time.
> > > > > >
> > > > > > Let me and/or the community know if you have any questions about
> > the
> > > > > > projects. If you have an idea of which project you'd prefer,
> maybe
> > we
> > > > can
> > > > > > start drawing up some goals and a schedule and discuss with the
> > > > community
> > > > > > to get some feedback on which facets are a) interesting b) useful
> > or c)
> > > > > > both!
> > > > > >
> > > > > > Mehdi
> > > > > >
> > > > > > On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch>
> > > > wrote:
> > > > > >
> > > > > > > There are a number of HTML to XSL-FO converters. I've never
> > checked
> > > > how
> > > > > > > good the results are but ultimately a native HTML engine is
> > likely to
> > > > > > > produce better results especially since HTML was not really
> > designed
> > > > > for
> > > > > > > print and has to be retrofitted in CSS3. But again, this is a
> > large
> > > > > > > project and, IMO, out of scope for PDFBox.
> > > > > > >
> > > > > > > http://html2fo.sourceforge.net/
> > > > > > > http://denature.sourceforge.net/
> > > > > > > ...and probably others...
> > > > > > >
> > > > > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > > > > > > I vote for a PDS editor kind of stuff.
> > > > > > > > Another thing regarding the HTML to PDS, cant it be done with
> > > > XSL-FO
> > > > > > > > feature?
> > > > > > > > Thanks
> > > > > > > >
> > > > > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> > > > > > sahyoun@fileaffairs.de
> > > > > > > >wrote:
> > > > > > > >
> > > > > > > > >
> > > > > > > > > > <snip/>
> > > > > > > > > >
> > > > > > > > > >>
> > > > > > > > > >> Although I think that the current PDF Reader can be
> > enhanced
> > > > in
> > > > > > many
> > > > > > > > > ways
> > > > > > > > > >> there are already so many Readers out there as well as
> PDF
> > > > > support
> > > > > > > > > within
> > > > > > > > > >> web browsers my personal opinion is that enhancing
> PDFBox
> > core
> > > > > > > > > capabilities
> > > > > > > > > >> would be more beneficial.
> > > > > > > > > >>
> > > > > > > > > >> With kind regards
> > > > > > > > > >>
> > > > > > > > > >> Maruan Sahyoun
> > > > > > > > > >>
> > > > > > > > > >>
> > > > > > > > > > Check out Jeremias' suggestions of the viewer, it's less
> > of a
> > > > > > viewer
> > > > > > > and
> > > > > > > > > > more of a front-end for a lot of the tools PDFBox has to
> > > > offer, a
> > > > > > > PDFBox
> > > > > > > > > > GUI so to speak rather than a PDF viewer.
> > > > > > > > >
> > > > > > > > > I'd still look into enhancing PDFBox core as this will
> > benefit
> > > > most
> > > > > > > users.
> > > > > > > > > Looking at the bugs and issues most come from core
> > capabilities.
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > Jeremias Maerki
> > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Thanks & Regards,
> > > > > Tharaka Wijebandara,
> > > > > Faculty of Information Technology,
> > > > > University of Moratuwa.
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Thanks & Regards,
> > > Tharaka Wijebandara,
> > > Faculty of Information Technology,
> > > University of Moratuwa.
> >
> >
> >
> >
> > Jeremias Maerki
> >
> >
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
I think my hopes and dreams are pretty exactly the same as Jeremias'. These
features would give PDFBox differentiation from Adobe products which focus
much more design of PDF rather than nuts and bots debugging. As for the
GUI, I'm really not fussy regarding the UI framework you use, but RCP does
look pretty interesting...

Just as an aside, in terms of structure, I was thinking of this being a
maven sub-project in the same way fontbox and jempbox etc are. I just
wanted to make sure we're all on the same wavelength here...

Tharaka, I think you've probably got enough information here to create a
proposal, if you want to put a draft up here for us (the community) to
check over feel free to do so. But you're going to need to publish the
proposal on Google melange (
http://www.google-melange.com/gsoc/homepage/google/gsoc2012) so that I can
agree to mentor it and rate the application.

Just a bit of admin:
Debian has a good template for student applications (
http://wiki.debian.org/SummerOfCode/StudentApplicationTemplate). Once you
fill out the necessary details I will assign myself as the mentor and have
to rank your application (
http://community.apache.org/mentee-ranking-process.html). If you want to
fair better in the ranking process, involvement in the community is highly
regarded by either a) answering user questions on the mailing list and/or
b) going through PDFBox bugs and fixing some bugs; this isn't mandatory but
will improve the ranking on your application.

Mehdi


On 29 March 2012 07:36, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:

> Hi Tharaka,
>
> Object Inspector: I'm not sure you need a tree for inspecting a single
> object. The tree is useful when the PDF debugger is integrated as an
> alternative view of a PDF. I'd expect to have a customized window (a
> plug-in ;-) ) for each kind of object when inspecting it. A bitmap image
> would display information about the size, the color space and profile,
> effective resolution. Clicking on a word would display its font and
> color. I guess it would be quite cool to have a button that says "jump
> to object tree" so it would switch to the debugger view and focus on the
> selected object. Imagine having viewer and debugger side-by-side in
> separate views and the object that you click on in the debugger is
> highlighted in the viewer. Wow. But I'm dreaming. ;-)
>
> Unfortunately, JPF is off-limits for PDFBox if it is used for core
> functionality because it's under the LGPL license. Only optional
> components/plug-ins may depend on LGPL libraries. But I think the normal
> JAR Service Provider mechanism would already be good enough as a plug-in
> framework:
>
> http://docs.oracle.com/javase/6/docs/technotes/guides/jar/jar.html#Service%20Provider
> We use that extensively over in Apache XML Graphics land. I would
> suggest to base the GUI on Java 6 where you have the service lookup
> built in:
> http://docs.oracle.com/javase/6/docs/api/java/util/ServiceLoader.html
>
> Swing with Nimbus generally sounds good to me.
>
> OTOH, an RCP application would actually offer the whole pluggability
> (even at runtime) and an Eclipse-like workbench which would be great for
> such an application. But it would also mean practically re-writing both
> the PDF viewer (although the PDF painting via Java2D should still be
> usable) and debugger functionality.
>
> Both Swing and RCP are non-trivial to learn. On the technical side I'd
> probably favor RCP just a little bit, mostly because it's OSGi-based
> (I'm biased there) even if the Eclipse world does ugly things to OSGi.
> Furthermore, I know few good Swing applications but many good RCP
> applications. Maybe that's an indicator. But I'd have to learn RCP
> myself if I was to help out with this.
>
> Optional Content Groups (since PDF 1.5) are groups of objects (think
> "layer") that can be enabled and disabled. I think that's advanced
> functionality that the mainstream doesn't really need. But I'm also not
> sure what Medhi means with transparency layers.
>
> On 29.03.2012 03:55:28 Tharaka Nayanajith Wijebandara wrote:
> > Hi,
> >
> >
> > Thanx mehdi, you have summarized all primary objective and we can
> continue
> > discussion based on it.
> >
> > For the GUI, I'm going to use Swing Framework and to modernize we can go
> > for some available look and Feel such as nimbus. Is there any suggestion?
> >
> >
> > I need some ideas about PDF object inspector and since it tool for
> > developer all you can give some suggestions. In my opinion we can use
> swing
> > tree for this. User can right click on object and click ‘inspect’ command
> > in the menu, then it will show the object and it properties in the tree
> and
> > highlight object margin in the PDF view.
> >
> >
> > Plugin frame is another feature to consider. We can use and adapt some
> > available plugin frame for PDFReader rather than going for new our own
> one.
> > As per my understand Java Plug-in Framework (http://jpf.sourceforge.net/
> )
> > is good one we can use and it is XML based. Is there any apache framework
> > for this? If I'm correct, maven project is based on plugin framework.
> >
> >
> > Mehdi, I would like to add bookmark feature also in to primary objective
> > list. It will allow user to view, add, edit and delete book marks. And
> also
> > I'm confused about what you mean by supporting optional content and
> > transparency layers.
> >
> > On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <me...@gmail.com>
> wrote:
> >
> > > Hi Tharaka,
> > >
> > > Ok, well let's start with thinking about how this GUI is going to
> look. In
> > > its current form, it's looking a little dated, it might be worth using
> some
> > > of the GUI frameworks out there to spruce it up a little and making it
> a
> > > bit more modern. So if we get a list of TODOs (starting with Jeremias'
> > > suggestions):
> > >
> > > - Investigate and design a proposal for upgrading the PDFReader GUI
> (i.e.
> > > modernizing it)
> > > - Implement a PDF object inspector to display PDF object properties
> when
> > > objects are "clicked-on" in the viewer
> > > - Implement a type-writer feature to add/remove text from a content
> stream
> > > from within the PDFViewer
> > > - Design a plugin framework for the viewer
> > >
> > > Maybe those are a good list of primary objectives? We should think
> about
> > > some secondary objectives, like possibly supporting Optional Content
> in the
> > > view (not sure if it already is) and better handling of transparency
> > > layers?
> > >
> > > Just some thoughts, let's keep this discussions fluid for now, we
> haven't
> > > actually got that long before we have to submit a proposal though.
> > >
> > > Mehdi
> > >
> > > On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
> > > tharaka.nw@gmail.com
> > > > wrote:
> > >
> > > > Hi,
> > > >
> > > >
> > > > Thanks everyone for your valuable ideas and comments.
> > > >
> > > >
> > > > It seems most of you like to enhancing PDFReader project. Specially I
> > > > prefer Jeremias' idea, develop application which allow user to access
> > > > PDFBox features through the GUI and PDF viewer with integrated PDF
> > > > Debugger. In my opinion it will be very much useful tool for users as
> > > well
> > > > as developers rather than just another PDF reader. So I would like to
> > > > continue with this project. However, since there are several ways to
> > > > enhance PDFReader, we have to define the scope of the project
> according
> > > to
> > > > priority in next few days.
> > > >
> > > >
> > > > Additionally I want to mention another thing here. As my experience
> > > current
> > > > PDFReader is also not very much reliable and not working smoothly. I
> > > still
> > > > can't understand that whether it is problem of code of PDFReader or
> > > PDFBox
> > > > core. Anyway I think that it's better if we can consider also this
> in the
> > > > project, if it's not some issue in PDFBox core.
> > > >
> > > > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <med1985@gmail.com
> >
> > > > wrote:
> > > >
> > > > > Hi Tharaka,
> > > > >
> > > > > You have plenty of options here, the student application deadline
> is on
> > > > the
> > > > > 6th April (see the calendar
> > > > > http://www.google-melange.com/gsoc/events/google/gsoc2012), so
> you've
> > > > got
> > > > > plenty of time.
> > > > >
> > > > > Let me and/or the community know if you have any questions about
> the
> > > > > projects. If you have an idea of which project you'd prefer, maybe
> we
> > > can
> > > > > start drawing up some goals and a schedule and discuss with the
> > > community
> > > > > to get some feedback on which facets are a) interesting b) useful
> or c)
> > > > > both!
> > > > >
> > > > > Mehdi
> > > > >
> > > > > On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch>
> > > wrote:
> > > > >
> > > > > > There are a number of HTML to XSL-FO converters. I've never
> checked
> > > how
> > > > > > good the results are but ultimately a native HTML engine is
> likely to
> > > > > > produce better results especially since HTML was not really
> designed
> > > > for
> > > > > > print and has to be retrofitted in CSS3. But again, this is a
> large
> > > > > > project and, IMO, out of scope for PDFBox.
> > > > > >
> > > > > > http://html2fo.sourceforge.net/
> > > > > > http://denature.sourceforge.net/
> > > > > > ...and probably others...
> > > > > >
> > > > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > > > > > I vote for a PDS editor kind of stuff.
> > > > > > > Another thing regarding the HTML to PDS, cant it be done with
> > > XSL-FO
> > > > > > > feature?
> > > > > > > Thanks
> > > > > > >
> > > > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> > > > > sahyoun@fileaffairs.de
> > > > > > >wrote:
> > > > > > >
> > > > > > > >
> > > > > > > > > <snip/>
> > > > > > > > >
> > > > > > > > >>
> > > > > > > > >> Although I think that the current PDF Reader can be
> enhanced
> > > in
> > > > > many
> > > > > > > > ways
> > > > > > > > >> there are already so many Readers out there as well as PDF
> > > > support
> > > > > > > > within
> > > > > > > > >> web browsers my personal opinion is that enhancing PDFBox
> core
> > > > > > > > capabilities
> > > > > > > > >> would be more beneficial.
> > > > > > > > >>
> > > > > > > > >> With kind regards
> > > > > > > > >>
> > > > > > > > >> Maruan Sahyoun
> > > > > > > > >>
> > > > > > > > >>
> > > > > > > > > Check out Jeremias' suggestions of the viewer, it's less
> of a
> > > > > viewer
> > > > > > and
> > > > > > > > > more of a front-end for a lot of the tools PDFBox has to
> > > offer, a
> > > > > > PDFBox
> > > > > > > > > GUI so to speak rather than a PDF viewer.
> > > > > > > >
> > > > > > > > I'd still look into enhancing PDFBox core as this will
> benefit
> > > most
> > > > > > users.
> > > > > > > > Looking at the bugs and issues most come from core
> capabilities.
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > Jeremias Maerki
> > > > > >
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Thanks & Regards,
> > > > Tharaka Wijebandara,
> > > > Faculty of Information Technology,
> > > > University of Moratuwa.
> > > >
> > >
> >
> >
> >
> > --
> > Thanks & Regards,
> > Tharaka Wijebandara,
> > Faculty of Information Technology,
> > University of Moratuwa.
>
>
>
>
> Jeremias Maerki
>
>

Re: PDFBox Project for GSoC 2012

Posted by Jeremias Maerki <de...@jeremias-maerki.ch>.
Hi Tharaka,

Object Inspector: I'm not sure you need a tree for inspecting a single
object. The tree is useful when the PDF debugger is integrated as an
alternative view of a PDF. I'd expect to have a customized window (a
plug-in ;-) ) for each kind of object when inspecting it. A bitmap image
would display information about the size, the color space and profile,
effective resolution. Clicking on a word would display its font and
color. I guess it would be quite cool to have a button that says "jump
to object tree" so it would switch to the debugger view and focus on the
selected object. Imagine having viewer and debugger side-by-side in
separate views and the object that you click on in the debugger is
highlighted in the viewer. Wow. But I'm dreaming. ;-)

Unfortunately, JPF is off-limits for PDFBox if it is used for core
functionality because it's under the LGPL license. Only optional
components/plug-ins may depend on LGPL libraries. But I think the normal
JAR Service Provider mechanism would already be good enough as a plug-in
framework:
http://docs.oracle.com/javase/6/docs/technotes/guides/jar/jar.html#Service%20Provider
We use that extensively over in Apache XML Graphics land. I would
suggest to base the GUI on Java 6 where you have the service lookup
built in: http://docs.oracle.com/javase/6/docs/api/java/util/ServiceLoader.html

Swing with Nimbus generally sounds good to me.

OTOH, an RCP application would actually offer the whole pluggability
(even at runtime) and an Eclipse-like workbench which would be great for
such an application. But it would also mean practically re-writing both
the PDF viewer (although the PDF painting via Java2D should still be
usable) and debugger functionality.

Both Swing and RCP are non-trivial to learn. On the technical side I'd
probably favor RCP just a little bit, mostly because it's OSGi-based
(I'm biased there) even if the Eclipse world does ugly things to OSGi.
Furthermore, I know few good Swing applications but many good RCP
applications. Maybe that's an indicator. But I'd have to learn RCP
myself if I was to help out with this.

Optional Content Groups (since PDF 1.5) are groups of objects (think
"layer") that can be enabled and disabled. I think that's advanced
functionality that the mainstream doesn't really need. But I'm also not
sure what Medhi means with transparency layers.

On 29.03.2012 03:55:28 Tharaka Nayanajith Wijebandara wrote:
> Hi,
> 
> 
> Thanx mehdi, you have summarized all primary objective and we can continue
> discussion based on it.
> 
> For the GUI, I'm going to use Swing Framework and to modernize we can go
> for some available look and Feel such as nimbus. Is there any suggestion?
> 
> 
> I need some ideas about PDF object inspector and since it tool for
> developer all you can give some suggestions. In my opinion we can use swing
> tree for this. User can right click on object and click ‘inspect’ command
> in the menu, then it will show the object and it properties in the tree and
> highlight object margin in the PDF view.
> 
> 
> Plugin frame is another feature to consider. We can use and adapt some
> available plugin frame for PDFReader rather than going for new our own one.
> As per my understand Java Plug-in Framework (http://jpf.sourceforge.net/)
> is good one we can use and it is XML based. Is there any apache framework
> for this? If I'm correct, maven project is based on plugin framework.
> 
> 
> Mehdi, I would like to add bookmark feature also in to primary objective
> list. It will allow user to view, add, edit and delete book marks. And also
> I'm confused about what you mean by supporting optional content and
> transparency layers.
> 
> On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <me...@gmail.com> wrote:
> 
> > Hi Tharaka,
> >
> > Ok, well let's start with thinking about how this GUI is going to look. In
> > its current form, it's looking a little dated, it might be worth using some
> > of the GUI frameworks out there to spruce it up a little and making it a
> > bit more modern. So if we get a list of TODOs (starting with Jeremias'
> > suggestions):
> >
> > - Investigate and design a proposal for upgrading the PDFReader GUI (i.e.
> > modernizing it)
> > - Implement a PDF object inspector to display PDF object properties when
> > objects are "clicked-on" in the viewer
> > - Implement a type-writer feature to add/remove text from a content stream
> > from within the PDFViewer
> > - Design a plugin framework for the viewer
> >
> > Maybe those are a good list of primary objectives? We should think about
> > some secondary objectives, like possibly supporting Optional Content in the
> > view (not sure if it already is) and better handling of transparency
> > layers?
> >
> > Just some thoughts, let's keep this discussions fluid for now, we haven't
> > actually got that long before we have to submit a proposal though.
> >
> > Mehdi
> >
> > On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
> > tharaka.nw@gmail.com
> > > wrote:
> >
> > > Hi,
> > >
> > >
> > > Thanks everyone for your valuable ideas and comments.
> > >
> > >
> > > It seems most of you like to enhancing PDFReader project. Specially I
> > > prefer Jeremias' idea, develop application which allow user to access
> > > PDFBox features through the GUI and PDF viewer with integrated PDF
> > > Debugger. In my opinion it will be very much useful tool for users as
> > well
> > > as developers rather than just another PDF reader. So I would like to
> > > continue with this project. However, since there are several ways to
> > > enhance PDFReader, we have to define the scope of the project according
> > to
> > > priority in next few days.
> > >
> > >
> > > Additionally I want to mention another thing here. As my experience
> > current
> > > PDFReader is also not very much reliable and not working smoothly. I
> > still
> > > can't understand that whether it is problem of code of PDFReader or
> > PDFBox
> > > core. Anyway I think that it's better if we can consider also this in the
> > > project, if it's not some issue in PDFBox core.
> > >
> > > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <me...@gmail.com>
> > > wrote:
> > >
> > > > Hi Tharaka,
> > > >
> > > > You have plenty of options here, the student application deadline is on
> > > the
> > > > 6th April (see the calendar
> > > > http://www.google-melange.com/gsoc/events/google/gsoc2012), so you've
> > > got
> > > > plenty of time.
> > > >
> > > > Let me and/or the community know if you have any questions about the
> > > > projects. If you have an idea of which project you'd prefer, maybe we
> > can
> > > > start drawing up some goals and a schedule and discuss with the
> > community
> > > > to get some feedback on which facets are a) interesting b) useful or c)
> > > > both!
> > > >
> > > > Mehdi
> > > >
> > > > On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch>
> > wrote:
> > > >
> > > > > There are a number of HTML to XSL-FO converters. I've never checked
> > how
> > > > > good the results are but ultimately a native HTML engine is likely to
> > > > > produce better results especially since HTML was not really designed
> > > for
> > > > > print and has to be retrofitted in CSS3. But again, this is a large
> > > > > project and, IMO, out of scope for PDFBox.
> > > > >
> > > > > http://html2fo.sourceforge.net/
> > > > > http://denature.sourceforge.net/
> > > > > ...and probably others...
> > > > >
> > > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > > > > I vote for a PDS editor kind of stuff.
> > > > > > Another thing regarding the HTML to PDS, cant it be done with
> > XSL-FO
> > > > > > feature?
> > > > > > Thanks
> > > > > >
> > > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> > > > sahyoun@fileaffairs.de
> > > > > >wrote:
> > > > > >
> > > > > > >
> > > > > > > > <snip/>
> > > > > > > >
> > > > > > > >>
> > > > > > > >> Although I think that the current PDF Reader can be enhanced
> > in
> > > > many
> > > > > > > ways
> > > > > > > >> there are already so many Readers out there as well as PDF
> > > support
> > > > > > > within
> > > > > > > >> web browsers my personal opinion is that enhancing PDFBox core
> > > > > > > capabilities
> > > > > > > >> would be more beneficial.
> > > > > > > >>
> > > > > > > >> With kind regards
> > > > > > > >>
> > > > > > > >> Maruan Sahyoun
> > > > > > > >>
> > > > > > > >>
> > > > > > > > Check out Jeremias' suggestions of the viewer, it's less of a
> > > > viewer
> > > > > and
> > > > > > > > more of a front-end for a lot of the tools PDFBox has to
> > offer, a
> > > > > PDFBox
> > > > > > > > GUI so to speak rather than a PDF viewer.
> > > > > > >
> > > > > > > I'd still look into enhancing PDFBox core as this will benefit
> > most
> > > > > users.
> > > > > > > Looking at the bugs and issues most come from core capabilities.
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > Jeremias Maerki
> > > > >
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Thanks & Regards,
> > > Tharaka Wijebandara,
> > > Faculty of Information Technology,
> > > University of Moratuwa.
> > >
> >
> 
> 
> 
> -- 
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.




Jeremias Maerki


Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,


Thanx mehdi, you have summarized all primary objective and we can continue
discussion based on it.

For the GUI, I'm going to use Swing Framework and to modernize we can go
for some available look and Feel such as nimbus. Is there any suggestion?


I need some ideas about PDF object inspector and since it tool for
developer all you can give some suggestions. In my opinion we can use swing
tree for this. User can right click on object and click ‘inspect’ command
in the menu, then it will show the object and it properties in the tree and
highlight object margin in the PDF view.


Plugin frame is another feature to consider. We can use and adapt some
available plugin frame for PDFReader rather than going for new our own one.
As per my understand Java Plug-in Framework (http://jpf.sourceforge.net/)
is good one we can use and it is XML based. Is there any apache framework
for this? If I'm correct, maven project is based on plugin framework.


Mehdi, I would like to add bookmark feature also in to primary objective
list. It will allow user to view, add, edit and delete book marks. And also
I'm confused about what you mean by supporting optional content and
transparency layers.

On Mon, Mar 26, 2012 at 1:49 PM, mehdi houshmand <me...@gmail.com> wrote:

> Hi Tharaka,
>
> Ok, well let's start with thinking about how this GUI is going to look. In
> its current form, it's looking a little dated, it might be worth using some
> of the GUI frameworks out there to spruce it up a little and making it a
> bit more modern. So if we get a list of TODOs (starting with Jeremias'
> suggestions):
>
> - Investigate and design a proposal for upgrading the PDFReader GUI (i.e.
> modernizing it)
> - Implement a PDF object inspector to display PDF object properties when
> objects are "clicked-on" in the viewer
> - Implement a type-writer feature to add/remove text from a content stream
> from within the PDFViewer
> - Design a plugin framework for the viewer
>
> Maybe those are a good list of primary objectives? We should think about
> some secondary objectives, like possibly supporting Optional Content in the
> view (not sure if it already is) and better handling of transparency
> layers?
>
> Just some thoughts, let's keep this discussions fluid for now, we haven't
> actually got that long before we have to submit a proposal though.
>
> Mehdi
>
> On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <
> tharaka.nw@gmail.com
> > wrote:
>
> > Hi,
> >
> >
> > Thanks everyone for your valuable ideas and comments.
> >
> >
> > It seems most of you like to enhancing PDFReader project. Specially I
> > prefer Jeremias' idea, develop application which allow user to access
> > PDFBox features through the GUI and PDF viewer with integrated PDF
> > Debugger. In my opinion it will be very much useful tool for users as
> well
> > as developers rather than just another PDF reader. So I would like to
> > continue with this project. However, since there are several ways to
> > enhance PDFReader, we have to define the scope of the project according
> to
> > priority in next few days.
> >
> >
> > Additionally I want to mention another thing here. As my experience
> current
> > PDFReader is also not very much reliable and not working smoothly. I
> still
> > can't understand that whether it is problem of code of PDFReader or
> PDFBox
> > core. Anyway I think that it's better if we can consider also this in the
> > project, if it's not some issue in PDFBox core.
> >
> > On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <me...@gmail.com>
> > wrote:
> >
> > > Hi Tharaka,
> > >
> > > You have plenty of options here, the student application deadline is on
> > the
> > > 6th April (see the calendar
> > > http://www.google-melange.com/gsoc/events/google/gsoc2012), so you've
> > got
> > > plenty of time.
> > >
> > > Let me and/or the community know if you have any questions about the
> > > projects. If you have an idea of which project you'd prefer, maybe we
> can
> > > start drawing up some goals and a schedule and discuss with the
> community
> > > to get some feedback on which facets are a) interesting b) useful or c)
> > > both!
> > >
> > > Mehdi
> > >
> > > On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch>
> wrote:
> > >
> > > > There are a number of HTML to XSL-FO converters. I've never checked
> how
> > > > good the results are but ultimately a native HTML engine is likely to
> > > > produce better results especially since HTML was not really designed
> > for
> > > > print and has to be retrofitted in CSS3. But again, this is a large
> > > > project and, IMO, out of scope for PDFBox.
> > > >
> > > > http://html2fo.sourceforge.net/
> > > > http://denature.sourceforge.net/
> > > > ...and probably others...
> > > >
> > > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > > > I vote for a PDS editor kind of stuff.
> > > > > Another thing regarding the HTML to PDS, cant it be done with
> XSL-FO
> > > > > feature?
> > > > > Thanks
> > > > >
> > > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> > > sahyoun@fileaffairs.de
> > > > >wrote:
> > > > >
> > > > > >
> > > > > > > <snip/>
> > > > > > >
> > > > > > >>
> > > > > > >> Although I think that the current PDF Reader can be enhanced
> in
> > > many
> > > > > > ways
> > > > > > >> there are already so many Readers out there as well as PDF
> > support
> > > > > > within
> > > > > > >> web browsers my personal opinion is that enhancing PDFBox core
> > > > > > capabilities
> > > > > > >> would be more beneficial.
> > > > > > >>
> > > > > > >> With kind regards
> > > > > > >>
> > > > > > >> Maruan Sahyoun
> > > > > > >>
> > > > > > >>
> > > > > > > Check out Jeremias' suggestions of the viewer, it's less of a
> > > viewer
> > > > and
> > > > > > > more of a front-end for a lot of the tools PDFBox has to
> offer, a
> > > > PDFBox
> > > > > > > GUI so to speak rather than a PDF viewer.
> > > > > >
> > > > > > I'd still look into enhancing PDFBox core as this will benefit
> most
> > > > users.
> > > > > > Looking at the bugs and issues most come from core capabilities.
> > > >
> > > >
> > > >
> > > >
> > > > Jeremias Maerki
> > > >
> > > >
> > >
> >
> >
> >
> > --
> > Thanks & Regards,
> > Tharaka Wijebandara,
> > Faculty of Information Technology,
> > University of Moratuwa.
> >
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

Ok, well let's start with thinking about how this GUI is going to look. In
its current form, it's looking a little dated, it might be worth using some
of the GUI frameworks out there to spruce it up a little and making it a
bit more modern. So if we get a list of TODOs (starting with Jeremias'
suggestions):

- Investigate and design a proposal for upgrading the PDFReader GUI (i.e.
modernizing it)
- Implement a PDF object inspector to display PDF object properties when
objects are "clicked-on" in the viewer
- Implement a type-writer feature to add/remove text from a content stream
from within the PDFViewer
- Design a plugin framework for the viewer

Maybe those are a good list of primary objectives? We should think about
some secondary objectives, like possibly supporting Optional Content in the
view (not sure if it already is) and better handling of transparency layers?

Just some thoughts, let's keep this discussions fluid for now, we haven't
actually got that long before we have to submit a proposal though.

Mehdi

On 22 March 2012 03:04, Tharaka Nayanajith Wijebandara <tharaka.nw@gmail.com
> wrote:

> Hi,
>
>
> Thanks everyone for your valuable ideas and comments.
>
>
> It seems most of you like to enhancing PDFReader project. Specially I
> prefer Jeremias' idea, develop application which allow user to access
> PDFBox features through the GUI and PDF viewer with integrated PDF
> Debugger. In my opinion it will be very much useful tool for users as well
> as developers rather than just another PDF reader. So I would like to
> continue with this project. However, since there are several ways to
> enhance PDFReader, we have to define the scope of the project according to
> priority in next few days.
>
>
> Additionally I want to mention another thing here. As my experience current
> PDFReader is also not very much reliable and not working smoothly. I still
> can't understand that whether it is problem of code of PDFReader or PDFBox
> core. Anyway I think that it's better if we can consider also this in the
> project, if it's not some issue in PDFBox core.
>
> On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <me...@gmail.com>
> wrote:
>
> > Hi Tharaka,
> >
> > You have plenty of options here, the student application deadline is on
> the
> > 6th April (see the calendar
> > http://www.google-melange.com/gsoc/events/google/gsoc2012), so you've
> got
> > plenty of time.
> >
> > Let me and/or the community know if you have any questions about the
> > projects. If you have an idea of which project you'd prefer, maybe we can
> > start drawing up some goals and a schedule and discuss with the community
> > to get some feedback on which facets are a) interesting b) useful or c)
> > both!
> >
> > Mehdi
> >
> > On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:
> >
> > > There are a number of HTML to XSL-FO converters. I've never checked how
> > > good the results are but ultimately a native HTML engine is likely to
> > > produce better results especially since HTML was not really designed
> for
> > > print and has to be retrofitted in CSS3. But again, this is a large
> > > project and, IMO, out of scope for PDFBox.
> > >
> > > http://html2fo.sourceforge.net/
> > > http://denature.sourceforge.net/
> > > ...and probably others...
> > >
> > > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > > I vote for a PDS editor kind of stuff.
> > > > Another thing regarding the HTML to PDS, cant it be done with XSL-FO
> > > > feature?
> > > > Thanks
> > > >
> > > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> > sahyoun@fileaffairs.de
> > > >wrote:
> > > >
> > > > >
> > > > > > <snip/>
> > > > > >
> > > > > >>
> > > > > >> Although I think that the current PDF Reader can be enhanced in
> > many
> > > > > ways
> > > > > >> there are already so many Readers out there as well as PDF
> support
> > > > > within
> > > > > >> web browsers my personal opinion is that enhancing PDFBox core
> > > > > capabilities
> > > > > >> would be more beneficial.
> > > > > >>
> > > > > >> With kind regards
> > > > > >>
> > > > > >> Maruan Sahyoun
> > > > > >>
> > > > > >>
> > > > > > Check out Jeremias' suggestions of the viewer, it's less of a
> > viewer
> > > and
> > > > > > more of a front-end for a lot of the tools PDFBox has to offer, a
> > > PDFBox
> > > > > > GUI so to speak rather than a PDF viewer.
> > > > >
> > > > > I'd still look into enhancing PDFBox core as this will benefit most
> > > users.
> > > > > Looking at the bugs and issues most come from core capabilities.
> > >
> > >
> > >
> > >
> > > Jeremias Maerki
> > >
> > >
> >
>
>
>
> --
> Thanks & Regards,
> Tharaka Wijebandara,
> Faculty of Information Technology,
> University of Moratuwa.
>

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,


Thanks everyone for your valuable ideas and comments.


It seems most of you like to enhancing PDFReader project. Specially I
prefer Jeremias' idea, develop application which allow user to access
PDFBox features through the GUI and PDF viewer with integrated PDF
Debugger. In my opinion it will be very much useful tool for users as well
as developers rather than just another PDF reader. So I would like to
continue with this project. However, since there are several ways to
enhance PDFReader, we have to define the scope of the project according to
priority in next few days.


Additionally I want to mention another thing here. As my experience current
PDFReader is also not very much reliable and not working smoothly. I still
can't understand that whether it is problem of code of PDFReader or PDFBox
core. Anyway I think that it's better if we can consider also this in the
project, if it's not some issue in PDFBox core.

On Wed, Mar 21, 2012 at 10:26 PM, mehdi houshmand <me...@gmail.com> wrote:

> Hi Tharaka,
>
> You have plenty of options here, the student application deadline is on the
> 6th April (see the calendar
> http://www.google-melange.com/gsoc/events/google/gsoc2012), so you've got
> plenty of time.
>
> Let me and/or the community know if you have any questions about the
> projects. If you have an idea of which project you'd prefer, maybe we can
> start drawing up some goals and a schedule and discuss with the community
> to get some feedback on which facets are a) interesting b) useful or c)
> both!
>
> Mehdi
>
> On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:
>
> > There are a number of HTML to XSL-FO converters. I've never checked how
> > good the results are but ultimately a native HTML engine is likely to
> > produce better results especially since HTML was not really designed for
> > print and has to be retrofitted in CSS3. But again, this is a large
> > project and, IMO, out of scope for PDFBox.
> >
> > http://html2fo.sourceforge.net/
> > http://denature.sourceforge.net/
> > ...and probably others...
> >
> > On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > > I vote for a PDS editor kind of stuff.
> > > Another thing regarding the HTML to PDS, cant it be done with XSL-FO
> > > feature?
> > > Thanks
> > >
> > > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <
> sahyoun@fileaffairs.de
> > >wrote:
> > >
> > > >
> > > > > <snip/>
> > > > >
> > > > >>
> > > > >> Although I think that the current PDF Reader can be enhanced in
> many
> > > > ways
> > > > >> there are already so many Readers out there as well as PDF support
> > > > within
> > > > >> web browsers my personal opinion is that enhancing PDFBox core
> > > > capabilities
> > > > >> would be more beneficial.
> > > > >>
> > > > >> With kind regards
> > > > >>
> > > > >> Maruan Sahyoun
> > > > >>
> > > > >>
> > > > > Check out Jeremias' suggestions of the viewer, it's less of a
> viewer
> > and
> > > > > more of a front-end for a lot of the tools PDFBox has to offer, a
> > PDFBox
> > > > > GUI so to speak rather than a PDF viewer.
> > > >
> > > > I'd still look into enhancing PDFBox core as this will benefit most
> > users.
> > > > Looking at the bugs and issues most come from core capabilities.
> >
> >
> >
> >
> > Jeremias Maerki
> >
> >
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

You have plenty of options here, the student application deadline is on the
6th April (see the calendar
http://www.google-melange.com/gsoc/events/google/gsoc2012), so you've got
plenty of time.

Let me and/or the community know if you have any questions about the
projects. If you have an idea of which project you'd prefer, maybe we can
start drawing up some goals and a schedule and discuss with the community
to get some feedback on which facets are a) interesting b) useful or c)
both!

Mehdi

On 20 March 2012 15:58, Jeremias Maerki <de...@jeremias-maerki.ch> wrote:

> There are a number of HTML to XSL-FO converters. I've never checked how
> good the results are but ultimately a native HTML engine is likely to
> produce better results especially since HTML was not really designed for
> print and has to be retrofitted in CSS3. But again, this is a large
> project and, IMO, out of scope for PDFBox.
>
> http://html2fo.sourceforge.net/
> http://denature.sourceforge.net/
> ...and probably others...
>
> On 20.03.2012 13:02:33 Dexter Mishra wrote:
> > I vote for a PDS editor kind of stuff.
> > Another thing regarding the HTML to PDS, cant it be done with XSL-FO
> > feature?
> > Thanks
> >
> > On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <sahyoun@fileaffairs.de
> >wrote:
> >
> > >
> > > > <snip/>
> > > >
> > > >>
> > > >> Although I think that the current PDF Reader can be enhanced in many
> > > ways
> > > >> there are already so many Readers out there as well as PDF support
> > > within
> > > >> web browsers my personal opinion is that enhancing PDFBox core
> > > capabilities
> > > >> would be more beneficial.
> > > >>
> > > >> With kind regards
> > > >>
> > > >> Maruan Sahyoun
> > > >>
> > > >>
> > > > Check out Jeremias' suggestions of the viewer, it's less of a viewer
> and
> > > > more of a front-end for a lot of the tools PDFBox has to offer, a
> PDFBox
> > > > GUI so to speak rather than a PDF viewer.
> > >
> > > I'd still look into enhancing PDFBox core as this will benefit most
> users.
> > > Looking at the bugs and issues most come from core capabilities.
>
>
>
>
> Jeremias Maerki
>
>

Re: PDFBox Project for GSoC 2012

Posted by Jeremias Maerki <de...@jeremias-maerki.ch>.
There are a number of HTML to XSL-FO converters. I've never checked how
good the results are but ultimately a native HTML engine is likely to
produce better results especially since HTML was not really designed for
print and has to be retrofitted in CSS3. But again, this is a large
project and, IMO, out of scope for PDFBox.

http://html2fo.sourceforge.net/
http://denature.sourceforge.net/
...and probably others...

On 20.03.2012 13:02:33 Dexter Mishra wrote:
> I vote for a PDS editor kind of stuff.
> Another thing regarding the HTML to PDS, cant it be done with XSL-FO
> feature?
> Thanks
> 
> On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <sa...@fileaffairs.de>wrote:
> 
> >
> > > <snip/>
> > >
> > >>
> > >> Although I think that the current PDF Reader can be enhanced in many
> > ways
> > >> there are already so many Readers out there as well as PDF support
> > within
> > >> web browsers my personal opinion is that enhancing PDFBox core
> > capabilities
> > >> would be more beneficial.
> > >>
> > >> With kind regards
> > >>
> > >> Maruan Sahyoun
> > >>
> > >>
> > > Check out Jeremias' suggestions of the viewer, it's less of a viewer and
> > > more of a front-end for a lot of the tools PDFBox has to offer, a PDFBox
> > > GUI so to speak rather than a PDF viewer.
> >
> > I'd still look into enhancing PDFBox core as this will benefit most users.
> > Looking at the bugs and issues most come from core capabilities.




Jeremias Maerki


Re: PDFBox Project for GSoC 2012

Posted by Dexter Mishra <de...@gmail.com>.
I vote for a PDS editor kind of stuff.
Another thing regarding the HTML to PDS, cant it be done with XSL-FO
feature?
Thanks

On Tue, Mar 20, 2012 at 3:03 PM, Maruan Sahyoun <sa...@fileaffairs.de>wrote:

>
> > <snip/>
> >
> >>
> >> Although I think that the current PDF Reader can be enhanced in many
> ways
> >> there are already so many Readers out there as well as PDF support
> within
> >> web browsers my personal opinion is that enhancing PDFBox core
> capabilities
> >> would be more beneficial.
> >>
> >> With kind regards
> >>
> >> Maruan Sahyoun
> >>
> >>
> > Check out Jeremias' suggestions of the viewer, it's less of a viewer and
> > more of a front-end for a lot of the tools PDFBox has to offer, a PDFBox
> > GUI so to speak rather than a PDF viewer.
>
> I'd still look into enhancing PDFBox core as this will benefit most users.
> Looking at the bugs and issues most come from core capabilities.

Re: PDFBox Project for GSoC 2012

Posted by Maruan Sahyoun <sa...@fileaffairs.de>.
> <snip/>
> 
>> 
>> Although I think that the current PDF Reader can be enhanced in many ways
>> there are already so many Readers out there as well as PDF support within
>> web browsers my personal opinion is that enhancing PDFBox core capabilities
>> would be more beneficial.
>> 
>> With kind regards
>> 
>> Maruan Sahyoun
>> 
>> 
> Check out Jeremias' suggestions of the viewer, it's less of a viewer and
> more of a front-end for a lot of the tools PDFBox has to offer, a PDFBox
> GUI so to speak rather than a PDF viewer.

I'd still look into enhancing PDFBox core as this will benefit most users. Looking at the bugs and issues most come from core capabilities.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
 <snip/>

>
> Although I think that the current PDF Reader can be enhanced in many ways
> there are already so many Readers out there as well as PDF support within
> web browsers my personal opinion is that enhancing PDFBox core capabilities
> would be more beneficial.
>
> With kind regards
>
> Maruan Sahyoun
>
>
Check out Jeremias' suggestions of the viewer, it's less of a viewer and
more of a front-end for a lot of the tools PDFBox has to offer, a PDFBox
GUI so to speak rather than a PDF viewer.

Re: PDFBox Project for GSoC 2012

Posted by Maruan Sahyoun <sa...@fileaffairs.de>.
Hi,

> Hi,
> 
> Am 18.03.2012 03:16, schrieb Tharaka Nayanajith Wijebandara:
>> Hi,
>> 
>> 
>> Thanks mehdi.
>> 
>> 
>> I have two ideas for a GSoC task, but need all of your help to select
>> suitable one.
>> 
>> 
>>    - One project is HTML to PDF and vise versa converter. This feature can
>>    be found in JIRA also (https://issues.apache.org/jira/browse/PDFBOX-6,
>>    https://issues.apache.org/jira/browse/PDFBOX-9)
> Good idea, but complicated, as some of the feature you would need aren't yet implemented.

I think PDF to HTML is a very good idea even if it will be very limited because as Andreas pointed out there are some features missing. Maybe these can be documented and eventually be implemented.

> 
>>    - Other one is enhancing features of PDF reader and zooming features,
>>    page display features, bookmark navigator, page thumbnail viewer can be
>>    very much useful. Since I have previous experience in awt, swing and
>>    java2d, it will be easy for me.
> I like this idea. It would be a nice feature.

Although I think that the current PDF Reader can be enhanced in many ways there are already so many Readers out there as well as PDF support within web browsers my personal opinion is that enhancing PDFBox core capabilities would be more beneficial.

With kind regards

Maruan Sahyoun   

> 
>> There might be several other tasks which are important than this. So all of
>> you are welcome, to reply with good ideas.
> Yes there are a lot things to do, probably someone else might come up with a wish?
> 
>> On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand<me...@gmail.com>  wrote:
>> 
>>> Hi Tharaka,
>>> 
>>> Have you had any more thoughts on a project you'd like to undertake?
>>> Have you applied and been through all the admin needed to be accepted
>>> into GSoC 2012? Let me know if you need any help.
>>> 
>>> Mehdi
>>> 
>>> On 9 March 2012 06:25, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>>> Hi,
>>>> 
>>>> Am 07.03.2012 07:40, schrieb mehdi houshmand:
>>>>> 
>>>>> Hi Andreas,
>>>>> 
>>>>> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
>>>>> PDFBox.. . I do have domain expertise being a full-time FOP developer
>>>>> and having dealt with PDFs and fonts quite a bit. Should I pop an
>>>>> email to dev-community to see if it's ok? It seems like such a waste
>>>>> to have an interested applicant but no mentor...
>>>> 
>>>> I'm not an GSoC expert but that sounds good to me. You may double check
>>> with
>>>> the dev-community, but IMHO it's not necessary.
>>>> I'm glad that you volunteer to help us, thanks in advance. I'll try to
>>> help
>>>> as much as I can.
>>>> 
> <SNIP>
> 
> BR
> Andreas Lehmkühler


Re: PDFBox Project for GSoC 2012

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

Am 18.03.2012 03:16, schrieb Tharaka Nayanajith Wijebandara:
> Hi,
>
>
> Thanks mehdi.
>
>
> I have two ideas for a GSoC task, but need all of your help to select
> suitable one.
>
>
>     - One project is HTML to PDF and vise versa converter. This feature can
>     be found in JIRA also (https://issues.apache.org/jira/browse/PDFBOX-6,
>     https://issues.apache.org/jira/browse/PDFBOX-9)
Good idea, but complicated, as some of the feature you would need aren't yet 
implemented.

>     - Other one is enhancing features of PDF reader and zooming features,
>     page display features, bookmark navigator, page thumbnail viewer can be
>     very much useful. Since I have previous experience in awt, swing and
>     java2d, it will be easy for me.
I like this idea. It would be a nice feature.

> There might be several other tasks which are important than this. So all of
> you are welcome, to reply with good ideas.
Yes there are a lot things to do, probably someone else might come up with a wish?

> On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand<me...@gmail.com>  wrote:
>
>> Hi Tharaka,
>>
>> Have you had any more thoughts on a project you'd like to undertake?
>> Have you applied and been through all the admin needed to be accepted
>> into GSoC 2012? Let me know if you need any help.
>>
>> Mehdi
>>
>> On 9 March 2012 06:25, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>> Hi,
>>>
>>> Am 07.03.2012 07:40, schrieb mehdi houshmand:
>>>>
>>>> Hi Andreas,
>>>>
>>>> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
>>>> PDFBox.. . I do have domain expertise being a full-time FOP developer
>>>> and having dealt with PDFs and fonts quite a bit. Should I pop an
>>>> email to dev-community to see if it's ok? It seems like such a waste
>>>> to have an interested applicant but no mentor...
>>>
>>> I'm not an GSoC expert but that sounds good to me. You may double check
>> with
>>> the dev-community, but IMHO it's not necessary.
>>> I'm glad that you volunteer to help us, thanks in advance. I'll try to
>> help
>>> as much as I can.
>>>
<SNIP>

BR
Andreas Lehmkühler

Re: PDFBox Project for GSoC 2012

Posted by Timo Boehme <ti...@ontochem.com>.
Hi,

Am 18.03.2012 03:16, schrieb Tharaka Nayanajith Wijebandara:
> I have two ideas for a GSoC task, but need all of your help to select
> suitable one.
>
>
>     - One project is HTML to PDF and vise versa converter. ...
>
>     - Other one is enhancing features of PDF reader and zooming features,
>     page display features, bookmark navigator, page thumbnail viewer can be
>     very much useful. Since I have previous experience in awt, swing and
>     java2d, it will be easy for me.

I also like the proposal for an enhancement of the PDF reader/viewer. 
Especially for a GSoC project it is preferable to have clear (viewable) 
results in the end.
 From a PDFBOX developer point of view it would also be nice to add some 
PDF debugging features to the viewer like providing object ids, font 
information etc. when hovering over text/image.


Kind regards,

Timo

-- 

  Timo Boehme
  OntoChem GmbH
  H.-Damerow-Str. 4
  06120 Halle/Saale
  T: +49 345 4780474
  F: +49 345 4780471
  timo.boehme@ontochem.com

_____________________________________________________________________

  OntoChem GmbH
  Geschäftsführer: Dr. Lutz Weber
  Sitz: Halle / Saale
  Registergericht: Stendal
  Registernummer: HRB 215461
_____________________________________________________________________


Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,


Thanks mehdi.


I have two ideas for a GSoC task, but need all of your help to select
suitable one.


   - One project is HTML to PDF and vise versa converter. This feature can
   be found in JIRA also (https://issues.apache.org/jira/browse/PDFBOX-6,
   https://issues.apache.org/jira/browse/PDFBOX-9)


   - Other one is enhancing features of PDF reader and zooming features,
   page display features, bookmark navigator, page thumbnail viewer can be
   very much useful. Since I have previous experience in awt, swing and
   java2d, it will be easy for me.


There might be several other tasks which are important than this. So all of
you are welcome, to reply with good ideas.

On Sat, Mar 17, 2012 at 5:01 PM, mehdi houshmand <me...@gmail.com> wrote:

> Hi Tharaka,
>
> Have you had any more thoughts on a project you'd like to undertake?
> Have you applied and been through all the admin needed to be accepted
> into GSoC 2012? Let me know if you need any help.
>
> Mehdi
>
> On 9 March 2012 06:25, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> > Hi,
> >
> > Am 07.03.2012 07:40, schrieb mehdi houshmand:
> >>
> >> Hi Andreas,
> >>
> >> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
> >> PDFBox.. . I do have domain expertise being a full-time FOP developer
> >> and having dealt with PDFs and fonts quite a bit. Should I pop an
> >> email to dev-community to see if it's ok? It seems like such a waste
> >> to have an interested applicant but no mentor...
> >
> > I'm not an GSoC expert but that sounds good to me. You may double check
> with
> > the dev-community, but IMHO it's not necessary.
> > I'm glad that you volunteer to help us, thanks in advance. I'll try to
> help
> > as much as I can.
> >
> >
> > BR
> > Andreas Lehmkühler
> >
> >
> >> Mehdi
> >>
> >> On 6 March 2012 21:32, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
> >>>
> >>> Hi,
> >>>
> >>>
> >>> Am 06.03.2012 21:24, schrieb mehdi houshmand:
> >>>
> >>>> Hi Andreas,
> >>>>
> >>>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
> >>>> putting myself forward as a candidate... Of course, that is if no one
> >>>> else does.
> >>>
> >>>
> >>> Thanks for the offer, but AFAIKT it's not possible. According to [1]
> the
> >>> mentor has to be an ASF member or committer.
> >>>
> >>>
> >>>> Mehdi
> >>>>
> >>>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>
>  wrote:
> >>>>>
> >>>>>
> >>>>> Hi,
> >>>>>
> >>>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
> >>>>>
> >>>>>> Hi,
> >>>>>>
> >>>>>>
> >>>>>> I'm university student in Sri Lanka and a newbie to Open Source
> >>>>>> Development. I would like to participate for Google Summer of Code
> >>>>>> 2012
> >>>>>> with an Apache Project. Since I'm familiar with Java and I have used
> >>>>>> PDFBox
> >>>>>> Library for my academic project, I like to develop new feature for
> >>>>>> PDFBox
> >>>>>> as my GSoC project. First of all I want to know that is it possible
> to
> >>>>>> participate for GSoC 2012 with PDFBox project?
> >>>>>>
> >>>>>>
> >>>>>> If it is yes, I want help from PDFBox development community to
> select
> >>>>>> appropriate PDFBox task for GSoC.
> >>>>>
> >>>>>
> >>>>>
> >>>>> There is a lot to do and I'm sure that some of those jobs should
> >>>>> qualify
> >>>>> as
> >>>>> GSoC task.
> >>>>>
> >>>>>
> >>>>>> If you have any idea about good project or advice for me, please
> reply
> >>>>>> to
> >>>>>> this.
> >>>>>
> >>>>>
> >>>>>
> >>>>> You will need a mentor and I'm not sure if you will find one among
> our
> >>>>> ranks. I'd like to support you, but I can't do it due to personal
> >>>>> reasons.
> >>>>>
> >>>>> Anybody else?
> >>>>>
> >>>>>
> >>>>> BR
> >>>>> Andreas Lehmkühler
> >>>
> >>>
> >>>
> >>> BR
> >>> Andreas Lehmkühler
> >>>
> >>> [1] http://community.apache.org/guide-to-being-a-mentor.html
> >
> >
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Tharaka,

Have you had any more thoughts on a project you'd like to undertake?
Have you applied and been through all the admin needed to be accepted
into GSoC 2012? Let me know if you need any help.

Mehdi

On 9 March 2012 06:25, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> Hi,
>
> Am 07.03.2012 07:40, schrieb mehdi houshmand:
>>
>> Hi Andreas,
>>
>> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
>> PDFBox.. . I do have domain expertise being a full-time FOP developer
>> and having dealt with PDFs and fonts quite a bit. Should I pop an
>> email to dev-community to see if it's ok? It seems like such a waste
>> to have an interested applicant but no mentor...
>
> I'm not an GSoC expert but that sounds good to me. You may double check with
> the dev-community, but IMHO it's not necessary.
> I'm glad that you volunteer to help us, thanks in advance. I'll try to help
> as much as I can.
>
>
> BR
> Andreas Lehmkühler
>
>
>> Mehdi
>>
>> On 6 March 2012 21:32, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>>
>>> Hi,
>>>
>>>
>>> Am 06.03.2012 21:24, schrieb mehdi houshmand:
>>>
>>>> Hi Andreas,
>>>>
>>>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
>>>> putting myself forward as a candidate... Of course, that is if no one
>>>> else does.
>>>
>>>
>>> Thanks for the offer, but AFAIKT it's not possible. According to [1] the
>>> mentor has to be an ASF member or committer.
>>>
>>>
>>>> Mehdi
>>>>
>>>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>    wrote:
>>>>>
>>>>>
>>>>> Hi,
>>>>>
>>>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>>
>>>>>> I'm university student in Sri Lanka and a newbie to Open Source
>>>>>> Development. I would like to participate for Google Summer of Code
>>>>>> 2012
>>>>>> with an Apache Project. Since I'm familiar with Java and I have used
>>>>>> PDFBox
>>>>>> Library for my academic project, I like to develop new feature for
>>>>>> PDFBox
>>>>>> as my GSoC project. First of all I want to know that is it possible to
>>>>>> participate for GSoC 2012 with PDFBox project?
>>>>>>
>>>>>>
>>>>>> If it is yes, I want help from PDFBox development community to select
>>>>>> appropriate PDFBox task for GSoC.
>>>>>
>>>>>
>>>>>
>>>>> There is a lot to do and I'm sure that some of those jobs should
>>>>> qualify
>>>>> as
>>>>> GSoC task.
>>>>>
>>>>>
>>>>>> If you have any idea about good project or advice for me, please reply
>>>>>> to
>>>>>> this.
>>>>>
>>>>>
>>>>>
>>>>> You will need a mentor and I'm not sure if you will find one among our
>>>>> ranks. I'd like to support you, but I can't do it due to personal
>>>>> reasons.
>>>>>
>>>>> Anybody else?
>>>>>
>>>>>
>>>>> BR
>>>>> Andreas Lehmkühler
>>>
>>>
>>>
>>> BR
>>> Andreas Lehmkühler
>>>
>>> [1] http://community.apache.org/guide-to-being-a-mentor.html
>
>

Re: PDFBox Project for GSoC 2012

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

Am 07.03.2012 07:40, schrieb mehdi houshmand:
> Hi Andreas,
>
> Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
> PDFBox.. . I do have domain expertise being a full-time FOP developer
> and having dealt with PDFs and fonts quite a bit. Should I pop an
> email to dev-community to see if it's ok? It seems like such a waste
> to have an interested applicant but no mentor...
I'm not an GSoC expert but that sounds good to me. You may double check with the 
dev-community, but IMHO it's not necessary.
I'm glad that you volunteer to help us, thanks in advance. I'll try to help as 
much as I can.


BR
Andreas Lehmkühler

> Mehdi
>
> On 6 March 2012 21:32, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>> Hi,
>>
>>
>> Am 06.03.2012 21:24, schrieb mehdi houshmand:
>>
>>> Hi Andreas,
>>>
>>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
>>> putting myself forward as a candidate... Of course, that is if no one
>>> else does.
>>
>> Thanks for the offer, but AFAIKT it's not possible. According to [1] the
>> mentor has to be an ASF member or committer.
>>
>>
>>> Mehdi
>>>
>>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>    wrote:
>>>>
>>>> Hi,
>>>>
>>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>>>>
>>>>> Hi,
>>>>>
>>>>>
>>>>> I'm university student in Sri Lanka and a newbie to Open Source
>>>>> Development. I would like to participate for Google Summer of Code 2012
>>>>> with an Apache Project. Since I'm familiar with Java and I have used
>>>>> PDFBox
>>>>> Library for my academic project, I like to develop new feature for
>>>>> PDFBox
>>>>> as my GSoC project. First of all I want to know that is it possible to
>>>>> participate for GSoC 2012 with PDFBox project?
>>>>>
>>>>>
>>>>> If it is yes, I want help from PDFBox development community to select
>>>>> appropriate PDFBox task for GSoC.
>>>>
>>>>
>>>> There is a lot to do and I'm sure that some of those jobs should qualify
>>>> as
>>>> GSoC task.
>>>>
>>>>
>>>>> If you have any idea about good project or advice for me, please reply
>>>>> to
>>>>> this.
>>>>
>>>>
>>>> You will need a mentor and I'm not sure if you will find one among our
>>>> ranks. I'd like to support you, but I can't do it due to personal
>>>> reasons.
>>>>
>>>> Anybody else?
>>>>
>>>>
>>>> BR
>>>> Andreas Lehmkühler
>>
>>
>> BR
>> Andreas Lehmkühler
>>
>> [1] http://community.apache.org/guide-to-being-a-mentor.html


Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Andreas,

Sorry, maybe I wasn't clear, I am an ASF committer... Just not to
PDFBox.. . I do have domain expertise being a full-time FOP developer
and having dealt with PDFs and fonts quite a bit. Should I pop an
email to dev-community to see if it's ok? It seems like such a waste
to have an interested applicant but no mentor...


Mehdi

On 6 March 2012 21:32, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> Hi,
>
>
> Am 06.03.2012 21:24, schrieb mehdi houshmand:
>
>> Hi Andreas,
>>
>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
>> putting myself forward as a candidate... Of course, that is if no one
>> else does.
>
> Thanks for the offer, but AFAIKT it's not possible. According to [1] the
> mentor has to be an ASF member or committer.
>
>
>> Mehdi
>>
>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>>
>>> Hi,
>>>
>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>>>
>>>> Hi,
>>>>
>>>>
>>>> I'm university student in Sri Lanka and a newbie to Open Source
>>>> Development. I would like to participate for Google Summer of Code 2012
>>>> with an Apache Project. Since I'm familiar with Java and I have used
>>>> PDFBox
>>>> Library for my academic project, I like to develop new feature for
>>>> PDFBox
>>>> as my GSoC project. First of all I want to know that is it possible to
>>>> participate for GSoC 2012 with PDFBox project?
>>>>
>>>>
>>>> If it is yes, I want help from PDFBox development community to select
>>>> appropriate PDFBox task for GSoC.
>>>
>>>
>>> There is a lot to do and I'm sure that some of those jobs should qualify
>>> as
>>> GSoC task.
>>>
>>>
>>>> If you have any idea about good project or advice for me, please reply
>>>> to
>>>> this.
>>>
>>>
>>> You will need a mentor and I'm not sure if you will find one among our
>>> ranks. I'd like to support you, but I can't do it due to personal
>>> reasons.
>>>
>>> Anybody else?
>>>
>>>
>>> BR
>>> Andreas Lehmkühler
>
>
> BR
> Andreas Lehmkühler
>
> [1] http://community.apache.org/guide-to-being-a-mentor.html

Re: PDFBox Project for GSoC 2012

Posted by Tharaka Nayanajith Wijebandara <th...@gmail.com>.
Hi,

Andreas,Mehdi Thanks for the response.

It seems hard to find a mentor. Anyway I hopefully continue my effort on
the project.

I would very much appreciate it if someone would like to help me by being a
mentor and it will be remarkable experience for both of us as well as our
community.

On Wed, Mar 7, 2012 at 3:02 AM, Andreas Lehmkuehler <an...@lehmi.de>wrote:

> Hi,
>
>
> Am 06.03.2012 21:24, schrieb mehdi houshmand:
>
>  Hi Andreas,
>>
>> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
>> putting myself forward as a candidate... Of course, that is if no one
>> else does.
>>
> Thanks for the offer, but AFAIKT it's not possible. According to [1] the
> mentor has to be an ASF member or committer.
>
>
>  Mehdi
>>
>> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>>
>>> Hi,
>>>
>>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>>>
>>>  Hi,
>>>>
>>>>
>>>> I'm university student in Sri Lanka and a newbie to Open Source
>>>> Development. I would like to participate for Google Summer of Code 2012
>>>> with an Apache Project. Since I'm familiar with Java and I have used
>>>> PDFBox
>>>> Library for my academic project, I like to develop new feature for
>>>> PDFBox
>>>> as my GSoC project. First of all I want to know that is it possible to
>>>> participate for GSoC 2012 with PDFBox project?
>>>>
>>>>
>>>> If it is yes, I want help from PDFBox development community to select
>>>> appropriate PDFBox task for GSoC.
>>>>
>>>
>>> There is a lot to do and I'm sure that some of those jobs should qualify
>>> as
>>> GSoC task.
>>>
>>>
>>>  If you have any idea about good project or advice for me, please reply
>>>> to
>>>> this.
>>>>
>>>
>>> You will need a mentor and I'm not sure if you will find one among our
>>> ranks. I'd like to support you, but I can't do it due to personal
>>> reasons.
>>>
>>> Anybody else?
>>>
>>>
>>> BR
>>> Andreas Lehmkühler
>>>
>>
> BR
> Andreas Lehmkühler
>
> [1] http://community.apache.org/**guide-to-being-a-mentor.html<http://community.apache.org/guide-to-being-a-mentor.html>
>



-- 
Thanks & Regards,
Tharaka Wijebandara,
Faculty of Information Technology,
University of Moratuwa.

Re: PDFBox Project for GSoC 2012

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,


Am 06.03.2012 21:24, schrieb mehdi houshmand:
> Hi Andreas,
>
> Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
> putting myself forward as a candidate... Of course, that is if no one
> else does.
Thanks for the offer, but AFAIKT it's not possible. According to [1] the mentor 
has to be an ASF member or committer.

> Mehdi
>
> On 6 March 2012 18:43, Andreas Lehmkuehler<an...@lehmi.de>  wrote:
>> Hi,
>>
>> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>>
>>> Hi,
>>>
>>>
>>> I'm university student in Sri Lanka and a newbie to Open Source
>>> Development. I would like to participate for Google Summer of Code 2012
>>> with an Apache Project. Since I'm familiar with Java and I have used
>>> PDFBox
>>> Library for my academic project, I like to develop new feature for PDFBox
>>> as my GSoC project. First of all I want to know that is it possible to
>>> participate for GSoC 2012 with PDFBox project?
>>>
>>>
>>> If it is yes, I want help from PDFBox development community to select
>>> appropriate PDFBox task for GSoC.
>>
>> There is a lot to do and I'm sure that some of those jobs should qualify as
>> GSoC task.
>>
>>
>>> If you have any idea about good project or advice for me, please reply to
>>> this.
>>
>> You will need a mentor and I'm not sure if you will find one among our
>> ranks. I'd like to support you, but I can't do it due to personal reasons.
>>
>> Anybody else?
>>
>>
>> BR
>> Andreas Lehmkühler

BR
Andreas Lehmkühler

[1] http://community.apache.org/guide-to-being-a-mentor.html

Re: PDFBox Project for GSoC 2012

Posted by mehdi houshmand <me...@gmail.com>.
Hi Andreas,

Does the mentor need to be a PDFBox committer? If not, I wouldn't mind
putting myself forward as a candidate... Of course, that is if no one
else does.

Mehdi

On 6 March 2012 18:43, Andreas Lehmkuehler <an...@lehmi.de> wrote:
> Hi,
>
> Am 29.02.2012 03:50, schrieb Tharaka Nayanajith Wijebandara:
>
>> Hi,
>>
>>
>> I'm university student in Sri Lanka and a newbie to Open Source
>> Development. I would like to participate for Google Summer of Code 2012
>> with an Apache Project. Since I'm familiar with Java and I have used
>> PDFBox
>> Library for my academic project, I like to develop new feature for PDFBox
>> as my GSoC project. First of all I want to know that is it possible to
>> participate for GSoC 2012 with PDFBox project?
>>
>>
>> If it is yes, I want help from PDFBox development community to select
>> appropriate PDFBox task for GSoC.
>
> There is a lot to do and I'm sure that some of those jobs should qualify as
> GSoC task.
>
>
>> If you have any idea about good project or advice for me, please reply to
>> this.
>
> You will need a mentor and I'm not sure if you will find one among our
> ranks. I'd like to support you, but I can't do it due to personal reasons.
>
> Anybody else?
>
>
> BR
> Andreas Lehmkühler