You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by Yegor Kozlov <ye...@dinom.ru> on 2012/03/21 12:03:33 UTC

Google Summer of Code: Apache POI

Hi All,

GSoC 2012 is on the way and it is a good change to draw more attention
to the POI project and attract new contributors / potential
committers.

If there are no objections, I would like to submit a proposal.

I've already registered as a mentor and if any one else fancy , please
go ahead and register.

So far I have two ideas:

 (1) Improve drawing support in HSSF. The main issue is that HSSF can
create new drawings from scratch but cannot modify existing ones. This
means that you cannot add graphic elements such as pictures, shapes
and cell comments to sheets that already have drawings. This drawback
is frequently reported on the mailing lists and the fix is in high
demand.

The student will need to have deep knowledge of the Excel Binary
(Biff8) and MS Drawing (Escher) formats and current architecture of
HSSF.

 (2) Support extraction of VBA macro code from both binary and OOXML
documents. The goal is to read VBA code as text.
  It will be a new module and a new format parser and I guess a lot of
low-level hacking. Armed with the spec ([MS-OVBA].pdf ), it should be
feasible to implement within a month or two.



Regards,
Yegor

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: Google Summer of Code: Apache POI

Posted by Yegor Kozlov <ye...@dinom.ru>.
https://issues.apache.org/jira/browse/COMDEV-73
https://issues.apache.org/jira/browse/COMDEV-74

On Wed, Mar 21, 2012 at 3:03 PM, Yegor Kozlov <ye...@dinom.ru> wrote:
> Hi All,
>
> GSoC 2012 is on the way and it is a good change to draw more attention
> to the POI project and attract new contributors / potential
> committers.
>
> If there are no objections, I would like to submit a proposal.
>
> I've already registered as a mentor and if any one else fancy , please
> go ahead and register.
>
> So far I have two ideas:
>
>  (1) Improve drawing support in HSSF. The main issue is that HSSF can
> create new drawings from scratch but cannot modify existing ones. This
> means that you cannot add graphic elements such as pictures, shapes
> and cell comments to sheets that already have drawings. This drawback
> is frequently reported on the mailing lists and the fix is in high
> demand.
>
> The student will need to have deep knowledge of the Excel Binary
> (Biff8) and MS Drawing (Escher) formats and current architecture of
> HSSF.
>
>  (2) Support extraction of VBA macro code from both binary and OOXML
> documents. The goal is to read VBA code as text.
>  It will be a new module and a new format parser and I guess a lot of
> low-level hacking. Armed with the spec ([MS-OVBA].pdf ), it should be
> feasible to implement within a month or two.
>
>
>
> Regards,
> Yegor

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: Google Summer of Code: Apache POI

Posted by Yegor Kozlov <ye...@dinom.ru>.
https://issues.apache.org/jira/browse/COMDEV-75

On Wed, Mar 21, 2012 at 3:03 PM, Yegor Kozlov <ye...@dinom.ru> wrote:
> Hi All,
>
> GSoC 2012 is on the way and it is a good change to draw more attention
> to the POI project and attract new contributors / potential
> committers.
>
> If there are no objections, I would like to submit a proposal.
>
> I've already registered as a mentor and if any one else fancy , please
> go ahead and register.
>
> So far I have two ideas:
>
>  (1) Improve drawing support in HSSF. The main issue is that HSSF can
> create new drawings from scratch but cannot modify existing ones. This
> means that you cannot add graphic elements such as pictures, shapes
> and cell comments to sheets that already have drawings. This drawback
> is frequently reported on the mailing lists and the fix is in high
> demand.
>
> The student will need to have deep knowledge of the Excel Binary
> (Biff8) and MS Drawing (Escher) formats and current architecture of
> HSSF.
>
>  (2) Support extraction of VBA macro code from both binary and OOXML
> documents. The goal is to read VBA code as text.
>  It will be a new module and a new format parser and I guess a lot of
> low-level hacking. Armed with the spec ([MS-OVBA].pdf ), it should be
> feasible to implement within a month or two.
>
>
>
> Regards,
> Yegor

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: Google Summer of Code: Apache POI

Posted by Jim Talbut <jt...@spudsoft.co.uk>.
>  These are out of my expertise.
I'm amazed, I didn't think anything was out of your expertise.


>> And the minor one:
>> 4. Support for RichText in SXSSF, support for not streaming some rows (so a small number could be post processed) would be useful too.
>>
> this one seems to small for GSOC. Create a feature request in Bugzilla
> and there is a good chance we will have it in PO-3.9 .
RichText: https://issues.apache.org/bugzilla/show_bug.cgi?id=52972
And excluding some rows: 
https://issues.apache.org/bugzilla/show_bug.cgi?id=52973
> P.S. Jim, are you by chance a student? ;)
I'm afraid not, but if I find myself with an unfeasibly large amount of 
time on my hands I may take a look at the pivot tables.

Jim

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: Google Summer of Code: Apache POI

Posted by Yegor Kozlov <ye...@dinom.ru>.
On Thu, Mar 22, 2012 at 10:32 AM, Jim Talbut <jt...@spudsoft.co.uk> wrote:
> Yegor,
>
> There are three major (and one minor) things that I'd like to see added to POI that may be suitable for GSOC:
> 1. Support for creating charts.

It looks like a very good idea for GSOC. About a year back we received
a contribution with initial support for charts in XSSF. For now only
simple scattered charts are supported, here is an example:

https://svn.apache.org/repos/asf/poi/trunk/src/examples/src/org/apache/poi/xssf/usermodel/examples/ScatterChart.java

 Wherever possible, the code above uses common spreadsheet interfaces
so in theory  it should work for both HSSF and XSSF implementations.
 We need someone to take it up and continue and I'm going to post a
JIRA issue for it.


> 2. Support for creating Pivot Tables.
> 3. Full support for encrypted workbooks.

These are out of my expertise.

> And the minor one:
> 4. Support for RichText in SXSSF, support for not streaming some rows (so a small number could be post processed) would be useful too.
>

this one seems to small for GSOC. Create a feature request in Bugzilla
and there is a good chance we will have it in PO-3.9 .

P.S. Jim, are you by chance a student? ;)

Yegor
> Jim
>
> On 21 Mar 2012, at 11:03, Yegor Kozlov <ye...@dinom.ru> wrote:
>
>> Hi All,
>>
>> GSoC 2012 is on the way and it is a good change to draw more attention
>> to the POI project and attract new contributors / potential
>> committers.
>>
>> If there are no objections, I would like to submit a proposal.
>>
>> I've already registered as a mentor and if any one else fancy , please
>> go ahead and register.
>>
>> So far I have two ideas:
>>
>> (1) Improve drawing support in HSSF. The main issue is that HSSF can
>> create new drawings from scratch but cannot modify existing ones. This
>> means that you cannot add graphic elements such as pictures, shapes
>> and cell comments to sheets that already have drawings. This drawback
>> is frequently reported on the mailing lists and the fix is in high
>> demand.
>>
>> The student will need to have deep knowledge of the Excel Binary
>> (Biff8) and MS Drawing (Escher) formats and current architecture of
>> HSSF.
>>
>> (2) Support extraction of VBA macro code from both binary and OOXML
>> documents. The goal is to read VBA code as text.
>>  It will be a new module and a new format parser and I guess a lot of
>> low-level hacking. Armed with the spec ([MS-OVBA].pdf ), it should be
>> feasible to implement within a month or two.
>>
>>
>>
>> Regards,
>> Yegor
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
>> For additional commands, e-mail: dev-help@poi.apache.org
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
> For additional commands, e-mail: dev-help@poi.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: Google Summer of Code: Apache POI

Posted by Jim Talbut <jt...@spudsoft.co.uk>.
Yegor,

There are three major (and one minor) things that I'd like to see added to POI that may be suitable for GSOC:
1. Support for creating charts.
2. Support for creating Pivot Tables.
3. Full support for encrypted workbooks.
And the minor one:
4. Support for RichText in SXSSF, support for not streaming some rows (so a small number could be post processed) would be useful too.

Jim

On 21 Mar 2012, at 11:03, Yegor Kozlov <ye...@dinom.ru> wrote:

> Hi All,
> 
> GSoC 2012 is on the way and it is a good change to draw more attention
> to the POI project and attract new contributors / potential
> committers.
> 
> If there are no objections, I would like to submit a proposal.
> 
> I've already registered as a mentor and if any one else fancy , please
> go ahead and register.
> 
> So far I have two ideas:
> 
> (1) Improve drawing support in HSSF. The main issue is that HSSF can
> create new drawings from scratch but cannot modify existing ones. This
> means that you cannot add graphic elements such as pictures, shapes
> and cell comments to sheets that already have drawings. This drawback
> is frequently reported on the mailing lists and the fix is in high
> demand.
> 
> The student will need to have deep knowledge of the Excel Binary
> (Biff8) and MS Drawing (Escher) formats and current architecture of
> HSSF.
> 
> (2) Support extraction of VBA macro code from both binary and OOXML
> documents. The goal is to read VBA code as text.
>  It will be a new module and a new format parser and I guess a lot of
> low-level hacking. Armed with the spec ([MS-OVBA].pdf ), it should be
> feasible to implement within a month or two.
> 
> 
> 
> Regards,
> Yegor
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
> For additional commands, e-mail: dev-help@poi.apache.org
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org