Posted to dev@openoffice.apache.org by chengjh <ch...@apache.org> on 2012/05/29 09:24:52 UTC

Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Hi All,

The TOC (Table of Contents) is a significant feature in AOO Writer. Although
it already provides powerful capabilities that improve end-user productivity,
the following areas, especially fidelity with MS Word, still need
improvement. I propose them as candidates for the next release on
https://cwiki.apache.org/confluence/display/OOOUSERS/AOO+4.0+Feature+Planning
and welcome your comments. Thanks.

1) The TOC data of an MS Word document is not parsed completely. Instead,
the TOC that is displayed comes from a silent update performed once the MS
Word document has been loaded. Fidelity therefore cannot be ensured,
especially when document content that affects the TOC was changed after the
TOC was created in MS Word. We therefore propose to implement a TOC loading
process that replaces the update action.
2) The tab between the chapter number and the TOC entry is lost when loading
an MS Word document, which changes the gap between the chapter number and
the TOC entry. The result looks different from MS Word.
3) Jump information is lost when loading an MS Word TOC created with
"Use hyperlinks instead of page numbers" unchecked. For this kind of TOC,
end users in MS Word can only jump by Ctrl+clicking the page number of the
TOC entry.
4) Customized character attributes are lost when loading an MS Word TOC
created with "Use hyperlinks instead of page numbers" unchecked. For this
kind of TOC, MS Word can carry the customized character attributes of the
target paragraphs into the TOC.
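
To illustrate items 3 and 4, here is a minimal Python sketch, not AOO code.
It assumes that a Word TOC is stored as a field whose instruction text
carries switches, and that the "Use hyperlinks instead of page numbers"
checkbox corresponds to the \h switch. The helper name
parse_toc_instruction and the sample instruction strings are hypothetical.
An importer could inspect the switches roughly like this to decide which
jump information it has to preserve:

```python
import re


def parse_toc_instruction(instr: str) -> dict:
    """Split a TOC field instruction into switches and their optional arguments.

    Hypothetical helper for illustration only; it is not part of any AOO filter.
    """
    # Tokens are either quoted strings or runs of non-whitespace characters.
    tokens = re.findall(r'"[^"]*"|\S+', instr)
    if not tokens or tokens[0].upper() != "TOC":
        raise ValueError("not a TOC field instruction")

    switches = {}
    current = None
    for tok in tokens[1:]:
        if tok.startswith("\\"):
            # A new switch such as \o or \h; it has no argument until one follows.
            current = tok[1:]
            switches[current] = None
        elif current is not None:
            # The token after a switch is treated as that switch's argument.
            switches[current] = tok.strip('"')
            current = None
    return switches


with_links = parse_toc_instruction(r'TOC \o "1-3" \h \z \u')
without_links = parse_toc_instruction(r'TOC \o "1-3"')

# Assumption: without \h, only the page number of an entry is a jump target.
print("h" in with_links, "h" in without_links)
```

In the second case the loader would need to keep the page-number reference
of each entry, since it is the only jump target the entry has.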

-- 

Best Regards,Jianhong Cheng

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Fan Zheng <zh...@gmail.com>.
Well, good news. Then the effort needed for the TOC improvements in the
OOXML filter should be smaller. But sorry, I do not know the OOXML loading
process in detail. I need some time to investigate.
On 2012-6-14 at 6:34 PM, "Ying Zhang" <tl...@gmail.com> wrote:

> Thanks, Zheng Fan. Yes, I'm thinking about supporting OOXML TOC import.
> The OOXML filter could support nested fields, but I'm not sure whether
> that is the only blocking issue for OOXML TOC support. Do you have any
> idea about a solution?

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Ying Zhang <tl...@gmail.com>.
Thanks, Zheng Fan. Yes, I'm thinking about supporting OOXML TOC import.
The OOXML filter could support nested fields, but I'm not sure whether
that is the only blocking issue for OOXML TOC support. Do you have any
idea about a solution?


2012/6/13 Fan Zheng <zh...@gmail.com>

> To Zhang Ying:
> The OOXML filter could get this improvement as well, if nested fields
> can be supported.

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Fan Zheng <zh...@gmail.com>.
To Zhang Ying:
The OOXML filter could get this improvement as well, if nested fields
can be supported.
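
As a rough sketch of what "nested fields" means here: in WordprocessingML a
complex field is delimited by w:fldChar elements of type begin, separate and
end, and a TOC entry typically wraps a PAGEREF field inside a HYPERLINK
field. The Python below is only an illustration under that assumption.
parse_fields and SAMPLE are hypothetical names and data, and this is not how
the AOO OOXML filter is implemented.

```python
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

# Hypothetical, hand-written TOC entry: a HYPERLINK field whose result
# contains the entry text and a nested PAGEREF field.
SAMPLE = f'''<w:p xmlns:w="{W}">
  <w:r><w:fldChar w:fldCharType="begin"/></w:r>
  <w:r><w:instrText> HYPERLINK \\l "_Toc1" </w:instrText></w:r>
  <w:r><w:fldChar w:fldCharType="separate"/></w:r>
  <w:r><w:t>Introduction</w:t></w:r>
  <w:r><w:fldChar w:fldCharType="begin"/></w:r>
  <w:r><w:instrText> PAGEREF _Toc1 \\h </w:instrText></w:r>
  <w:r><w:fldChar w:fldCharType="separate"/></w:r>
  <w:r><w:t>3</w:t></w:r>
  <w:r><w:fldChar w:fldCharType="end"/></w:r>
  <w:r><w:fldChar w:fldCharType="end"/></w:r>
</w:p>'''


def parse_fields(paragraph_xml: str) -> list:
    """Return the top-level fields of one paragraph; nested ones go in 'children'."""
    top, stack = [], []
    for el in ET.fromstring(paragraph_xml).iter():
        if el.tag == f"{{{W}}}fldChar":
            kind = el.get(f"{{{W}}}fldCharType")
            if kind == "begin":
                fld = {"instr": "", "result": "", "children": [], "in_result": False}
                # A field that begins while another is open is nested in it.
                (stack[-1]["children"] if stack else top).append(fld)
                stack.append(fld)
            elif kind == "separate" and stack:
                stack[-1]["in_result"] = True
            elif kind == "end" and stack:
                stack.pop().pop("in_result")  # close the field, drop the helper flag
        elif el.tag == f"{{{W}}}instrText" and stack:
            stack[-1]["instr"] += el.text or ""
        elif el.tag == f"{{{W}}}t" and stack and stack[-1]["in_result"]:
            stack[-1]["result"] += el.text or ""
    return top


fields = parse_fields(SAMPLE)
print(fields[0]["instr"].split()[0], "->", fields[0]["children"][0]["instr"].split()[0])
```

The stack is the point of the sketch: a reader that tracks only one open
field at a time would lose either the hyperlink or the page reference.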
On 2012-5-30 at 9:58 AM, "Ying Zhang" <tl...@gmail.com> wrote:

> I see that only the improvement for interoperability with the MS binary
> file format has been mentioned. But the same problems exist for the MS
> OOXML file format. Could we consider both and see whether we can define
> the same mechanism and the same scope, so that the two are consistent
> with each other?
> I would like to take the MS OOXML part.

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Fan Zheng <zh...@gmail.com>.
You are right. I will change the design later.
Thanks a lot!
On 2012-6-13 at 7:22 PM, "Oliver-Rainer Wittmann" <or...@googlemail.com> wrote:

> Hi,
>
> On 12.06.2012 16:20, chengjh wrote:
>
>> The function specification and design are now ready for review. Please
>> visit http://wiki.services.openoffice.org/wiki/Writer/TOC to review the
>> FS section "Loading of MS Word TOC=>Binary Format=>Function Specification"
>> and the design section "Loading of MS Word TOC=>Binary Format=>Design
>> Description". Comments are welcome. Thanks.
>>
>>
> I already had a look at the wiki and made some minor changes.
>
> Additionally, I think we still want to "collect" certain paragraphs as
> headings when we load the main content. But we do not want to update the
> loaded TOC from the "collected" headings. Right?
> Thus, I propose to remove the sentence "Heading paragraphs collecting step
> removal, indicate the step 5 above;". I have already marked this sentence
> in the wiki by striking it through.
> If this is OK, we can remove it completely.
>
>
> Best regards, Oliver.
>
>

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Oliver-Rainer Wittmann <or...@googlemail.com>.
Hi,

On 12.06.2012 16:20, chengjh wrote:
> The function specification and design are now ready for review. Please
> visit http://wiki.services.openoffice.org/wiki/Writer/TOC to review the
> FS section "Loading of MS Word TOC=>Binary Format=>Function Specification"
> and the design section "Loading of MS Word TOC=>Binary Format=>Design
> Description". Comments are welcome. Thanks.
>

I already had a look at the wiki and made some minor changes.

Additionally, I think we still want to "collect" certain paragraphs as
headings when we load the main content. But we do not want to update the
loaded TOC from the "collected" headings. Right?
Thus, I propose to remove the sentence "Heading paragraphs collecting step
removal, indicate the step 5 above;". I have already marked this sentence in
the wiki by striking it through.
If this is OK, we can remove it completely.
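
A minimal Python sketch of the behaviour described above, with hypothetical
names (Document, load, update_toc) that are not taken from the AOO source or
from the wiki design: headings are still collected while the body is loaded,
but the TOC read from the file stays untouched until an explicit update.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    toc_entries: list = field(default_factory=list)  # (title, page) as stored in the file
    headings: list = field(default_factory=list)     # heading texts collected from the body


def load(stored_toc, body_paragraphs):
    """Load a document without regenerating its TOC."""
    doc = Document(toc_entries=list(stored_toc))
    for style, text in body_paragraphs:
        # Headings are still collected so that a later update has its input.
        if style.startswith("Heading"):
            doc.headings.append(text)
    # Deliberately no update step here: the stored TOC is kept as read.
    return doc


def update_toc(doc):
    """Explicit, user-triggered update: rebuild the TOC from collected headings."""
    # Page numbers are left as None because they would need a layout pass.
    doc.toc_entries = [(text, None) for text in doc.headings]


# Hypothetical document whose second heading was renamed after the TOC was built.
doc = load(
    stored_toc=[("Introduction", 1), ("Old Title", 2)],
    body_paragraphs=[
        ("Heading 1", "Introduction"),
        ("Body", "Some text."),
        ("Heading 1", "New Title"),
    ],
)
print(doc.toc_entries)  # still the stored entries, including the stale one
```

After load() the TOC still shows the stale "Old Title" entry, which is what
MS Word would show for the same file; only update_toc() replaces it.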


Best regards, Oliver.


Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by chengjh <ch...@apache.org>.
The function specification and design are now ready for review. Please
visit http://wiki.services.openoffice.org/wiki/Writer/TOC to review the
FS section "Loading of MS Word TOC=>Binary Format=>Function Specification"
and the design section "Loading of MS Word TOC=>Binary Format=>Design
Description". Comments are welcome. Thanks.

On Wed, May 30, 2012 at 10:54 AM, chengjh <ch...@apache.org> wrote:

> Sure, that will be more integrated. Thanks, Ying.


-- 

Best Regards,Jianhong Cheng

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Ying Zhang <tl...@gmail.com>.
I see that only the improvement for interoperability with the MS binary
file format has been mentioned. But the same problems exist for the MS
OOXML file format. Could we consider both and see whether we can define
the same mechanism and the same scope, so that the two are consistent
with each other?
I would like to take the MS OOXML part.

2012/5/29 chengjh <ch...@apache.org>

> Oliver, welcome...

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by chengjh <ch...@apache.org>.
Oliver, welcome...

On Tue, May 29, 2012 at 8:21 PM, Oliver-Rainer Wittmann <
orwittmann@googlemail.com> wrote:

> Hi,
>
> Such an improvement makes sense from my point of view.
>
> If possible I would help on this.
>
> Best regards, Oliver.
>



-- 

Best Regards,Jianhong Cheng

Re: Propose to Implement the Loading of TOC and Improve TOC Fidelity with MS Word Binary Document

Posted by Oliver-Rainer Wittmann <or...@googlemail.com>.
Hi,

On 29.05.2012 09:24, chengjh wrote:
> Hi All,
>
> The TOC (Table of Contents) is a significant feature in AOO Writer.
> Although it already provides powerful capabilities that improve end-user
> productivity, the following areas, especially fidelity with MS Word, still
> need improvement. I propose them as candidates for the next release on
> https://cwiki.apache.org/confluence/display/OOOUSERS/AOO+4.0+Feature+Planning
> and welcome your comments. Thanks.
>
> 1) The TOC data of an MS Word document is not parsed completely. Instead,
> the TOC that is displayed comes from a silent update performed once the MS
> Word document has been loaded. Fidelity therefore cannot be ensured,
> especially when document content that affects the TOC was changed after
> the TOC was created in MS Word. We therefore propose to implement a TOC
> loading process that replaces the update action.
> 2) The tab between the chapter number and the TOC entry is lost when
> loading an MS Word document, which changes the gap between the chapter
> number and the TOC entry. The result looks different from MS Word.
> 3) Jump information is lost when loading an MS Word TOC created with
> "Use hyperlinks instead of page numbers" unchecked. For this kind of TOC,
> end users in MS Word can only jump by Ctrl+clicking the page number of the
> TOC entry.
> 4) Customized character attributes are lost when loading an MS Word TOC
> created with "Use hyperlinks instead of page numbers" unchecked. For this
> kind of TOC, MS Word can carry the customized character attributes of the
> target paragraphs into the TOC.
>

Such an improvement makes sense from my point of view.

If possible I would help on this.

Best regards, Oliver.