You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@openoffice.apache.org by bu...@apache.org on 2017/08/25 14:20:45 UTC

[Issue 127522] New: request for hyphenation documentation

https://bz.apache.org/ooo/show_bug.cgi?id=127522

          Issue ID: 127522
        Issue Type: DEFECT
           Summary: request for hyphenation documentation
           Product: Writer
           Version: 4.1.3
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: Normal
          Priority: P5 (lowest)
         Component: programming
          Assignee: issues@openoffice.apache.org
          Reporter: mauro.trevisan@gmail.com
  Target Milestone: ---

Hi, I am writing an hyphenation file for my language, I found a description of
the Liang algorithm and I implemented the rules. Now I want to learn more about
the non-standard hyphention. Namely the Soija extension (ex. ab1cd/am=z2,2) and
the NEXTLEVEL tag (along the others).
Just to be sure, I understand that the LEFTHYPHENMIN and RIGHTHYPHENMIN refers
to word boundaries (here I use the B letter: BabcB, Bab-cB), and that
COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN should refers to compound
boundaries (here I use the C letter: CabcC, CabC-CcC). Am I right?
The NOHYPHEN tag describes the characters that should not be considered as
hyphenation separators? like if I write NOHYPHEN -, then the '-' in ab-cd is
not a separator? Here I don't understand the utility of this tag. I though I
had to indicate the position of the hyphenation with an odd number, not tell
the hyphenator what is not an hyphen.
Next, what I don't understand at all is the NEXTLEVEL tag.
From what I learn about the Soija extension, the indexes refers to the first
position (1-based) where the substitution occurs and its length. That is, if I
have the word "abcd" with the rule ab1cd/am=z2,2 then the hyphenation is
aam-zd. I understand right?
If I didn't specify the index and length then I understand the whole rule is
substituted. From the word "mabcdm" with the rule ab1cd/am=z, then the
hyphenation will be mam-zm. Is it right?
Another question is about the paper in
http://hunspell.sourceforge.net/tb87nemeth.pdf: here an example on page 3 show
the rule eigh1teen/t=t,5,1. Shouldn't it be eigh1teen/ht=t,4,2?
Another one: can the first index of the Soija point to the hyphenation code?
(like eigh1teen/t=t,4,1)
The rule 7-/=- shouldn't it be -7/-=-?
Is there a paper somewhere where there is the description of the Soija
non-standard hyphenation and the OO tags (NEXTLEVEL and the others)?

Thank you so much.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

oooforum (fr) <oo...@free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |NOT_AN_OOO_ISSUE
                 CC|                            |oooforum@free.fr
             Status|UNCONFIRMED                 |RESOLVED

--- Comment #1 from oooforum (fr) <oo...@free.fr> ---
Not an issue for AOO.
Hyphenation or dictionaries are provided under extensions part.
Note that these are third party software.
If you have some questions about development, please contact our mailing list:
https://openoffice.apache.org/get-involved.html

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

--- Comment #4 from mauro <ma...@gmail.com> ---
Thank you. I will try.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

Marcus <ma...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|---                         |NOT_AN_OOO_ISSUE

--- Comment #3 from Marcus <ma...@apache.org> ---
I don't think that you will get any further information as there is nothing
more available. So, asking the author(s) of the extension(s) could be really
helpful.

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

--- Comment #2 from mauro <ma...@gmail.com> ---
Are you advising me to contact the author of this hyphenation third-party
software and ask him/her for the documentation?
How can I known who wrote the hyphenator?

Here https://www.openoffice.org/lingucomponent/hyphenator.html it is said that
"The OpenOffice.org hyphenator was based on the libhnj library by Raph Levien".
It's the "based on" part that leads me to ask here. Because something that was
"based on" the work of someone was written by someone else other than Raph
Levien.

Thank you

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

mauro <ma...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|NOT_AN_OOO_ISSUE            |---
             Status|RESOLVED                    |UNCONFIRMED

-- 
You are receiving this mail because:
You are the assignee for the issue.

[Issue 127522] request for hyphenation documentation

Posted by bu...@apache.org.
https://bz.apache.org/ooo/show_bug.cgi?id=127522

Marcus <ma...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|programming                 |help
             Status|RESOLVED                    |CLOSED

-- 
You are receiving this mail because:
You are the assignee for the issue.