You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@joshua.apache.org by Matt Post <po...@cs.jhu.edu> on 2016/09/07 18:21:01 UTC
Re: Language Pack English-Japanese
Hi IGA,
That would be great.
There is also this collection of data for English/Japanese translation. If you collect and prepare all of this, I can then either help you build a model, or build it myself.
http://www.phontron.com/japanese-translation-data.php
Sincerely,
Matt
> On Aug 5, 2016, at 5:22 AM, IGA Tosiki <ig...@gmail.com> wrote:
>
> Hi Matt,
>
> I can convert those XML en-ja pair into other format as you point, if
> you think the pairs are useful, and if you want to do so.
>
> Regards,
> Toshiki
>
> 2016-08-05 17:53 GMT+09:00 IGA Tosiki <ig...@gmail.com>:
>> Hi Matt,
>>
>> I can share my en-ja parallel data.
>>
>> https://osdn.jp/projects/blancofw/releases/52952
>>
>> It is pair that translation en to ja for Eclipse IDE menu and
>> messages. It is translated by human and also checked by human.
>>
>> Toshiki
>>
>> 2016-08-04 22:02 GMT+09:00 Matt Post <po...@cs.jhu.edu>:
>>> Hi Toshiki,
>>>
>>> Have you been able to gather any parallel data?
>>>
>>> matt
>>>
>>>
>>>> On Jul 22, 2016, at 3:50 PM, Henry Saputra <he...@gmail.com> wrote:
>>>>
>>>> HI Toshiki,
>>>>
>>>> For this kind of discussion, let's have it in the dev@ list.
>>>>
>>>> You can ask the question to dev@joshua.incubator.apache.org.
>>>>
>>>> Thanks,
>>>>
>>>> Henry
>>>>
>>>> On Thu, Jul 21, 2016 at 9:46 PM, IGA Tosiki <ig...@gmail.com> wrote:
>>>>
>>>>> Hi Matt,
>>>>>
>>>>> Thanks for your reply!
>>>>>
>>>>> I'm happy to read your mail, I want to help you Japanese-English language
>>>>> pack.
>>>>> And YES, I mean translation memories by TMS/XLIFF. But I may convert
>>>>> TMS to what you specified format.
>>>>>
>>>>> And also I knew English to Japanese is very difficult, but also I
>>>>> believe sample of English-Japanese language pack will attract many
>>>>> Japanese people to use Joshua.
>>>>>
>>>>> Regards,
>>>>> Toshiki
>>>>>
>>>>> 2016-07-22 12:42 GMT+09:00 Matt Post <po...@cs.jhu.edu>:
>>>>>> Hi,
>>>>>>
>>>>>> There is no Japanese--English language pack, but I would be happy to
>>>>> build one if you could help by pointing me to data. What we need is
>>>>> parallel data in the form of sentences that are translations of each other.
>>>>> If you have access to this or pointers to where I could find some, I would
>>>>> be happy to build it. There are likely standard datasets available; people
>>>>> like Graham Neubig (http://www.phontron.com) have been working on this
>>>>> for a while.
>>>>>>
>>>>>> What are TMS and LTIFF? Are you talking about translation memories?
>>>>>>
>>>>>> As a side note, translation between English and Japanese is very
>>>>> difficult and tends not to be very good. One approach that helps is
>>>>> translating from trees and forests. Joshua does not have this capability at
>>>>> the moment.
>>>>>>
>>>>>> Sincerely,
>>>>>> matt
>>>>>>
>>>>>>
>>>>>>> On Jul 21, 2016, at 11:28 PM, IGA Tosiki <ig...@gmail.com> wrote:
>>>>>>>
>>>>>>> Hi team,
>>>>>>>
>>>>>>> I got interest about Joshua, and language pack. I am Japanese, and I
>>>>>>> want to know around Japanese language pack.
>>>>>>>
>>>>>>> Is there any plan about building Japanese-English language pack?
>>>>>>> I believe TMS or LTIFF will usefull to building such language pack. I
>>>>>>> have many OSS based TMS between English-Japanese. Is there any path
>>>>>>> using TMX or LTIFF for input of Joshua language pack?
>>>>>>>
>>>>>>> Best regards,
>>>>>>> Toshiki Iga
>>>>>>
>>>>>
>>>