You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@subversion.apache.org by "Denis N. Antonioli" <de...@canoo.com> on 2006/08/14 15:10:50 UTC

Encoding of .svn/entries?

Hi

I've found out that the file .svn/entries is not encoded in utf-8,  
although that is what the xml header says, but in the platform's  
encoding.
This is at least the case on my system (Mac OS X 10.4).

Is this a known bug?

Best
	dna

-- 
Any technology distinguishable from magic is insufficiently advanced.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org

Re: Encoding of .svn/entries?

Posted by "Denis N. Antonioli" <de...@canoo.com>.
On 16 août 06, at 15:21, Malcolm Rowe wrote:

> On Wed, Aug 16, 2006 at 02:43:48PM +0200, Denis N. Antonioli wrote:
>>>>>> I've found out that the file .svn/entries is not encoded in  
>>>>>> utf-8,
>>>>>> although that is what the xml header says, but in the platform's
>>>>>> encoding.
>>>>>> This is at least the case on my system (Mac OS X 10.4).
>>>>>>
>> check it out to <test>/sandbox and add to it a file with an accent
>> (café).
>> At the end, the script dumps the <test>/sandbox/.svn/entries:
>>
>>  2f 3e 0a 3c 65 6e 74 72  79 0a 20 20 20 63 6f 6d  |/>.<entry.   com|
>>  6d 69 74 74 65 64 2d 72  65 76 3d 22 31 22 0a 20  |mitted-rev="1". |
>>  20 20 6e 61 6d 65 3d 22  63 61 66 c3 a9 22 0a 20  |  name="café". |
>>  20 20 74 65 78 74 2d 74  69 6d 65 3d 22 32 30 30  |  text-time="200|
>>
>
> '63 61 66 c3 a9' is 'café' in UTF-8.  What's the problem?
>
> Regards,
> Malcolm


Oops :-(

My apologies for the senseless noise!

	dna

-- 
Computer Science is nothing more than the study of patterns of 0s and 1s
   -- Donald Knuth

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org


Re: Encoding of .svn/entries?

Posted by Malcolm Rowe <ma...@farside.org.uk>.
On Wed, Aug 16, 2006 at 02:43:48PM +0200, Denis N. Antonioli wrote:
> >>>>I've found out that the file .svn/entries is not encoded in utf-8,
> >>>>although that is what the xml header says, but in the platform's
> >>>>encoding.
> >>>>This is at least the case on my system (Mac OS X 10.4).
> >>>>
> check it out to <test>/sandbox and add to it a file with an accent  
> (café).
> At the end, the script dumps the <test>/sandbox/.svn/entries:
> 
>  2f 3e 0a 3c 65 6e 74 72  79 0a 20 20 20 63 6f 6d  |/>.<entry.   com|
>  6d 69 74 74 65 64 2d 72  65 76 3d 22 31 22 0a 20  |mitted-rev="1". |
>  20 20 6e 61 6d 65 3d 22  63 61 66 c3 a9 22 0a 20  |  name="café". |
>  20 20 74 65 78 74 2d 74  69 6d 65 3d 22 32 30 30  |  text-time="200|
> 

'63 61 66 c3 a9' is 'café' in UTF-8.  What's the problem?

Regards,
Malcolm

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org

Re: Encoding of .svn/entries?

Posted by "Denis N. Antonioli" <de...@canoo.com>.
Hi Peter

On 14 août 06, at 17:28, Peter N. Lundblad wrote:

> Denis N. Antonioli writes:
>> On 14 août 06, at 17:12, Garrett Rooney wrote:
>>
>>> On 8/14/06, Denis N. Antonioli <de...@canoo.com> wrote:
>>>> Hi
>>>>
>>>> I've found out that the file .svn/entries is not encoded in utf-8,
>>>> although that is what the xml header says, but in the platform's
>>>> encoding.
>>>> This is at least the case on my system (Mac OS X 10.4).
>>>>
>>>> Is this a known bug?
>>>
>>> Well, it should be utf-8 encoded, but it's kind of a moot point now,
>>> since we're not using xml for the entries file in 1.4.x and newer
>>> versions of svn.
>>>
>> Thank you. Too bad I'm too late ;-)
>>
>
> Not really.  If there's an encoding bug, that might not have gone away
> just because we switched the format.  We use UTF-8 internally, and
> when the XML was written, the internal encoding was just written to  
> the file.
> It is the same now.
>
> Can you describe how to reproduce the problem?

How about the attached setup.sh, a tcsh script?
Save it to a directory <test>, the script will create a local  
repository <test>/rep,
check it out to <test>/sandbox and add to it a file with an accent  
(café).
At the end, the script dumps the <test>/sandbox/.svn/entries:

00000180  2f 3e 0a 3c 65 6e 74 72  79 0a 20 20 20 63 6f 6d  |/ 
 >.<entry.   com|
00000190  6d 69 74 74 65 64 2d 72  65 76 3d 22 31 22 0a 20  |mitted- 
rev="1". |
000001a0  20 20 6e 61 6d 65 3d 22  63 61 66 c3 a9 22 0a 20  |   
name="café". |
000001b0  20 20 74 65 78 74 2d 74  69 6d 65 3d 22 32 30 30  |  text- 
time="200|



Re: Encoding of .svn/entries?

Posted by "Peter N. Lundblad" <pe...@famlundblad.se>.
Denis N. Antonioli writes:
 > On 14 août 06, at 17:12, Garrett Rooney wrote:
 > 
 > > On 8/14/06, Denis N. Antonioli <de...@canoo.com> wrote:
 > >> Hi
 > >>
 > >> I've found out that the file .svn/entries is not encoded in utf-8,
 > >> although that is what the xml header says, but in the platform's
 > >> encoding.
 > >> This is at least the case on my system (Mac OS X 10.4).
 > >>
 > >> Is this a known bug?
 > >
 > > Well, it should be utf-8 encoded, but it's kind of a moot point now,
 > > since we're not using xml for the entries file in 1.4.x and newer
 > > versions of svn.
 > >
 > Thank you. Too bad I'm too late ;-)
 > 

Not really.  If there's an encoding bug, that might not have gone away
just because we switched the format.  We use UTF-8 internally, and
when the XML was written, the internal encoding was just written to the file.
It is the same now.

Can you describe how to reproduce the problem?

Regards,
//Peter

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org


Re: Encoding of .svn/entries?

Posted by "Denis N. Antonioli" <de...@canoo.com>.
On 14 août 06, at 17:12, Garrett Rooney wrote:

> On 8/14/06, Denis N. Antonioli <de...@canoo.com> wrote:
>> Hi
>>
>> I've found out that the file .svn/entries is not encoded in utf-8,
>> although that is what the xml header says, but in the platform's
>> encoding.
>> This is at least the case on my system (Mac OS X 10.4).
>>
>> Is this a known bug?
>
> Well, it should be utf-8 encoded, but it's kind of a moot point now,
> since we're not using xml for the entries file in 1.4.x and newer
> versions of svn.
>
> -garrett

Thank you. Too bad I'm too late ;-)

Best
	dna

-- 
And remember: if you don't get source code, it's not open, which
means you'll eventually get screwed.
   -- Tom Christiansen in <5g...@csnews.cs.colorado.edu>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org


Re: Encoding of .svn/entries?

Posted by Garrett Rooney <ro...@electricjellyfish.net>.
On 8/14/06, Denis N. Antonioli <de...@canoo.com> wrote:
> Hi
>
> I've found out that the file .svn/entries is not encoded in utf-8,
> although that is what the xml header says, but in the platform's
> encoding.
> This is at least the case on my system (Mac OS X 10.4).
>
> Is this a known bug?

Well, it should be utf-8 encoded, but it's kind of a moot point now,
since we're not using xml for the entries file in 1.4.x and newer
versions of svn.

-garrett

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org