Posted to users@subversion.apache.org by Alexander Veit <Al...@unitedplanet.de> on 2013/05/21 16:53:32 UTC

svnlook cat seems to swallow byte order marks

Hi,

it seems that

 svnlook cat

swallows byte order marks (e.g. \xEF\xBB\xBF) from committed files. Therefore it is difficult to check for the presence or absence of BOMs in commit hooks.

Can anyone confirm that this is a bug in Subversion (at least in version: 1.7.7)?


-- 
Cheers
Alex

RE: svnlook cat seems to swallow byte order marks

Posted by Alexander Veit <Al...@unitedplanet.de>.
The MD5 sum of the working copy file and that of the svnlook cat output are the same, and hexdump shows that the byte order mark is actually there. It seems that the shell constructs I used stripped off the BOM.
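
For anyone who wants to do the same check in a hook, here is a minimal sketch that looks only at the raw leading bytes, so no text-processing step gets a chance to drop the BOM. The function name has_utf8_bom is made up for illustration, and the usage line assumes the usual pre-commit arguments in $REPOS and $TXN; head -c is widely available but not strictly POSIX.

```shell
# has_utf8_bom: read a byte stream on stdin and succeed (exit 0)
# only if it starts with the UTF-8 byte order mark EF BB BF.
has_utf8_bom() {
    # head -c 3 keeps the first three bytes, od prints them as hex,
    # tr removes the spaces and newline od adds around the values.
    [ "$(head -c 3 | od -An -tx1 | tr -d ' \n')" = "efbbbf" ]
}

# Hypothetical use in a pre-commit hook (path is a placeholder):
#   if svnlook cat -t "$TXN" "$REPOS" path/in/repo | has_utf8_bom; then
#       echo "file starts with a BOM" >&2
#   fi
```

The comparison is done on the hex representation rather than on the bytes themselves, which avoids depending on how a particular shell or tool treats non-ASCII input.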

Thank you very much for your help, Michael and Daniel.

Alex

Re: svnlook cat seems to swallow byte order marks

Posted by Daniel Shahaf <d....@daniel.shahaf.name>.
C. Michael Pilato wrote on Tue, May 21, 2013 at 11:07:00 -0400:
> On 05/21/2013 10:53 AM, Alexander Veit wrote:
> > Hi,
> > 
> > it seems that
> > 
> >  svnlook cat
> > 
> > swallows byte order marks (e.g. \xEF\xBB\xBF) from committed files. Therefore it is difficult to check for the presence or absence of BOMs in commit hooks.
> > 
> > Can anyone confirm that this is a bug in Subversion (at least in version: 1.7.7)?

svn info ^/foo | grep Checksum:

and

svnlook cat . foo | openssl md5

should produce the same output.  Do they for you, Alexander?
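
If comparing checksums by eye is inconvenient, a byte-for-byte comparison with cmp is a possible alternative. This is only a sketch: the function name same_bytes and the working copy path are hypothetical, and the comparison is only meaningful when the working file is not altered on checkout (for example by svn:keywords or svn:eol-style).

```shell
# same_bytes FILE: succeed (exit 0) only if the bytes on stdin are
# identical to the contents of FILE. cmp -s compares silently and
# "-" tells it to read the first input from stdin.
same_bytes() {
    cmp -s - "$1"
}

# Hypothetical usage, run from the repository directory like the
# svnlook command above (the working copy path is a placeholder):
#   svnlook cat . foo | same_bytes /path/to/wc/foo && echo identical
```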

Re: svnlook cat seems to swallow byte order marks

Posted by "C. Michael Pilato" <cm...@collab.net>.
On 05/21/2013 10:53 AM, Alexander Veit wrote:
> Hi,
> 
> it seems that
> 
>  svnlook cat
> 
> swallows byte order marks (e.g. \xEF\xBB\xBF) from committed files. Therefore it is difficult to check for the presence or absence of BOMs in commit hooks.
> 
> Can anyone confirm that this is a bug in Subversion (at least in version: 1.7.7)?

Nope.  Works as expected here.

$ svnlook cat data/repositories/i18n-data utf8-chinese-bom.txt > foo
$ hexdump -C foo
00000000  ef bb bf 75 74 66 38 2d  63 68 69 6e 65 73 65 2d  |...utf8-chinese-|
00000010  62 6f 6d 2d e5 90 8c e6  84 8f 0a                 |bom-.......|
0000001b
$


-- 
C. Michael Pilato <cm...@collab.net>
CollabNet   <>   www.collab.net   <>   Enterprise Cloud Development