Posted to dev@subversion.apache.org by Johan Corveleyn <jc...@gmail.com> on 2011/05/12 23:34:00 UTC

Re: Further diff optimization ideas

Ok, to wrap this up for now: r1102471 finally put these thoughts into
notes/diff-optimizations.txt, with some of Stefan2's feedback/ideas
integrated into it.

I also added another previously mentioned idea to the notes file,
which I forgot to bring up in this mail thread:

--- 8< ---
Avoid some hashing by exploiting the fact that matching lines often come
   in series.

  - If the previous line had a match with the other file, first try to
    directly compare (memcmp) the next line with the successor of the
    matched line. Only if it doesn't match, calculate the hash to insert
    it into the container.
  - This approach probably conflicts with the "Merge hash calculation with
    EOL scanning" suggestion.
--- 8< ---

(Not sure if this is a worthwhile idea, but I thought I'd mention it.)
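
To make the idea above concrete, here is a minimal sketch in C. It is not
the libsvn_diff code: line_t, hash_line, lines_equal and match_lines are
hypothetical names, the FNV-1a hash is just a stand-in, and a flat array
of hashes scanned linearly stands in for the real token container. It only
shows the control flow: try memcmp against the successor of the previous
match first, and compute a hash only when that fails.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical line representation: pointer into a buffer plus length. */
typedef struct { const char *data; size_t len; } line_t;

/* Stand-in hash (FNV-1a); the real diff code uses its own function. */
static unsigned long hash_line(const line_t *l)
{
  unsigned long h = 2166136261UL;
  size_t i;
  for (i = 0; i < l->len; i++)
    h = (h ^ (unsigned char)l->data[i]) * 16777619UL;
  return h;
}

static int lines_equal(const line_t *x, const line_t *y)
{
  return x->len == y->len && memcmp(x->data, y->data, x->len) == 0;
}

/* For each line b[j], store in match_out[j] the index of an equal line
   in a, or -1 if there is none.  Returns the number of lines of b that
   had to be hashed, or (size_t)-1 if allocation fails. */
static size_t match_lines(const line_t *a, size_t na,
                          const line_t *b, size_t nb, long *match_out)
{
  unsigned long *a_hash;
  size_t i, j, hashed = 0;
  long prev = -1;  /* index in a matched by the previous line of b */

  assert(match_out != NULL);
  a_hash = malloc((na ? na : 1) * sizeof(*a_hash));
  if (a_hash == NULL)
    return (size_t)-1;
  for (i = 0; i < na; i++)
    a_hash[i] = hash_line(&a[i]);

  for (j = 0; j < nb; j++)
    {
      match_out[j] = -1;
      /* Fast path: compare directly with the successor of the last match. */
      if (prev >= 0 && (size_t)prev + 1 < na
          && lines_equal(&a[prev + 1], &b[j]))
        match_out[j] = prev + 1;
      else
        {
          /* Slow path: hash the line and look it up (linear scan here). */
          unsigned long h = hash_line(&b[j]);
          hashed++;
          for (i = 0; i < na; i++)
            if (a_hash[i] == h && lines_equal(&a[i], &b[j]))
              {
                match_out[j] = (long)i;
                break;
              }
        }
      prev = match_out[j];
    }

  free(a_hash);
  return hashed;
}
```

In this sketch a line that takes the fast path is never hashed, which is
where the conflict noted in the second bullet comes from: if the hash is
already computed while scanning for the EOL, there is nothing left to save.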

Cheers,
-- 
Johan

Re: Further diff optimization ideas

Posted by Stefan Fuhrmann <eq...@web.de>.
On 13.05.2011 10:44, Julian Foad wrote:
> Johan Corveleyn wrote:
>> Ok, to wrap this up for now: r1102471 finally put these thoughts into
>> notes/diff-optimizations.txt, with some of Stefan2's feedback/ideas
>> integrated into it.
> Johan, thank you for writing down the ideas you've thought through so
> much, where we can find them later, in addition to all the improvements
> you have already implemented.
/me is looking forward to discussing "everything diff-y" in Berlin ;)

-- Stefan^2.


Re: Further diff optimization ideas

Posted by Julian Foad <ju...@wandisco.com>.
Johan Corveleyn wrote:
> Ok, to wrap this up for now: r1102471 finally put these thoughts into
> notes/diff-optimizations.txt, with some of Stefan2's feedback/ideas
> integrated into it.

Johan, thank you for writing down the ideas you've thought through so
much, where we can find them later, in addition to all the improvements
you have already implemented.

- Julian