469,903 Members | 1,493 Online
Bytes | Developer Community
New Post

Home Posts Topics Members FAQ

Post your question to a community of 469,903 developers. It's quick & easy.

Binary Differencing

I posted a question a few days ago concerning file differencing and I got
some thought provoking answers. My original question dealt with identifying
file differences for storage of multiple versions of documents, some binary
some textual. I know that there are many excellent version control systems
out there (cvs, subversion, etc) but due to restrictions on the project I'm
working on we have to have a custom implementation. In order to gain what
could be some substantial savings for us on both bandwidth and file storage
we have decided to only store and transmit version differences in our system
(backend is sql server). For the most flexibility we want to go with a
binary differencing model. While searching around I found RFC 3284 "The
VCDIFF Generic Differencing and Compression Data Format" that really seemed
to fit my needs from a very high level. A little more work showed that other
version control systems (Vault) actually implemented this RFC for it's
system. Now I've started to digest the RFC in an effort to start some
prototype implementations but as is common the RFC is a fairly high level
document. Is anyone aware on any subject matter concerning implementing
binary differencing such as VCDiff, preferably some .net (or any) code
snippets. I plan to implement this in the .Net framework (C#) hence my post
to this group. Any advice or direction would be greatly welcomed. Thanks!

Josh Carlisle

Nov 16 '05 #1
2 4600
Josh Carlisle <jc*******@removeforspam.viewfusion.com> wrote:
I posted a question a few days ago concerning file differencing and I got
some thought provoking answers. My original question dealt with identifying
file differences for storage of multiple versions of documents, some binary
some textual. I know that there are many excellent version control systems
out there (cvs, subversion, etc) but due to restrictions on the project I'm
working on we have to have a custom implementation. In order to gain what
could be some substantial savings for us on both bandwidth and file storage
we have decided to only store and transmit version differences in our system
(backend is sql server). For the most flexibility we want to go with a
binary differencing model. While searching around I found RFC 3284 "The
VCDIFF Generic Differencing and Compression Data Format" that really seemed
to fit my needs from a very high level. A little more work showed that other
version control systems (Vault) actually implemented this RFC for it's
system. Now I've started to digest the RFC in an effort to start some
prototype implementations but as is common the RFC is a fairly high level
document. Is anyone aware on any subject matter concerning implementing
binary differencing such as VCDiff, preferably some .net (or any) code
snippets. I plan to implement this in the .Net framework (C#) hence my post
to this group. Any advice or direction would be greatly welcomed. Thanks!


I have a C# *decoder* for VCDiff which is freely available -
http://www.pobox.com/~skeet/csharp/miscutil

Unfortunately I don't have an encoder in C#. It's one of those things
I'd like to do some time, but don't have the time at the moment.

I found RFC 3284 to be one of the best written ones I've seen - the
implementation of a decoder only took about 4 hours.

--
Jon Skeet - <sk***@pobox.com>
http://www.pobox.com/~skeet
If replying to the group, please do not mail me too
Nov 16 '05 #2
Thanks Jon I'll take a look at your decoder. I'm sure it will prove to be
helpfull at the very least for getting me on the right track. I don't have
much experience taking a RFC to code but it does seem to be well written.
Thanks again.

Josh

"Jon Skeet [C# MVP]" <sk***@pobox.com> wrote in message
news:MP************************@msnews.microsoft.c om...
Josh Carlisle <jc*******@removeforspam.viewfusion.com> wrote:
I posted a question a few days ago concerning file differencing and I got
some thought provoking answers. My original question dealt with
identifying
file differences for storage of multiple versions of documents, some
binary
some textual. I know that there are many excellent version control
systems
out there (cvs, subversion, etc) but due to restrictions on the project
I'm
working on we have to have a custom implementation. In order to gain what
could be some substantial savings for us on both bandwidth and file
storage
we have decided to only store and transmit version differences in our
system
(backend is sql server). For the most flexibility we want to go with a
binary differencing model. While searching around I found RFC 3284 "The
VCDIFF Generic Differencing and Compression Data Format" that really
seemed
to fit my needs from a very high level. A little more work showed that
other
version control systems (Vault) actually implemented this RFC for it's
system. Now I've started to digest the RFC in an effort to start some
prototype implementations but as is common the RFC is a fairly high level
document. Is anyone aware on any subject matter concerning implementing
binary differencing such as VCDiff, preferably some .net (or any) code
snippets. I plan to implement this in the .Net framework (C#) hence my
post
to this group. Any advice or direction would be greatly welcomed. Thanks!


I have a C# *decoder* for VCDiff which is freely available -
http://www.pobox.com/~skeet/csharp/miscutil

Unfortunately I don't have an encoder in C#. It's one of those things
I'd like to do some time, but don't have the time at the moment.

I found RFC 3284 to be one of the best written ones I've seen - the
implementation of a decoder only took about 4 hours.

--
Jon Skeet - <sk***@pobox.com>
http://www.pobox.com/~skeet
If replying to the group, please do not mail me too

Nov 16 '05 #3

This discussion thread is closed

Replies have been disabled for this discussion.

Similar topics

20 posts views Thread by Christian Stigen Larsen | last post: by
3 posts views Thread by Tron Thomas | last post: by
103 posts views Thread by Steven T. Hatton | last post: by
2 posts views Thread by Lisa Pearlson | last post: by
9 posts views Thread by Ching-Lung | last post: by
1 post views Thread by Josh Carlisle | last post: by
3 posts views Thread by John R. Delaney | last post: by
10 posts views Thread by rory | last post: by
16 posts views Thread by Erwin Moller | last post: by
1 post views Thread by Waqarahmed | last post: by
reply views Thread by Salome Sato | last post: by
By using this site, you agree to our Privacy Policy and Terms of Use.