Deltup will update packages and reduce download time
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Wed May 21, 2003 10:36 pm    Post subject: Re: Different stuff Reply with quote

Quote:
If you're concatting the headers together into one file, and the data into a separate one, the headers file would be structured as such- creating a more efficient delta for it shouldn't be too hard. I've basically implemented a 2-byte indicator of modified fields, then null-delimited strings/nums of the new entries. From there my attempt gets a bit different, but you get the idea for doing tar header updates.

That sounds like a good experiment if you have the infrastructure to find matching tar headers. But don't you need an extra integer to find the old header when you rebuild the new one?
Quote:
One could probably try and do a copy/insert setup on the header changes, but that seems like overkill for the most part- the data changes are too small.

I don't know exactly what you mean here.
Quote:
As for attempting to align related fields so they're adjacent (first section is all names, second is linkname... etc), I'd wonder what you'd hope to accomplish via that. Compression might be better, but I'd be curious how xdelta would be more behaved- it only includes changes, regardless of where they are.

It would allow several fields that are identical in the old and new tar headers to be matched in a single larger block, also leaving the differing fields all together in a block to be compressed.
example:
    ABCDE -> ABCDE
    this -> that
    FGHIJK -> FGHIJK
    oh -> my
    LM -> LM
    data -> differs

could be arranged like this:
    ABCDEFGHIJKLM -> ABCDEFGHIJKLM
    thisohdata -> thatmydiffers
Now there's only one matching block to include in the delta control struct.
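Here's a minimal sketch of how that regrouping could look (the stable/volatile split and the helper below are just my illustration, not deltup code): the rarely-changing fields of each tar header get appended to one stream, the rest to a second stream, and the delta runs over the two streams instead of over interleaved headers.
Code:
/* Sketch: pull the rarely-changing ustar header fields into one stream
   and leave the volatile ones (size, mtime, chksum...) for a second
   stream.  The stable/volatile split is just a guess for illustration;
   offsets are from the POSIX ustar layout. */
#include <string.h>
#include <stddef.h>

/* append the "stable" fields of one 512-byte header to buf,
   return the number of bytes written */
size_t stable_fields(const unsigned char *h, unsigned char *buf)
{
    size_t n = 0;
    memcpy(buf + n, h + 0,   100); n += 100;   /* name               */
    memcpy(buf + n, h + 100,  24); n += 24;    /* mode, uid, gid     */
    memcpy(buf + n, h + 156, 101); n += 101;   /* typeflag, linkname */
    memcpy(buf + n, h + 257,   8); n += 8;     /* magic, version     */
    memcpy(buf + n, h + 265,  64); n += 64;    /* uname, gname       */
    memcpy(buf + n, h + 329,  16); n += 16;    /* devmajor, devminor */
    return n;
}
/* size (124), mtime (136) and chksum (148) would be appended to the
   second stream the same way. */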
Quote:
Assuming I'm reading it correctly, from the sounds of it, you're proposing the ability to identify w/in a stream an archive/compressed section? You could attempt to identify it via the appropriate magic/id, but that would require the ability to identify how long the data is, which would be a pain unless you were reading the length from the tarball/archive header. Unless you're planning on attempting to figure out the specific compressed/archive file's length automatically, the alg would have to be aware of the tar headers...

Yes, I think it should be aware of the tar headers. After all, I'm going to be extracting them anyway so it's a cinch to check the file length.
Quote:
Assuming you made the alg tar-header aware, you're basically half way to diffing per file anyways

Maybe you're right that diffing per file would be a cleaner solution - it seems to give us more control. It would be more practical if I write a new delta algorithm with less overhead - I'm pretty sure I could do it. I read that article you suggested about the rsync algorithm and I was surprised that not only was it pretty simple to implement, but it's very much like what I had in mind already! Even the rolling checksum part. Maybe I'm mistaken, but I think I know a better way to calculate the checksum. I bet that the xdelta algorithm doesn't take advantage of the fact that the source files are present for matching variable-sized blocks either.
Quote:
In terms of an individual dataset thing being more work, I'd disagree- it's the equivalent of doing multiple diffs rather than one overarching one. As long as the delta compression code is set up right/cleanly, it's really not that much different from doing the whole file.
There still is the issue of data that jumps from file to file, but I'd think there is an efficient solution for that (possibly a control kludge).

That's the issue I'm worried about. Maybe a second global diff could be done on all the data that is left unmatched. An alternative would be to just ignore the problem.
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Thu May 22, 2003 3:06 am    Post subject: Re: Different stuff Reply with quote

jjw wrote:
That sounds like a good experiment if you have the infrastructure to find matching tar headers. But don't you need an extra integer to find the old header when you rebuild the new one?

At the moment, I have what I'd call a working setup of that. There are still issues (files being moved) which will require some form of id'ing to be done in the removals/additions for optimization's sake, but it's coming along.
As for the extra int requirement, yes, there is (currently) data stored as to which entry to match the patch against for the delta- I do have a few ideas how to curtail those requirements though- basically a combination of forward patching and specifying offsets from the current tar entry number rather than the absolute entry number. I haven't yet nailed that down, although I consider it doable...
jjw wrote:
ferringb wrote:
One could probably try and do a copy/insert setup on the header changes, but that seems like overkill for the most part- the data changes are too small.
I don't know exactly what you mean here.

Pardon, I was referring to greedy matching- diff for instance (at least unified) generates a delta of the insert/delete sort; rsync and xdelta appear to be of the copy/insert persuasion. Below I mention some work by Randal Burns- it's explicitly explained within his papers (it basically is a way of classifying the control syntax).
jjw wrote:
... snip ... could be arranged like this:
ABCDEFGHIJKLM -> ABCDEFGHIJKLM
thisohdata -> thatmydiffers
Now there's only one matching block to include in the delta control struct.

First off, just in case I sound way the heck out in left field, I'm not sure what you're intending there. I'm guessing that you mean a way of basically identifying identical fields (devmajor, devminor come to mind) that serve as a standard template/update for the tar headers? If that's what you're after, to be honest I don't really see how grouping it as such has any benefit in space efficiency.
In dealing w/ version-wide changes, or instances where it's cheaper to store the general settings in one place and add a field where it differs, I handled it by having basically a template definition- the template is merged/applied to the original source header, then the further modifications are applied. Thus far, aside from the cost of identifying common/identical values throughout the tarball, I've found it to be a simple and easy way to handle global changes. Guessing that is what you were getting at?
jjw wrote:
Yes, I think it should be aware of the tar headers. After all, I'm going to be extracting them anyway so it's a cinch to check the file length.

'k. I was wondering if you were doing that, mainly since (personally) I couldn't see any way of reconstructing the tarball w/out knowing the length after applying the xdelta to the headers and data.
jjw wrote:
Maybe you're right that diffing per file would be a cleaner solution - it seems to give us more control.

Well, aside from control, there are space considerations- figure it thus: you have a 2 mb file, and a block (assuming you're doing checkpoints) is transposed from one end of the file to the other (front to tail)- the maximum relative addressing needed/allowed for the control syntax must only cover 2 mb. Now if that was but a file in a tarball (say size 20 mb), the addressing size needed/allowed would have to cover 20 mb rather than 2 mb. Yes, the address for transposing the block would be only a certain number of bits, but for reconstructing you need to be able to know how many bits you're using for addressing. Not the greatest example (I know a few ways to minimize the addressing size already), but you get the gist.
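As a rough worked example of that cost (my numbers, nothing exact): an offset within a 2 mb file fits in 21 bits (2^21 = 2,097,152), while an offset within a 20 mb tarball needs 25 bits (2^25 is about 33.5 million). That's 4 extra bits per address, so a copy command carrying a source and a target address grows by about a byte- small per command, but it adds up over thousands of commands.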
jjw wrote:
It would be more practical if I write a new delta algorithm with less overhead - I'm pretty sure I could do it. I read that article you suggested about the rsync algorithm and I was surprised that not only was it pretty simple to implement, but it's very much like what I had in mind already!

Prior to attempting this, I would read into the works of Randal Burns- specifically check out his master's thesis "Differential Compression: A Generalized Solution For Binary Files." Title just rolls off the tongue, doesn't it? The downside to the rsync alg is that it uses checkpoints (rather than doing true byte-by-byte modification/transposing, it tries to work w/ blocks)- this is much easier on memory, although the d. compression isn't necessarily guaranteed to be optimal (pretty close though).
To be frank, I'd be quite curious how you're planning on improving on the existing delta algs for generalized data. Structured data, yes, there is room for improvement (with the differ and the applier knowing the structure, the control overhead can be simplified/optimized).
jjw wrote:
Even the rolling checksum part. Maybe I'm mistaken, but I think I know a better way to calculate the checksum. I bet that the xdelta algorithm doesn't take advantage of the fact that the source files are present for matching variable-sized blocks either.

Last I looked in xdelta's source, I believe it was using a variant of the adler32 chksum for the rolling chksum- pretty efficient, and fairly unlikely to collide. I don't recall what the strong chksum was, possibly md4 (I think that might've been rsync though...)
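For reference, a minimal sketch of that flavor of weak rolling checksum (a simplification in the adler32/rsync spirit- not xdelta's or rsync's actual code):
Code:
#include <stdint.h>
#include <stddef.h>

struct rollsum { uint32_t a, b; };

/* checksum the initial window of 'len' bytes:
   a = sum of bytes, b = sum of the running a values */
void rollsum_init(struct rollsum *s, const unsigned char *p, size_t len)
{
    s->a = s->b = 0;
    for (size_t i = 0; i < len; i++) {
        s->a += p[i];
        s->b += s->a;
    }
}

/* slide the window one byte: 'out' leaves the front, 'in' enters the back */
void rollsum_roll(struct rollsum *s, size_t len, unsigned char out, unsigned char in)
{
    s->a += (uint32_t)in - out;
    s->b += s->a - (uint32_t)len * out;
}

/* combine into one 32-bit value for the hash table */
uint32_t rollsum_digest(const struct rollsum *s)
{
    return (s->b << 16) | (s->a & 0xffff);
}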
jjw wrote:
[data transposed between files is] the issue I'm worried about. Maybe a second global diff could be done on all the data that is left unmatched. An alternative would be to just ignore the problem.

That'd seem inefficient, although I lack any data to prove it (heh, gotta love conjecture). Offhand, this is what I've been thinking about for it- I haven't yet done any considerations about space needed for this, so who knows how crazy it is. Either way, onto it- A) see what bzip2 can do with it- assuming it's a largely unmodified block of data, bzip2'ing/gzipping the sucker ought to make short work of it. Or, B) it's a data block that is say 95% similar, but does have modifications here and there- I'd thought of attempting to pull the block out, and outside of the file specify basically a pseudo substitution section- eg, block x belongs here and here (based off of entry num and intra-file location), and possibly, if data exists there or there is some particular control syntax there, merge the two.
First off, like I said, I have yet to even consider the efficiency of such a setup- mostly I've been running it through the skull figuring how easiest/most efficiently to do it. Second, the extra overhead it induces in the control language, assuming one could keep it outside of the file control syntax, would likely be quite hefty.
It's a bit out there, and probably moot considering bzip2 can likely handle it quite readily.
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Thu May 22, 2003 5:20 am    Post subject: new delta format Reply with quote

Quote:
First off, just in case I sound way the heck out in left field, I'm not sure what you're intending there. I'm guessing that you mean a way of basically identifying identical fields (devmajor, devminor come to mind) that serve as a standard template/update for the tar headers? If that's what you're after, to be honest I don't really see how grouping it as such has any benefit in space efficiency.
In dealing w/ version-wide changes, or instances where it's cheaper to store the general settings in one place and add a field where it differs, I handled it by having basically a template definition- the template is merged/applied to the original source header, then the further modifications are applied. Thus far, aside from the cost of identifying common/identical values throughout the tarball, I've found it to be a simple and easy way to handle global changes. Guessing that is what you were getting at?

No. What I'm suggesting is to move the fields in the tar header around so that the ones that are unlikely to change are adjacent. When the delta is performed the matching blocks will be much larger, and the control structure can be smaller. This certainly would not be useful if you can eliminate the header entirely by using special structured diff techniques as you have been describing.
Quote:
Well, aside from control, there are space considerations- figure it thus: you have a 2 mb file, and a block (assuming you're doing checkpoints) is transposed from one end of the file to the other (front to tail)- the maximum relative addressing needed/allowed for the control syntax must only cover 2 mb. Now if that was but a file in a tarball (say size 20 mb), the addressing size needed/allowed would have to cover 20 mb rather than 2 mb. Yes, the address for transposing the block would be only a certain number of bits, but for reconstructing you need to be able to know how many bits you're using for addressing. Not the greatest example (I know a few ways to minimize the addressing size already), but you get the gist.

Yes, the bits used for addressing could be cut down that way, but an even more effective way would be to use relative addressing and keep the control structure separate from the data. Then bzip2 will wipe up the control codes like a hungry garbage disposal. But that will have to wait until I write the new delta alg.
Quote:
To be frank, I'd be quite curious how you're planning on improving on the existing delta algs for generalized data. Structured data, yes, there is room for improvement (with the differ and the applier knowing the structure, the control overhead can be simplified/optimized).

#1, after finding a matching block the algorithm can do a running comparison backwards and forwards, because it will have access to both datasets (see the sketch after this list).
#2, I plan to use a better control structure which will work effectively with compression and have no extra fluff (that includes md5sums).
#3, possibly use a multi-pass algorithm.
#4, add features like two-way patches.
#5, possibly better checksum calculations. And remove the strong checksum (compare data instead), resulting in large reductions in the hash table, not to mention 100% accuracy, which I don't think the xdelta algorithm can guarantee.
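To illustrate #1, a minimal sketch (purely illustrative, not an actual implementation): once the checksums line up at a pair of offsets, just walk outward byte by byte in both files, since the differ has both of them on disk:
Code:
#include <stddef.h>

struct match { size_t src_off, tgt_off, len; };

/* Given a candidate match at src[si] / tgt[ti] of 'block' bytes,
   extend it backwards and forwards by direct comparison. */
struct match extend_match(const unsigned char *src, size_t slen,
                          const unsigned char *tgt, size_t tlen,
                          size_t si, size_t ti, size_t block)
{
    size_t start_s = si, start_t = ti, end_s = si + block, end_t = ti + block;

    /* grow backwards */
    while (start_s > 0 && start_t > 0 && src[start_s - 1] == tgt[start_t - 1]) {
        start_s--; start_t--;
    }
    /* grow forwards */
    while (end_s < slen && end_t < tlen && src[end_s] == tgt[end_t]) {
        end_s++; end_t++;
    }

    struct match m = { start_s, start_t, end_s - start_s };
    return m;
}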
Quote:
A) see what bzip2 can do with it- assuming it's a largely unmodified block of data, bzip2'ing/gziping the sucker ought to make short work of it.

Sure it will, but that's just ignoring the problem. In other words that data was compressed before we touched it, and our algorithm isn't doing anything with it.
Quote:
Or, B) it's a data block that is say 95% similar, but does have modifications here and there- I'd thought of attempting to pull the block out, and outside of the file specify basically a pseudo substitution section- eg, block x belongs here and here (based off of entry num and intra-file location), and possibly, if data exists there or there is some particular control syntax there, merge the two.

That doesn't solve the problem of how to find the matching blocks...
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Thu May 22, 2003 3:33 pm    Post subject: Re: new delta format Reply with quote

jjw wrote:
No. What I'm suggesting is to move the fields in the tar header around so that the ones that are unlikely to change are adjacent. When the delta is performed the matching blocks will be much larger, and the control structure can be smaller. This certainly would not be useful if you can eliminate the header entirely by using special structured diff techniques as you have been describing.

Viable- one could just run a generic diff alg against it rather than writing something specific to handle it.
jjw wrote:
Yes, the bits used for addressing could be cut down that way, but an even more effective way would be to use relative addressing and keep the control structure separate from the data. Then bzip2 will wipe up the control codes like a hungry garbage disposal. But that will have to wait until I write the new delta alg.

That was already assuming relative addressing.
I'd wonder what gains you'd hope for from separating the control syntax from the data- first off though, clarifying: when I say data in this instance I mean the bytes that are to be appended/inserted here or there.
Either way, as I was saying- if you break the control syntax away from the data, then you have to have some way of addressing which data belongs to which control/command in the patch. Seems like overkill; keeping things inline removes such a need. Course I may be missing what you mean...
jjw wrote:
#1, after finding a block that matches the algorithm can do a running comparison backwards and forwards because it will have access to both datasets.

Fairly sure xdelta does this (kind of a common property of delta algs)- if it doesn't, it would likely handle it via the correction alg.
jjw wrote:
#2, I plan to use a better control structure which will work effectively with compression and have no extra fluff (that includes md5sums).

I can understand the desire to eject fluff fields like 'this was the original file's name'- that seemed a bit off to me. But a better control structure that is more behaved w/ compression?
jjw wrote:
#3, possibly use a multi-pass algorithm.

What would you hope to gain from it? Assuming you were doing a one-half correction pass, about all one could hope for is to go over the generated commands and see if there was some overarching correction/optimization that could be done- as is, these algs are already designed to correct their mistakes in a near-optimal way.
jjw wrote:
#4, add features like two way patches.

Downside to reversible patches is that they will be significantly longer- you can use the offset trick I'd mentioned for replacements, but one is screwed either way on deletes (unless that block shows up elsewhere). Either way, it's a feature that's mostly needed.
jjw wrote:
#5, possibly better checksum calculations. And remove the strong checksum (compare data instead), resulting in large reductions in the hash table

There are a couple of issues with what you're proposing. Offhand- say you do implement a better (stronger) rolling checksum alg; the downside is that in doing so each rolling checksum calculation becomes more expensive. The weak chksum is chosen mainly for its speed, and the fact it doesn't produce that many false positives. From there, obviously, the strong chksum is used to verify things match (compensating for the weak chksum's false positives).
If one were to implement a stronger chksum for the rolling calc, A) the prog likely becomes slower- the weak chk is a large portion of the proc time, B) the required memory goes up by a good portion- by definition, creating a better, more accurate rolling chksum would require more bits. That's assuming that is what you meant.
As for doing away with strong chksumming and doing string comparison, there are two methods I could see-
A) rather than storing the strong chksum, store the string in memory. Again, this hoses memory usage fairly quickly.
B) doing lookup on the specific string from disk- easier on memory, slower (i/o becomes a *major* limiting factor).
I think in attempting to do away with the strong chksum, you're missing its benefits (ultimately why it's used). The strong chksum is stored for the target file, so you only have to compute the strong chksum once per block: computing the chksums for the target in one pass, then working through the source doing the same.
Further, the strong chksum is efficient from a memory standpoint due to the fact that it serves as a way of efficiently pegging blocks of varying size and composition into a specific key (which can then be compared).
Dunno, I could easily see pulling the entire file into memory and just doing a strncmp, but that's dependent on the file size- w/ my setup (individual files) I could probably get away with it for the majority of files, just not something like kde/openoffice without thrashing the heck out of swap...

jjw wrote:
not to mention 100% accuracy which I don't think xdelta algorithm can guarantee.

Well, the strong chksum is md4 based (at least for rsync), so I'd think it's basically five-nines accurate. Highly unlikely that it isn't exact- alternatively, you could use two moderate chksum algs and verify both match.
Course I haven't done the math on it, so again, fairly sure it's not *guaranteed* 100%. Dunno, I don't think via a chksum method one can guarantee 100% for any scenario where there are more bytes being chksummed than the chksum has bits. Of course, that doesn't mean that they aren't remarkably accurate though...
jjw wrote:
Sure it will, but that's just ignoring the problem. In other words that data was compressed before we touched it, and our algorithm isn't doing anything with it.

eh? Not talking about compressed data at all- specifically I was talking of data that is transposed between files. The problem is basically creating a way to say yank from here, stick it there, w/out getting grossly inefficient. Unless you're saying the data is already 'delta compressed'...
jjw wrote:
That doesn't solve the problem of how to find the matching blocks...

I'm not worried about finding matching blocks between removals and additions- that isn't particularly hard- I've been working on a similar rolling chksum setup to identify essentially how alike the two files are. Plus side, I can handle the entire dataset of removals/additions, and basically watch for file/data bounds.
Downside being that files that are quite similar (CVS Entries files for instance) will trip the alg out quite a bit. Dunno, something can be worked out for it though.
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Thu May 22, 2003 5:09 pm    Post subject: Re: new delta format Reply with quote

Quote:
I'd wonder what gains you'd hope for from separating the control syntax from the data- first off though, clarifying: when I say data in this instance I mean the bytes that are to be appended/inserted here or there.

A very large gain actually... the control codes will be very repetitive; if they are bunched together, bzip2 can achieve a very high compression rate.
Quote:
Either way, as I was saying- if you break the control syntax away from the data, then you have to have some way of addressing which data belongs to which control/command in the patch. Seems like overkill; keeping things inline removes such a need. Course I may be missing what you mean...

Surprisingly, it can be done without ANY data addressing. Every time a control code needs data, the patcher just grabs the next bytes from the data stream. You don't need random data access.
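A minimal sketch of what I mean (made-up command format, nothing like a final deltup format): the patcher reads commands from the control stream, and insert commands consume the data stream strictly in order, so no data offsets are ever stored:
Code:
/* Apply a delta whose control stream and data stream are kept separate.
   COPY commands reference the old file by offset; INSERT commands just
   consume the next 'len' bytes from the data stream, in order. */
#include <stdio.h>
#include <stdint.h>

enum { CMD_COPY = 'C', CMD_INSERT = 'I', CMD_END = 'E' };

static uint32_t read_u32(FILE *f)              /* 4-byte little-endian */
{
    uint32_t v = 0;
    for (int i = 0; i < 4; i++)
        v |= (uint32_t)fgetc(f) << (8 * i);
    return v;
}

void apply_delta(FILE *ctrl, FILE *data, FILE *oldf, FILE *newf)
{
    int cmd;
    while ((cmd = fgetc(ctrl)) != CMD_END && cmd != EOF) {
        uint32_t len = read_u32(ctrl);
        if (cmd == CMD_COPY) {
            uint32_t off = read_u32(ctrl);     /* offset into the old file */
            fseek(oldf, (long)off, SEEK_SET);
            for (uint32_t i = 0; i < len; i++)
                fputc(fgetc(oldf), newf);
        } else {                               /* CMD_INSERT */
            for (uint32_t i = 0; i < len; i++)
                fputc(fgetc(data), newf);      /* next bytes, in order */
        }
    }
}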
Quote:
What would you hope to gain from it? Assuming you were doing a one-half correction pass, about all one could hope for is to go over the generated commands and see if there was some overarching correction/optimization that could be done- as is, these algs are already designed to correct their mistakes in a near-optimal way.

On the first pass the alg could use large checksum block sizes, the remaining passes could parse through the leftover data with increasingly smaller block sizes. That would produce better results with less memory used. It would take longer in most cases though...

I do not propose to make a stronger rolling checksum, in fact, I think a slightly weaker one might be more appropriate!
Quote:
B) doing lookup on the specific string from disk- easier on memory, slower (i/o becomes a *major* limiting factor).

This is what I was thinking of. Your point is valid, but any modern OS will cache most of the data that needs to be accessed. In return for this inefficiency we get the following three benefits:
#1 removes the need for the strong checksum
#2 enables variable-sized blocks (on average matches "checksum block size" more data per matching block)
#3 produces 100% accurate results.
Is there another way to accomplish #2?
Quote:
jjw wrote:
#1, after finding a block that matches the algorithm can do a running comparison backwards and forwards because it will have access to both datasets.

Fairly sure xdelta does this (kind of a common property of delta algs)- if it doesn't, it would likely handle it via the correction alg.

This is the issue here. Does it? If so, then why the need for a strong checksum to verify the block?
neuron
Advocate


Joined: 28 May 2002
Posts: 2371

PostPosted: Thu May 22, 2003 7:23 pm    Post subject: Reply with quote

discuss this faster, guys - I like the idea ;)

also, why not bittorrent the downloads? It's an idea for big packages anyway. Could take a lot of load off the main mirrors.
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Thu May 22, 2003 9:08 pm    Post subject: Re: new delta format Reply with quote

jjw wrote:
A very large gain actually... the control codes will be very repetitive, if they are bunched together bzip2 can achieve a very high compression rate.

I'd find this debatable, contingent upon the developer not writing the control syntax in an optimized manner to begin with. Eg: say you're doing a copy starting at source 100, length 50, to target 125; it could be written as c100,50,125 (akin to an ed diff), or, assuming the program was smart enough to discern the maximum address, it could be in binary- using 4 bits for some enumerated symbol for copy, 7 bits for the source start, 7 bits for the length, then 7 bits for the target start address.
From there, the developer could either start another command, or more likely for their sanity pad the existing command w/ 7 bits to align it to a byte boundary.
Alternatively, they could break the control syntax down into sections- copies here (so as to have just one symbol needed for the copy command), adds here (same)....
As for compression, yes, repetitive data is lunch for a good alg- but the major bonus emanates from larger blocks of repetitive data, due to the whole control-to-data overhead issue.
My personal opinion is that the control syntax would already be optimized to begin with, hence compression would be able to take a chunk out of it, but not what I'd define as a very large gain: I'd think at best 5%. As always, I could be wrong.
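Just to make that packing concrete, a wee sketch (my own made-up layout, nothing canonical):
Code:
/* Pack the copy command from the example above: 4-bit opcode, then
   7-bit source start, length and target start (enough here, since all
   three values fit in 7 bits).  25 bits used, 7 bits of padding. */
#include <stdint.h>

#define OP_COPY 0x1u

uint32_t pack_copy(uint32_t src, uint32_t len, uint32_t tgt)
{
    return (OP_COPY << 28) | ((src & 0x7f) << 21)
         | ((len & 0x7f) << 14) | ((tgt & 0x7f) << 7);
    /* low 7 bits are padding to the next byte boundary */
}

/* pack_copy(100, 50, 125) -> one 4-byte command instead of "c100,50,125" */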
jjw wrote:
Surprisingly, it can be done without ANY data addressing. Every time data is needed by the control codes, it grabs more data from the sequence. You don't need random data access.

It's of course contingent on each control command having a length argument, but yeah... it hadn't dawned on me that it doesn't matter whether the control/data are inline or separated.
jjw wrote:
On the first pass the alg could use large checksum block sizes, the remaining passes could parse through the leftover data with increasingly smaller block sizes. That would produce better results with less memory used. It would take longer in most cases though...

I'd say all cases would take longer using such a model. It would be faster than doing both backwards and forwards searching, due to the fact that backwards and forwards searching goes quadratic in the worst-case scenario. Also, as you proceed to smaller block sizes the memory usage goes up, even if you're wiping the hash after each run w/ a certain block size. Ultimately it still would be less memory intensive than not using block chksums- one thing I would wonder about is how the heck you'd deal w/ the delta data generated from each run, specifically integrating them into one coherent delta.
I do think that there is something of that vein that could be accomplished, although I'd think the optimization would come from an analysis of the diffed blocks- as in, w/ the larger block size, do some analysis on the block to basically move it to a byte-sized quanta. Doing continually smaller block sizes would have issues, mainly less and less gain (if any) w/ additional processing being required.

jjw wrote:
I do not propose to make a stronger rolling checksum, in fact, I think a slightly weaker one might be more appropriate!

I'm guessing you'd do a weaker checksum to gain speed in another section of the alg to offset the wrath of extra overhead associated w/ extra checks for false positives (requiring full block comparison)?

jjw wrote:
This is what I was thinking of. Your point is valid, but any modern OS will cache most of the data that needs to be accessed.

Quite aware- a while back I took it upon myself to try and update the old e2compr patches to work w/ kernel v2.4.20- required a lot of screwing w/ the buffering/swap system... One thing that I did learn out of it is that while the cache system is quite neat, one cannot rely on it to necessarily have the needed data cached. Basically, I'd think that the cache may save the prog often enough, but when it doesn't, the i/o hit becomes quite expensive. With small files you could probably skate on by without an issue- I'd personally not rely on it though.
Consider the openoffice tarballs- if you were just diffing the two files (not breaking it down into individual entries), the first pass would be building the chksum hash for the target version. The next pass is going through the source version, which pretty quickly starts pushing the cached sections of the target version out of memory.
Of course, this is assuming that there is enough memory for the file to be cached in memory to begin with.
W/ my system, this would happen pretty much immediately- heck, w/ yours, given the overhead from the OS and various daemons alongside the required memory usage for the app, you'd likely start losing cached portions of the target version before you reached the end of it. Openoffice is an extreme, but not everyone has half a gig of memory- there are the 256 and 128 mb people (w/ pity for those at 64 and below). The cache serves as an attempt to improve performance by holding recent blocks in memory, and just that. It cannot and should not be counted on to keep desired data in memory; one mallocs and stores that which one wants kept.
jjw wrote:
In return for this inefficiency we get the following three benefits:
#1 removes the need for the strong checksum

Granted, string comparisons are mucho better than md4/5 chksumming in terms of proc cost.
jjw wrote:
#2 enables variable-sized blocks (on average matches "checksum block size" more data per matching block)

You're describing basically a non-checkpoint alg, which is significantly costlier in terms of memory and proc.
jjw wrote:
#3 produces 100% accurate results.

If you could find an instance where two different strings of a set block size produce the same strong checksum, I'd consider this an issue. Only when you start pushing the block size to the extremes could I see this being an issue, which you wouldn't be doing anyways, due to the fact that too large a block size would result in a grossly inefficient delta.
jjw wrote:
Is there another way to accomplish #2?

Using a correcting algorithm, doing a pass over the output delta commands/data, inspecting each block for actual changes and looking for optimizations.
jjw wrote:
#1, after finding a matching block the algorithm can do a running comparison backwards and forwards because it will have access to both datasets.

jjw wrote:
This is the issue here. Does [xdelta]? If so, then why the need for a strong checksum to verify the block?

Xdelta being based off of rsync, as far as I know xdelta doesn't do back matching, nor does it particularly need to. That's why it uses chksum matching.
As for the strong chksum's reason for existence, simple- the computationally cheap weak checksum used for rolling *does* produce false positives. The strong chksum is used for identifying whether a block is the same as another.

Pardon for typos, running late and boss needs some frigging javascript...
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Thu May 22, 2003 10:38 pm    Post subject: Re: new delta format Reply with quote

Quote:
one thing I would wonder about is how the heck you'd deal w/ the delta data generated from each run, specifically integrating them into one coherent delta.

It's pretty hard to know the best method at this point, but each run could produce data which is added to a table of matches for sorting and further processing. Then the data could be written out in any format desired.

Quote:
I'm guessing you'd do a weaker checksum to gain speed in another section of the alg to offset the wrath of extra overhead associated w/ extra checks for false positives (requiring full block comparison)?

Just a simple checksum algorithm that accounts for every byte in the block would give nearly the same uniqueness. We don't need to be cryptographically secure! The time spent on false positives is minimal.
Quote:
Granted, string comparisons are mucho better then md4/5 chksumming in terms of proc cost.

You can shrink the hash by 128 bits per entry too!
Quote:
You're describing basically a non-checkpoint alg, which is significantly costlier in terms of memory and proc.

Actually it's a combination of the two: use checkpoints, and verify the data directly when a checkpoint matches.
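Roughly like this (illustrative only, not real code): keep the per-block checkpoints, but when the weak checksum hits, confirm by comparing the actual bytes instead of computing a strong checksum:
Code:
#include <stdint.h>
#include <stddef.h>
#include <string.h>

struct checkpoint {
    uint32_t weak;      /* rolling checksum of one source block */
    size_t   src_off;   /* where that block starts in the source file */
};

/* called only when the weak checksums already match; false positives
   just cost one memcmp, and no strong checksum is computed or stored */
int confirm_match(const unsigned char *src, const struct checkpoint *cp,
                  const unsigned char *window, size_t block)
{
    return memcmp(src + cp->src_off, window, block) == 0;
}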
Quote:
jjw wrote:
#3 produces 100% accurate results.

If you could find an instance where two different strings of a set block size produce the same strong checksum, I'd consider this an issue.

I don't consider it an "issue". Only a small benefit.
jjw wrote:
Is there another way to accomplish #2?

Quote:
Using a correcting algorithm, doing a pass over the output delta commands/data, inspecting each block for actual changes and looking for optimizations.

You can't "inspect each block for actual changes" unless you re-access the previously checksummed data. How does this give you variable-sized matching blocks?
Quote:
Xdelta being based off of rsync, as far as I know xdelta doesn't do back matching, nor does it particularly need to. That's why it uses chksum matching.

What I'm trying to point out is that when it matches a block, the data in adjacent blocks is not matched if it contains any non-matching data. You can only do that if you reload the checksummed data for matching.
Quote:
As for strong chksum's reason for existance, simple- the computationaly cheap weak checksum used for rolling *does* produce false positives. The strong chksum is used for identifying if a block is the same as another.

Very few false positives... They would be a minimal overhead and would fit perfectly into the string algorithm.

There would be some extra overhead like you said, but the big advantage would be the variable-sized block matches. The reduction of the hash table would be a great memory booster since the entire hash must reside in memory. That's probably why your disk was thrashing on the OO delta. Disk IO can be slow, but it can also be very fast if accessed sequentially. Thrashing is the biggest problem, and it's caused by random access on massive amounts of data (like a large hash table).
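A rough back-of-the-envelope (my numbers, just to give a feel for it): with 1 KB checksum blocks, a 100 MB source has roughly 100,000 hash entries; dropping a 16-byte (128-bit) strong checksum from each entry saves about 1.6 MB of hash table that no longer has to sit in memory.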
jonner
n00b


Joined: 25 Jul 2002
Posts: 42

PostPosted: Fri May 23, 2003 1:20 am    Post subject: Why not rsync itself? Reply with quote

There has been much discussion on this thread about efficiency of diff algorithms, something which I know almost nothing about. One thing does seem clear, though: the algorithm from rsync is being used. Perhaps I'm missing something, but why not just use rsync itself?

If the gentoo mirrors were running rsync servers as well as http or ftp servers, updating only the differences between distfiles could be completely automated with no additional burden on users or developers, though there would be more burden on server CPUs. Portage could do something as simple as make a copy of an old distfile with the desired version's name. Then it would just run rsync on it.

For example, if foo-0.9.tar.gz exists locally, and Portage needs foo-1.0.tar.gz, it copies foo-0.9.tar.gz to foo-1.0.tar.gz, then rsyncs foo-1.0.tar.gz from the server. The only thing that might be a little tricky would be finding the best candidate old version of the distfile to copy.
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Fri May 23, 2003 2:13 am    Post subject: Re: Why not rsync itself? Reply with quote

jonner wrote:
There has been much discussion on this thread about efficiency of diff algorithms, something which I know almost nothing about. One thing does seem clear, though: the algorithm from rsync is being used. Perhaps I'm missing something, but why not just use rsync itself?

If the gentoo mirrors were running rsync servers as well as http or ftp servers, updating only differences between distfiles could be completely automated with no additional burden on users or developers, though there would be more burden on server CPU's. Portage could do something as simple as make a copy of an old distfile with the desired version's name. Then, it would just run rsync on it.

There are several reasons we can't do that:
#1 The packages are compressed - they must be decompressed before rsyncing.
#2 There are various technical issues with recompressing the package.
#3 Diffs are much more efficient because the differences between the files are already known before any data has to pass over the network.
#4 With rsync the difference would have to be recalculated for each request.
#5 It would require mirrors to have rsync software

This project works now and it won't take a lot of work to integrate with Portage when the Gentoo developers are ready. If you'd like to see the feasibility of the current implementation, please see my post for upgrading kde & gcc: https://forums.gentoo.org/viewtopic.php?t=55210
jonner
n00b


Joined: 25 Jul 2002
Posts: 42

PostPosted: Fri May 23, 2003 2:45 am    Post subject: Reply with quote

First, I appreciate the effort greatly, as this is an obvious need; I've already thought about the problem a few times. You've obviously already had some success, so I can't dismiss that. I'm just wondering why such a complex solution is necessary.

As for #1 and #3, why? Are you sure? Have you tried rsync? #4 is the reason there would be more CPU usage on the servers. Why is #5 unreasonable? Portage already uses rsync all the time.

It seems that generating persistent diffs would either require additional effort from developers for each ebuild submission, use significantly more space on the server, or only work between two successive versions of distfiles. Am I wrong on all of these points? If so, I'll shut up.

I guess the question is whether the potential inconveniences of your system are less or more than those of using rsync. I'm just trying to invoke the KISS (Keep it Simple Stupid) principle.
BradB
Apprentice


Joined: 18 Jun 2002
Posts: 190
Location: Christchurch NZ

PostPosted: Fri May 23, 2003 2:54 am    Post subject: Reply with quote

Quote:
It seems that generating persistent diffs would either require additional effort from developers for each ebuild submission, use significantly more space on the server

Hmm, maybe - but the diffs for kde are 2.8Mb, and gcc's is 1.8Mb (IIRC).
You would actually save space if, instead of storing v1, v2, v3 of a whole package, you stored v1 & the diffs to get you to v2 & v3. So a small percentage of extra space to carry doesn't really hurt - diskspace is cheap.
Also, there isn't really too much effort required from the developer (as I see it) because they just submit the ebuild; you could easily autocreate the diffs between two packages with a script.
I think this idea has huge potential to save download time and bandwidth. I would personally push the bandwidth-saving point to the Gentoo devs, because somebody pays for that server bandwidth. Imagine if you only needed to download 3Mb of package instead of 30Mb when going from one release to another. That's an instant 10x saving in server bandwidth - a lot of Mb when you have 1000 people grabbing KDE from a server.

Brad
jonner
n00b


Joined: 25 Jul 2002
Posts: 42

PostPosted: Fri May 23, 2003 3:07 am    Post subject: The question is ease of use Reply with quote

I guess my main concern is that a system to reduce network usage needs to be easy to apply universally. It probably won't be very successful if developers or users need to do something special for each ebuild. Currently, this system does require user and developer intervention, since it hasn't been integrated into Portage yet. However, if that can be done in such a way that it just works for all ebuilds, then more power to it. Also, it would be nice if servers could get away with only storing a few entire tarballs and many diffs, but that's probably more trouble than it's worth.
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Fri May 23, 2003 4:44 am    Post subject: Reply with quote

Quote:
First, I appreciate the effort greatly, as this is a obvious need; I've already thought about the problem a few times. You've obviously already had some success, so I can't dismiss that. I'm just wondering why such a complex solution is necessary.

It's NOT a complex solution! This thread makes it look complicated because we are discussing ways to squeeze every last byte out of the patches! In any case, it'll all be contained and abstracted in an easy-to-use program - just as it is now.
Quote:
As for #1 and #3, why? Are you sure? Have you tried rsync?

Very sure... I haven't used rsync recently, but logic tells you the answers.
Quote:
Why is #5 unreasonable?

Not unreasonable, but something a lot of heavy-duty mirrors won't offer.
Quote:
It seems that generating persistent diffs would either require additional effort from developers for each ebuild submission

It probably will end up being an automatic process.
Quote:
use significantly more space on the server

Usually only one patch per package. Might take 5-10% more space.
If you built an rsync system, you'd have to leave all the packages uncompressed on the server. That would take several times the space the packages are taking now.
Quote:
or only work between two successive versions of distfiles.

Not if multiple successive patching is implemented (deltup has a special optimized mode for this).
Quote:
I guess the question is whether the potential inconveniences of your system are less or more than those of using rsync.

Less. Need I say more?
Quote:
I guess my main concern is that a system to reduce network usage needs to be easy to apply universally.

A system of diffs requires no infrastructure other than the ability to download files. What could be more universal?

Quote:
It probably won't be very successful if developers or users need to do something special for each ebuild.

Users definitely won't have to do anything in the end.
Quote:
Currently, this system does require user and developer intervention, since it hasn't been integrated into Portage yet.

Yes, but it works with just a simple program and a sourceforge site. Try to do that with an rsync implementation (any takers?!).
Quote:
Also, it would be nice if servers could get away with only storing a few entire tarballs and many diffs, but that's probably more trouble than it's worth.

On a completely unrelated issue: wouldn't it be nice if Portage could manage the user's distfiles repository, replacing old tarballs with backpatches to save space?

There is one advantage to using rsync - it makes patches unnecessary. Disadvantages? It uses more bandwidth, is heavy on the processor, and is unfeasible with compressed files.

BradB wrote:
I would personally push the bandwidth saving point to the Gentoo devs, because somebody pays for that server bandwidth.
I love your comments Brad. Lots of people just don't care about dial-up users, but you can always get them on the server bandwidth issue!
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Fri May 23, 2003 4:47 am    Post subject: Reply with quote

jjw wrote:
#1 The packages are compressed - they must be decompressed before rsyncing. ...snip...
#3 Diffs are much more efficient because the differences between the files are already known before any data has to pass over the network.
jonner wrote:
As for #1 and #3, why? Are you sure? Have you tried rsync? #4 is the reason there would be more CPU usage on the servers. Why is #5 unreasonable? Portage already uses rsync all the time.

In order- I don't think it was explicitly said earlier in the forum, but diff'ing works best when it's able to identify the largest block of changes/transposed data possible- basically the larger the block, the more efficient the diff is (less control overhead vs actual data). Compression's intention is to basically cut all possible redundant information out, and store it as efficiently as possible.
Therein lies the problem- diffing, to be optimal, needs to match the largest possible block, and compression, to be efficient, essentially removes the largest block (well, it inserts a symbol instead of the block). Basically, if you diff a compressed file, you get a huge diff- this is because
1) for most compressions (gzip/bzip2), to get byte 1024 you must basically first decompress the prior 1023 bytes- in other words, one minor change results in everything following being different. Think of a crypt alg: it's designed so that one minor change (a lower-case rather than upper-case char) results in a completely different result. Diffing a compressed file suffers from the same effect.
3) Rsync works by A) your system creating a rolling checksum and, every x bytes, a checkpoint checksum, and B) the server doing the same. Assuming you were rsyncing an uncompressed file (due to the reasons listed in 1), for every user who is trying to get a diff for a file, the aforementioned checksums must be computed on the server side. There are progs/ways to basically store that chksum information (presuming an rsync server does some caching of that sort), but you get the general gist.
Basically, as you said w/ the KISS principle: which is simpler, a dev uploading a diff, or a server solution that is caching the checksums and, for each user request, computing the differing blocks and uploading them to the user? Part of the reason there was talk of a move away from rsync to cvsup is that rsync doesn't scale as well as people would like.
5) True, portage uses rsync- but purely against gentoo mirrors that are running rsync server software. What about sites that are serving as distfile mirrors? Unless they happen to be a portage/rsync mirror (unlikely), that would require them to run additional software rather than just letting people download files. Also, the servers would suffer from the issues mentioned in 3 and 1.

responding to 3 posts in one...
BradB wrote:
Also, there isn't really too much effort required from the developer (as I see it) because they just submit the ebuild, you could easily autocreate the diffs between two packages with a script.
I think this idea has huge potential to save download time and bandwidth. I would personally push the bandwidth saving point to the Gentoo devs, because somebody pays for that server bandwidth. Image if you only needed to download 3Mb of package instead of 30Mb when going from one release to another. Instant 10x saving in server bandwidth - that is a lot of Mb when you have 1000 people grabbing KDE from a server.

Personally, my view is that the devs are going to be the ones to get stuck w/ the extra work. At least for the near future, assuming gentoo were to run w/ the distfile diff setup, they'd basically be the only ones using such a setup... so, unless the system was set up to handle unified diffs (seems to be the choice of diff formats), the devs would have to be creating their own (nobody else would have 'em). As for using unified diffs, that has issues besides not being the most efficient delta- mainly md5sum correctness. Nonetheless, creation of the diff is pretty much an automated process; it just will require the dev to do it.

jonner wrote:
Currently, this system does require user and developer intervention, since it hasn't been integrated into Portage yet. However, if that can be done in such a way that it just works for all ebuilds, then more power to it.

Yep, user intervention is required. Plus someone w/ high bandwidth to be a nice soul and gen the diff, and serve it somewhere.
To utter the 'famous last words', integration into portage isn't that hard. What's hard is getting the correct infrastructure set up, and getting the devs to actually create the diffs. Oh yeah, that and having a good diffing alg. XDelta has issues, and plain diff just doesn't cut it... heh heh.
jonner wrote:
Also, it would be nice if servers could get away with only storing a few entire tarballs and many diffs, but that's probably more trouble than it's worth.

Dunno, maybe those who want to contribute bandwidth but don't want to be transferring 20+ mb files could just serve as a diff server. Who knows. As for the main servers/mirrors, it would be best to keep all files (both full and diffs) on them, due to the fact that a user may not have any full source on their system (obviously a requirement if you're attempting to use diffs for patching).
I kind of like the diff server idea actually, although I can't see it getting too far- it is easier to have one class of mirrors rather than maintaining and updating two.
jonner
n00b


Joined: 25 Jul 2002
Posts: 42

PostPosted: Fri May 23, 2003 5:25 am    Post subject: Good work Reply with quote

Well, it sounds like there has already been plenty of thought put into this, so I'll shut up now. I will try out deltup. Also, I'll try rsyncing between different versions of tarballs to see for myself how effective it is.

The idea of diff servers is quite interesting, though having distfile servers with different capabilities will add complexity (maybe not much).
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Fri May 23, 2003 6:18 am    Post subject: Re: Good work Reply with quote

jonner wrote:
Well, it sounds like there has already been plenty of thought put into this, so I'll shut up now. I will try out deltup. Also, I'll try rsyncing between different versions of tarballs to see for myself how effective it is.

Input is always welcome... if rsync had a way to dump the delta it generates (and subsequently uploads), that could be used.
I think there was such a program, but I'm fairly sure it's disappeared into the sands of time.

jonner wrote:
The idea of diff servers is quite interesting, though having distfile servers with different capabilities will add complexity (maybe not much).

It depends on whether the files are pushed or pulled- if the central server is pushing the files, then I'd think having a diff-only server would be an uphill battle, getting those in control to do it.
I think it's most likely a pull, which would allow a mirror admin to say they were only doing a diff mirror.
Don't know why- I realize it adds complexity to the setup, but I could see people being willing to be a diff mirror rather than a full-fledged mirror.
BradB
Apprentice


Joined: 18 Jun 2002
Posts: 190
Location: Christchurch NZ

PostPosted: Tue Jun 17, 2003 3:52 am    Post subject: Reply with quote

Just bumping the thread to see where deltup is at - haven't heard much for a bit.

How's it going guys?

Brad
ferringb
Retired Dev


Joined: 03 Apr 2003
Posts: 357

PostPosted: Tue Jun 17, 2003 5:34 am    Post subject: Reply with quote

BradB wrote:
Just bumping the thread to see where deltup is at - haven't heard much for a bit.

How's it going guys?
Brad

Actually, I started another thread on a separate (but still related) issue, and we're running amok over at https://forums.gentoo.org/viewtopic.php?p=370030#370030. If I recall correctly, I believe jjw has deltup up to v3.5/3.6, the main changes being the addition of a couple of batch scripts for fetching/patching.
jjw
n00b


Joined: 20 Mar 2003
Posts: 59
Location: Austin, TX, US

PostPosted: Tue Jun 17, 2003 5:48 am    Post subject: Automatic updates Reply with quote

BradB wrote:
Just bumping the thread to see where deltup is at - haven't heard much for a bit.

How's it going guys?

Brad


I'm really glad you posted. A lot of great things have been happening with deltup! On Monday I got a temporary mirror from sunsite.dk for distributing patches. This means that you can finally automate deltup by adding a simple line to your make.conf file. The current version is 0.3.6.1 and you can install it by following these steps:
Code:
wget http://osdn.dl.sourceforge.net/sourceforge/deltup/ebuild-0.3.6.1.tar
tar -xvf ebuild-0.3.6.1.tar -C /usr/portage
rm ebuild-0.3.6.1.tar
emerge deltup

Then you need to edit /etc/make.conf and add:
Code:
FETCHCOMMAND='efetch ${URI} ${DISTDIR} ftp://sunsite.dk/projects/deltup/patchfiles'

That's all there is to it! Emerge will use patches whenever they are available. I'm still working toward full Gentoo acceptance so we can have local mirrors and put some necessary information into the portage tree.
I hope it works for you. Make sure you let me know if anything goes wrong.
God bless,
---JJW
neuron
Advocate


Joined: 28 May 2002
Posts: 2371

PostPosted: Tue Jun 17, 2003 12:18 pm    Post subject: Reply with quote

sweeet! definitely gonna do that :)
BradB
Apprentice


Joined: 18 Jun 2002
Posts: 190
Location: Christchurch NZ

PostPosted: Tue Jun 17, 2003 8:13 pm    Post subject: Reply with quote

Oh, yeah. If you want that to get seriously hammered, submit a HOWTO in tips & tricks, and ask if it can go in the GWN Letter.

Thanks for all the hard work guys - I will set this up tonight.

Brad
Death Valley Pete
n00b


Joined: 25 Mar 2003
Posts: 49
Location: The Inland Empire

PostPosted: Tue Jun 17, 2003 9:59 pm    Post subject: Reply with quote

Hi jjw,

I've been following your progress in the forums (this thread and the other one) for about a month, and I'd like to say again that what you're doing here is incredible. When I suggested sunsite.dk I figured it got buried in the conversation between two people who clearly knew what they were doing, but apparently not. I'm glad I was able to contribute something, even if it was something pretty insignificant.

Anyway, I hope I can be forgiven for asking this again, but now that you've got a real non-SF mirror going, is there any way for regular users to submit patches? For example, I made a patch with deltup for gcc-3.2.3-3.3 (it's about 8 MB) that I'd be willing to spend an hour uploading for the cause. :wink: I know this is a newbie delusion of grandeur, but making it automatic (something like: "Patch not found, downloading full tarball... Make and submit patch? (y/n)") would be a nifty feature (and maybe once this is fully integrated there can be an automatic system run on the mirrors to do that). Looking at the file list on the server it seems that the naming convention has changed, but I won't ask any dumb questions about that until I've read the new manual. I haven't installed the masked deltup yet but I'll do so now if there's Portage integration involved - in fact I'll do that right now.

Anyway, thanks for all the effort, keep us posted, and vaya con dios.
_________________
<instert pithy statement here>
BradB
Apprentice


Joined: 18 Jun 2002
Posts: 190
Location: Christchurch NZ

PostPosted: Tue Jun 17, 2003 10:06 pm    Post subject: Reply with quote

Quote:
I know this is a newbie delusion of grandeur but making it automatic (something like: "Patch not found, downloading full tarball... Make and submit patch? (y/n)")


I think that is brilliant. Though you would need a "confirm upload" step in case you decided that you don't actually have time to upload 8Mb.

Brad
Death Valley Pete
n00b


Joined: 25 Mar 2003
Posts: 49
Location: The Inland Empire

PostPosted: Wed Jun 18, 2003 4:19 pm    Post subject: Reply with quote

Hi BradB,

I'm flattered, but since the extent of my programming capabilities involves:
Code:
printf("Hello world!");

I think we'll just have to wait and see what the man who knows what's going on thinks. :wink: It sure does sound nice though.

Everybody:
Portage integration works beautifully. All you need to do is:
Code:
ACCEPT_KEYWORDS="~x86" emerge -u deltup

(The tools are still a little light on documentation but it's not hard to figure out at all. It took me about a minute to get up to speed.)
Then put
jjw wrote:
FETCHCOMMAND='efetch ${URI} ${DISTDIR} ftp://sunsite.dk/projects/deltup/patchfiles'

in your make.conf. If all you want to do is reap the deltup goodness, then that's it. I for one did a qpkg -l deltup and experimented with the programs it lists; as far as I know you can't really break anything.

For the record this is incredibly cool!

Edit: whoops, BBCode typo fixed. Preview first!
_________________
<instert pithy statement here>