Would you mind if you lost all the pictures from your travels?

Did you know that hard drives are rather like light bulbs? They have a finite life and are simply not expected to last forever. At some point, the hard drive inside your computer will fail.

If you are reading this blog, you probably have pictures from your walks and climbs in beautiful mountains. If they are stored only on your computer’s hard drive, you will lose the lot when it fails.

By not backing up, you’re simply gambling that the hard drive will fail only after you’ve moved on to a newer machine, whether it’s MS Windows, a Mac or Linux.

Losing my pictures would be awful, but there’s plenty of other data I’d rather not lose: emails, letters, source code, old academic work. Horror stories of losses abound on the web and I personally know people it has happened to.

If you back up already, to DVDs or an external hard disk perhaps, but keep that backup in the same building as the computer, you have no protection against the worst-case scenarios of fire or theft. I’ve heard it said that people who lose their house in a fire or other disaster come to terms with the loss of everything, except for their pictures.

Backing up on to DVDs or, even better, on to external hard drives is a great idea. Nothing is faster to restore from than a disk in your hand. But you still risk finding that the backup disk has simply died. They all do, eventually.

As long as you have broadband, online backup is a no-brainer. It’s easy to set up, cheap, and once it’s done, it’s done. No need to remember to start the backup, or swap the disk. Nothing more to do – ever.

It can take quite some time to back up initially; days or even weeks if you have a lot of data. But so what? As long as it doesn’t need looking after, it doesn’t matter. And once it’s been done once, only the changes are uploaded.
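To get a feel for how long that first upload might take, here is a rough sketch of the arithmetic in Python. The figures are the ones quoted in the comments further down this page (70 gigabytes over a line averaging about 120K upload). Reading “120K” as 120 kilobits per second is an assumption, as are the function and parameter names, and protocol overhead is ignored.

```python
def upload_days(size_gb: float, rate_kbit_per_s: float) -> float:
    """Estimate days needed to upload size_gb gigabytes at a steady rate.

    Assumes decimal units (1 GB = 10**9 bytes, 1 kbit = 1000 bits) and a
    connection that uploads around the clock with no overhead.
    """
    total_bits = size_gb * 1e9 * 8
    seconds = total_bits / (rate_kbit_per_s * 1000)
    return seconds / 86_400  # seconds in a day


# Hypothetical reading of the figures from the comments: 70 GB at 120 kbit/s.
print(f"{upload_days(70, 120):.0f} days")
```

On those assumptions the estimate comes out at a little under two months, which is in line with the experience described in the comments.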

It’s not expensive – especially once you consider how much you’d pay to get your photos back if you were to lose them.

There are many online services and I’ve tried several of the better-reviewed ones.

Overall I’d recommend Jungle Disk (see below) but I must admit to having been impressed by both CrashPlan Central and BackBlaze.

Edit: 23 May 2010: It struck me that whatever online backup solution you use, the passwords MUST be kept off-site somewhere so that you can access the backup from scratch on a new machine (say, in case your place burns down!). The best way to do that is probably to print them off and keep them somewhere with a friend or relative you trust.

I’ve been using Jungle Disk backed by Amazon S3 (not Rackspace Cloud Files) for several years and can’t find anything that ticks all the same boxes.

But my criteria are perhaps more demanding than some. Apart from the data-corruption checking, encryption and high bandwidth that any good backup provider will have, I also want my data stored in more than one geographic location and looked after by a company that’s large enough to be unlikely to go bankrupt.

Amazon S3 writes every file to multiple different geographic locations (and then immediately checks each one to ensure the write was correct).

Most people would probably be happy with BackBlaze or CrashPlan (not Mozy – see below, or Carbonite – which I won’t waste anyone’s time even mentioning further).

The only things that put them in front of Jungle Disk are that they require slightly less set-up and offer a fixed price per month option, whereas the cost of Jungle Disk varies depending on how much data you store. For storing less than 10GB of data, though, Jungle Disk is comparable per month. Unfortunately they just don’t do yearly billing, which means a forex charge on your card every month if you’re outside the USA.

So, assuming you’re not quite as paranoid as I am, and can live with the small risk of the datacenter being wiped out in a far-fetched disaster (and so aren’t going for Jungle Disk+S3), which one would I recommend?

Probably BackBlaze. Although I prefer CrashPlan’s interface, I prefer BackBlaze’s use of a large, trusted datacenter.

Jungle Disk: jungledisk.com

For me this is the only service that does the job – it’s the most fully featured and flexible set-up I’ve found. I use the “Desktop Edition” at $3 a month (though the new “Simply Backup” version at $2 looks good too).

It backs up any kind of file without restriction, it can back up any number of machines, it uploads as fast as your connection can go (but can be “throttled” at times you decide, so your browser doesn’t crawl) and it runs on MS Windows, Mac and Linux. There’s a 5GB file-size limit which might be inconvenient for some but is actually larger than most services allow.

I’ve used it for two years on three machines. I have over 70GB of data backed up and I have used the restore “for real” on several occasions. It just worked.

Jungle Disk is the software that you install to run the backup and restore. But the data itself is stored on either Amazon’s S3 servers or Rackspace Cloud Files.

Why do I use S3 and not Rackspace Cloud Files? Because S3 use multiple, geographically distributed data-centres to hold each file. With Rackspace, each file is held in three separate physical locations in the same datacenter (geographically co-located). Each location does have a separate power supply and Internet connection, but that’s no defence against a plane/earthquake/localised alien invasion.

However, using Rackspace has a different advantage: You don’t have to set up a second account (with Amazon S3) since Rackspace own Jungle Disk. That also means one less forex charge on your card of course.

You pay for the Jungle Disk application and the S3 (or Rackspace) storage separately. And since it’s only about 15 cents per gig per month you’re unlikely to have to pay very much.
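As a rough worked example of that pricing, the sketch below adds the storage charge to the application fee. It assumes a flat 15 cents per gigabyte per month, the $3 Desktop Edition fee and the 70GB mentioned above, and it ignores any transfer or request charges the storage provider may add. The function name and parameters are purely illustrative.

```python
def monthly_cost_cents(stored_gb: int,
                       cents_per_gb: int = 15,
                       app_fee_cents: int = 300) -> int:
    """Storage charge plus application fee, in whole cents.

    Working in integer cents avoids floating-point rounding surprises.
    All rates here are assumptions taken from the figures in this post.
    """
    return stored_gb * cents_per_gb + app_fee_cents


total = monthly_cost_cents(70)
print(f"${total // 100}.{total % 100:02d} per month")
```

Swapping in your own data size gives a quick way to compare this against a flat-rate service.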

As a side bar – I’ve made lists of a few folders and files on Windows XP that you might want to avoid backing up, that can be copied straight into Jungle Disk. For example, your Firefox and Internet Explorer caches are certainly not worth backing up.

(There’s another good overview at onlinebackupsreview.com)

Note: I’ve just discovered Cloudberry Online Backup which has a one-off license fee cost for Cloudberry and then uses S3 as the storage provider – that might be worth investigating since there’s no monthly payment (and forex hit!) but I’ve not evaluated Cloudberry so can’t comment on it directly.

BackBlaze: backblaze.com

Although overall I’d recommend Jungle Disk, I was impressed by BackBlaze.

It’s simple, it works, it seems to back up and restore just fine. It’s a flat $5 monthly or $50 yearly (which is great if you are outside the USA as there will be fewer forex charges on your card). It’s also very good value if you have a couple of terabytes to back up.

Although the “backup the lot” approach of BackBlaze is one of their main selling points, I don’t like the way it decides to back up almost everything by default. The interface to selectively exclude certain folders needed some work when I evaluated it in late 2009 (if you can’t easily tell it to exclude things, it’ll use up bandwidth backing up junk – and make any restore slower than it needs to be). There’s a 4GB file-size limit but as I mentioned above, that’s quite common.

BackBlaze reviews occasionally mention that the company is new, but it has been running since early 2008 as far as I can ascertain, so it’s not all that young any more.

However, although they use a very large, dedicated datacenter (which is also used by Sun Microsystems and Cnet) it still means your data is only held in one geographic location. No plane/earthquake/alien insurance there. But you may be less paranoid about such things than I.

I tried backing up a couple of different data sets consisting of a couple of gigabytes of data and then restored them a couple of times. All ran very smoothly.

(Discount code at onlinebackupsreview.com)

CrashPlan Central: crashplan.com

Again, I’d recommend Jungle Disk, but CrashPlan comes at a better price than Jungle Disk and has a nicer (more configurable) interface than BackBlaze.

Unfortunately they run their own, single, data centre, impressively specified though it is.

Single geo-location for a backup is not unusual, quite the reverse, but at least BackBlaze use the services of a very large and dedicated datacenter provider rather than doing it themselves.

CrashPlan say that it “Supports files larger than you’ll ever need”. Which may be true, but you might want to check what that actually means if you have very large files.

However, the ability to configure exactly what is backed up via a very nice interface is very appealing.

All the backup and restore testing I did ran absolutely fine.

Not Mozy

I’m sorry this section is so long when it only describes what I wouldn’t use – but since Mozy is so popular I thought I’d better explain exactly why I won’t use it.

I really want to like this application for its nice easy interface and general simplicity – but it’s just not reliable. Even the rather partisan review on onlinebackupsreview.com mentions “Some users have reported problems when trying to restore data”.

A “problem restoring” is an absolutely massive flaw in a backup application.

I installed it and backed up 2.5GB of data a couple of years ago. It did have a habit of locking up during a backup but a restart of the machine would un-stick it (everything has bugs, and bugs in uploading I can forgive). But then an upload just failed on a particular file and would get no further. It transpired the backup was corrupt on their server (a bug they have since fixed – apparently) and after a lot of to-and-fro with Mozy Support I was advised to re-upload. Not great if you have 60GB to upload and utterly fatal if you happen to have just lost your hard drive! So I stuck with Jungle Disk.

I installed Mozy again a year or so later (like I said, I want to like it) but after it installed it simply refused to start up.

Now, I know what I’m doing, but pretending I didn’t, I asked Mozy Support for help (I need to be able to recommend this service after all).

I got the usual “Silly user! Here’s how to install the product…” email reply that most first-line tech support will fob you off with, after they’ve completely failed to read your email.

They took a little convincing that I had installed it but it simply wasn’t working. They eventually suggested removing it completely (and provided a thorough description of how to do that) followed by re-installing – which was the answer I was looking for.

I did, it worked, I backed up. I restored a set of files… fine.

I restored the same set of files again for good measure – error! What error? Well who knows!?

There was an “error2” reported at the end of the restore – and the log showed many of the files hadn’t been restored… except that when I checked with a binary file compare tool, they had been.

I compared both the first successful restore and the one reporting errors – they were identical. The files that were logged as having errors had actually been restored just fine.
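For anyone wanting to run the same kind of check without a dedicated compare tool, here is a minimal sketch in Python that hashes every file in two folders and lists any that differ. It is not the tool I used. The function names are made up for illustration, and it assumes both folders are small enough to walk in one go.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """SHA-256 of a file's bytes, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def tree_digests(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its digest."""
    return {
        p.relative_to(root).as_posix(): file_digest(p)
        for p in root.rglob("*")
        if p.is_file()
    }


def differing_files(first: Path, second: Path) -> list[str]:
    """Relative paths that are missing from one side or differ in content."""
    a, b = tree_digests(first), tree_digests(second)
    return sorted(name for name in a.keys() | b.keys() if a.get(name) != b.get(name))
```

Calling differing_files on the two restore folders and getting back an empty list would mean they match byte for byte.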

So although it did work, it reported that it hadn’t and that’s no good if you really are restoring after a data loss and therefore can’t compare with the originals to see if it’s a real error or not. You simply should not have to be deciding anything at all regarding “errors”.

So, I’m sorry Mozy but an unreliable backup isn’t a great deal better than no backup at all as far as I’m concerned.

The above options are far more reliable in my experience.

12 Replies to “Would you mind if you lost all the pictures from your travels?”

  1. Excellent post. All my stuff is backed up on a home server plus an external hard disk but not off site. I must sort that out! The thing that stops me doing a remote backup is speed.

  2. Thanks Robin – hope it helps you to keep your photos as safe as they can be.

    As for speed: upload speed is only a problem for the initial backup. And even then, it’s not like you actually have to do anything of course. :)

    As for download speed, you can always restore just the things you need immediately and then let the bulk of it trickle in over the next few days (download is far faster than upload on most broadband).

    Recently LB’s machine died horribly. She uses it for running her business so I had it well backed-up locally as multiple disk images.

    However, when we bought a new one, we used Jungle Disk to download 4GB of documents overnight and she was back up and running the next morning. It was simply easier to select things for restore from the Jungle Disk interface than to plug in the hard drive, mount the backup image etc. etc.

  3. My vote is for Tarsnap (http://www.tarsnap.com/). I’ve been using it since December last year, which was great as my laptop died in January and I used the service to successfully restore my data. Great features include de-duplication, incremental snapshots, and security by encrypting data before uploading.

    Cheap too: Through incremental snapshots I’ve backed up 92GB, but because of deduplication I’m only storing 12GB, at a cost of around 11 cents per day.

  4. A hugely important, and too often neglected, topic – good to see the various providers compared and stacked up against each other.

    I used JungleDisk a while ago (when it was still free – so, granted, it probably wasn’t the stable version it is now), but didn’t get along with it. My personal choice now is DropBox (www.dropbox.com). It also uses Amazon S3 for storage.

  5. Excellent post Dave. I will have a closer look at Crashplan. The fact that they store the data in one place is a plus imho, because if they stored it round the globe they would also store it in places with less strict data protection laws etc. I’m not saying that the US has the strictest data protection in the world, but it is at least a place with some protection and a working jurisdiction. I personally judge the risk of my NAS at home and the online backup both failing at the same time to be minimal.
    But at the end of the day I still have some fears about storing my data with a provider, where it is not in my hands anymore. Just a feeling but I guess you know what I mean.

    Thanks for the great post

  6. An interesting comparison of services, it’s something I’ve thought of but the sheer volume of data and the recurrent costs – however modest – have put me off.

    Does the software have a true ‘syncing’ facility to highlight newer / older / orphan files on both local PC and remote server?

  7. Hi Chris – Glad you like the posting and that you have those amazing images of yours backed up!

    I’m also a big fan of DropBox. I use it as a file transfer and sharing tool though (which is its main use – and why I’ve not mentioned it as a backup tool).

    I also think it’s worth installing just to get the “Public” directory that allows you to send URLs of large files rather than trying to work out how to email them!

    But for backup – I’d rather have a dedicated backup tool that I can point at any directories I like and have them backed up (unless you’re using mklink or Synctoy perhaps?).

    But it looks like DropBox will be enabling a “Watched Folders” feature in the future which would turn it into a far more effective backup tool…

  8. Hi Roman,

    S3 stores the data in multiple locations within the same region – so if you store in Europe (in fact, Ireland) it’s all in Europe.

    Strange that you would prefer the US – most people from Europe avoid storing data there since they are not considered to have strong enough data protection laws.

    But, if you do store in the S3 USA Region it’s actually compliant with EU data protection laws because Amazon participates in the European Safe Harbor Program.

    When you create your “bucket” of storage you can choose which location to use. Europe will have less latency and also give you the ability to use the Export feature: you post a drive to them and they load data on to it before posting it back.

    You can’t yet use the Import feature to upload your initial backup but apparently that’s expected to be made a feature of Jungle Disk very soon…

    As for concerns about storing your data with someone else – that’s easily addressed by using the built-in encryption facility. You can simply choose to have all the files that you send encrypted before they leave your machine. With a long enough password that should make it very secure (but don’t forget to keep that password safely printed out somewhere other than where the computer is kept!)

  9. Hi GeoffC,

    The volume trickles up without you really being aware of it :)

    I’ve backed up 70 gigabytes afresh recently (to RackSpace – before I realised they use only one datacenter for my data). It took about two months across an ADSL rated at 10meg (I get about 9.5meg download but averaged only 120K upload!).

    A couple of months flies past of course. And even if it takes six – well, once it’s up it doesn’t take long at all to upload the differences.

    Not quite sure what you mean by a “true” syncing facility?

    Online backup applications will only ever upload changes to files. The better ones only upload the changed blocks within the files.

    Deletions are generally reflected to the backup but old versions are usually available. It just depends on what application you use – Jungle Disk has very configurable rules regarding how many old versions to keep and for how long.

  10. Thanks Dave, you really got me thinking. The problem is I am still not sure what the best solution is for a Mac user. I use Dropbox, I have the Mac iDisk and Time Machine stuff, but I recognise that there is possibly a better setup.

    It seems that Jungle Disk is the best and I suppose I just have to bite the bullet and go for it.

    Thanks and keep up these well informed posts for us less informed users.


  11. Hi Roger,

    It’s something that’s occupied way too much of my time! So it seems only right to help others avoid having to make such effort to figure out what works well.

    Jungle Disk is currently the best in my opinion. Though I must admit I’m hoping for someone else to create an “all you can upload” fixed price offering with an application that’s as solid as Jungle Disk!
