alanwilliamson
This is rather is timely considering my talk last week at London's CloudCamp. I was the last one to speak. I spoke about the need to take responsibility for your cloud infastructure and not rely on any of the marketing hype from any of the companies. I painted real world stories and explained how we utilise two providers (Amazon and Flexiscale) for our cloud requirements to insure against complete black out.
The previous speakers all towed the "cloud computing" is great company line, your eggs are safe in our basket type of pitches. Including a quick talk from Martin Buhr from Amazon. He took the usual route of explaining how great Amazon was at scaling out very quickly, demonstrating how some projects scaled up to 1000 of instances in some cases.
But the problem with these examples is that by in large they are artificial. Many do not need to do this one-off 'batch' processing. If one or two instances out of a 1000 go down then who cares? Or even if the network goes down, who cares, the processing will still happen behind Amazon's firewall. Even, in the worse case, the system dies completely, they can simply restart the batch.
But for users that demand as near to 100% uptime as possible, then the cloud story gets a whole lot cloudier.
Only a couple of weeks ago, Amazon's SimpleDB suffered a short outage, resulting in all queries returning null. Back in February 2008, S3 suffered an outage. This weekend, Amazon has once again been down for nearly 8 hours.
As you can imagine their forums are alight with lots of angry customers, although their focus is misguided. Amazon is not at fault, they are.
Just because you use S3 or EC2, why do you assume all the usual rules of server management go out the window? They still apply. If you were hosting your own file space, then you would have it backed up and if it was crucial to you business, then you would have an alternative.
If, like the vast majority, you fly by the seat of your pants, relying on a single thing, then expect your pants to catch fire every so often. Blame no one but yourself.
As I illustrated in my CloudCamp talk, outages can and do happen. So it is up to you to figure out what to do when things go horribly wrong. In this particularly instance, no amount of support money to Amazon would have fixed this problem any quicker. So save yourself the support cost, and instead, build out an alternative path should your primary cloud provider take a nose dive.
Trust no one. Use all your years of server hosting experience and apply it to the cloud.
Incidentally, I will be talking at the Cloud Computing Expo in November at San Jose, and in that session I go over the logistics of making a truly redundant cloud architecture.
Comments
please note, all comments will be moderated for spam and abuse before being publicly posted.
Article Details
- Published:
8:07 AM GMT, Monday, 21 July 2008 - Categories:
Technical General - Tags:
cloud computing amazon flexiscale - Comments:
3 left; add comment
Related Articles
Article Archives




Cloud computing is the wave of the future and its truly cost-effective for startups, but Amazon continues to drop the ball, I would urge users to move over to Nirvanix, To make the process of file and folder migration easier, we have made a software tool you may download here:
http://developer.nirvanix.com/files/folders/applications/entry886.aspxCloud computing is the way forward, but we need to work a little harder to make sure we are not beholding to just a single vendor.
After all, we would never have hosted our servers with an ISP with just one pipe to the Internet, yet, here we all our flocking to the cloud providers putting all our eggs in one basket.We are using S3 and our service was suffered by this incidence. Despite this incidence, I still believe that the cloud computing is the right direction. We just need to see more competitions, so that overall quality of could computing service will improve.