Adaptec Advisors are Back!
Adaptec PR firm sent a note mentioning that Adaptec Storage Advisor’s blog is back! Check it out.
I am also trying to get back to updating my blog after a long hiatus. Hopefully with some small and quick blog posts on regular basis, my writing habit will establish. In the mean time, enjoy the sights from my various trips.
How do you overcome writing drought?
Originally posted at http://andirog.blogspot.com.
Online Backup Services - Six Questions
During my visit to Denver few weeks ago, I had the opportunity to talk with folks working with online backup and archive cloud services. Some of my impressions from these discussions are interesting and worth sharing. These are based on what I heard from professionals working for or providing services to online backup service providers. These are not result of a full-blown survey, and at best anecdotal. You are welcome to respond to these questions if you like via comments, emails or your own blog post.
Q1: Who are the primary adopters of Online Backup Services?
Individuals and small businesses.
Entities with fewer than a dozen workstations .
Few with a centralized server.
Q2: What was the primary backup method before adopting online backup?
None.
A USB key or USB attached disk drive.
Few with a share on another workstation.
Q3: What was the offsite backup strategy before adopting online backup?
None.
A Floppy, USB key or CD with important files.
Few with a mobile HD.
Q4: What is the subscription and retention rates for online backup service?
High subscription rate.
Very low retention rate.
Most abandoned service within few weeks.
Q5: What are the primary reasons provided for discontinuing use of online backup service?
Excessive use of Internet connection.
Backup takes too long.
Poor experience during primary use of workstation.
Q6: What was the backup method after discontinuing online backup?
A USB attached disk drive.
A NAS device on network.
Few with no backup method.
Summary
Overall, online backup services seems to be a great way to introduce backups to people with no prior backup methods as only few reverted back to no backups after discontinuing use of online backup service. Tape is non-existent in environments that are finding online backups attractive. Despite heightened awareness of online backup service, the low bandwidth connection to Internet continues to be main hurdle in retaining subscribers, a focus on spending limited resources on sales improving or cost reducing services over a fear-based buying decision. A comment I heard was,
I prefer to allocate 50% of Internet bandwidth to VoIP services that reduce my telecommunication cost instead of to offsite backup.
Originally posted at http://andirog.blogspot.com.
Denver Visit, New Piñata & Scalability Videos
Week in Denver
I will be in Denver this week till Friday June 27th. Unfortunately, I will miss nPost Golf 2.0 event in Seattle.
Despite a busy work day schedule in Denver, I am looking forward to seeing some friends and colleagues also. If you are a fellow storage blogger or reader or working on a cool storage technology and located in Denver area, ping me and we can meet one evening during my visit.
New Piñata for EMC & IBM
Recently, a reader alerted me to new Data Domain blog Dedupe Matters written by Brian Biles. Welcome Brian to the world of Bloggers. Lets see how quickly EMC and IBM bloggers make you the new piñata like they did to HDS bloggers.
In any case, it’s a nice change from their rumored no blogging policy. Hopefully, blogging at Data Domain will go beyond people in Ivory Towers.
Video of Presentations from Google Scalability Conference
Google already uploaded the videos of presentations from last week’s Google Scalability Conference. I also plan to discuss some of the presentation topics in further details as time permits.
Welcome Remarks by Brian Bershad
GIGA+ by Swapnil Patil
HPC with NetworkSpaces for R by David Hendersen
Chapel by Brad Chamberlain
SMP via Transactional Memory by Vijay Menon
Communicating Like Nemo by Jennifer Wong
Maidsafe by David Irvine
CARMEN by Paul Watson
Scalable Wikipedia by Thorsten Schuett
Originally posted at http://andirog.blogspot.com.
Google Conference on Scalability - First Impression
As expected from conference schedule, Google conference turned out to be a technical event primarily focused on parallel programming and infrastructure scalability. At last minute, Google decided to merge two tracks in to one. Though, I got to attend all the sessions, they felt time-compressed and rushed. I was surprised to see lot of attendees who came from outside Seattle. I met quite a few people from Bay area, Canada and Europe. I enjoyed the sessions though some audience members commented about very technical nature of the conference compared to previous year. As Brian Bershad, Google commented in his welcome speech, the challenge is to find technologies and solutions to scale handling search queries from 600 million to 6 billion. And, I came away better informed on different challenges and potential solutions we may see down the road.
I also sat down and chatted with Robin Harris. We decided to forego making a video of our conversation. I am not a big fan of talking head videos or podcasts unless they leverage the unique values of these methods not available through written words or pictures. And who wants to listen to two storage bloggers chatting about nothing. I find them miserable myself so why put others through the same misery.
In my opinion, three sessions: CARMEN: a Scalable Science Cloud [PDF], GIGA+: Scalable Directories for Shared File Systems [PDF] and maidsafe stood out at the conference from infrastructure scalability perspective. Communicating Like Nemo was very entertaining. The common theme in audience questions on most infrastructure presentations was reliability, availability, scalability, and security of the offered solution. It is a good indication of what is on the mind of people when evaluating new infrastructure offerings. With the popularity of hashing in storage of data, speeding up hash lookup is becoming an interesting problem for scalability.
David Irvine’s session on maidsafe was the only session where a speaker white-boarded most of the presentation. His confidence and knowledge was commendable. Not many speakers can pull off white-boarding 80% of presentation with 100s in audience. Comparing maidsafe with ant colony was an interesting way to show scalability and simplicity of solution. Maidsafe solution seems to be in same category as RevStor, Seanodes, Cleversafe, Oceanstor, Farsite and several others that are trying to leverage storage across 100s and 1,000s of distributed nodes in a peer-to-peer or quid pro quo network, a solution most likely attractive to players in cloud and web distribution market.
Originally posted at http://andirog.blogspot.com.
Heading to Google Scalability Conference
Like Robin Harris, I am also attending Google Scalability Conference in Seattle Friday and Saturday. Hopefully, I will see him at the event. All sessions at Google conference look good. Unfortunately, I will miss half of them as two tracks are running in parallel. I am looking forward to hearing about maidsafe, CARMEN, GIGA+, and Google Maps scale down.
Recently, several readers inquired whether I lost interest in blogging as I am posting very irregularly. Nothing can be further from the truth. I am still as excited about blogging as I was almost five years ago when I first delve into blogs. The lack of frequent updates is due to my attention being somewhere else (new dig, rig and gig). Will elaborate some other time.
BTW, checkout Storage Optimization blog by Carter George. His startup Ocarina got interesting story with their data footprint reduction technology. I am looking forward to learning more once I refocus on new exciting stuff in storage. Hopefully, Ocarina can change the KPCB’s luck in storage space.
Originally posted at http://andirog.blogspot.com.
Do You Got Talent?
After a long day of work, this clip made me laugh. Nothing to do with data storage, just a performance from British TV Show, Britain Got Talent.
Originally posted at http://andirog.blogspot.com.
Online Backup: 100% Install
My last post Online Backup any different from Traditional Backup for Laptop/Desktop? was quickly turned in to us vs. them argument by Beth Pariseau in her blog post Blog dialogue: Online vs. traditional backup. I guess my curiosity and conversation starter about slow adoption of online backup didn’t come across clearly.
… Gupta probably has “too much” experience with backup clients to necessarily see things from the SMB customer’s point of view. For him, installing a backup client isn’t a big deal–for some, it might be enough of a reason to let somebody else deal with it.
Initially, I thought about pulling Tony on her. On a side note, I wonder why Tony spills coffee every time Hu sneezes.
More I analyzed her statements, more I realized her opinions most likely resulted from what she heard as a storage news writer and from whom instead of her own experiences. Keywords like SMB are a good giveaway whom she is listening to. Not many practitioners try to segment customers with mile-wide brush.
Lets start with addressing her installation related concerns. Do online backup services magically appear and start working on your laptop/desktop by themselves? No, someone has to download and install them. Only backup clients that come pre-installed on your system are the ones that don’t require install. As I understand, there are two main backup clients available that don’t require installation and readily available to users, one provided by Microsoft with Windows XP (Windows Backup) and other one provided by Apple with Leopard (Time Machine).
Lets add configuration of the backup client to the part of “difficult to install” equation. Configuration of Mozy Pro [PDF 46 pages] and Windows Backup [Web page - 6 pages if you decide to print], are available online for your review and comparison. Of course, Time Machine is so simple to configure that even someone like me, who misunderstands backup needs of SMB according to a marketer, implemented on MacBook without instructions. BTW, AppleInsider article Road to Mac OS X Leopard: Time Machine is a good overview of Time Machine.
You be the judge how difficult each one is to install and configure.
As I wrote in my comment on Beth’s blog, my intention is not to promote one method over another, just to show similarities and question the current implementations. Hopefully, these posts are setting the stage for future opinions and conversations that will help improve current BaaS offerings and develop new ones.
More to come.
Originally posted at http://andirog.blogspot.com.
Online Backup any different from Traditional Backup for Laptop/Desktop?
Recently, Beth Pariseau wrote in her blog post HP unveils unlimited online storage for SOHO market that bandwidth is one of the hurdles in adoption of online backup services.
Like most online storage offerings to date, this offering is small in scale and limited in its features when compared with on-premise products. Most analysts and vendors say online storage will be limited by bandwidth constraints and security concerns to the low end of the market, with most services on the market looking a lot like HP Upline.
Though, it is a validation of my thoughts expressed in blog post Bandwidth, one hurdle in adopting Cloud Storage, I am not totally convinced of bandwidth being the root cause of limited adoption. There may be something else hindering adoption of online backup services.
Recently, Scott Waterhouse, an EMC blogger also has been discussing the virtues of Mozy, an online backup service (acquired) by EMC. I agree with his argument about the challenges of traditional backup clients in post Mozy as the Future of Backup.
Big business has a lot of data on laptops and desktops. Traditionally, installing backup clients on these systems has been costly, full of headaches, and generally causes more problems than it solves. The consequence of this is that most folks just don’t protect them.
Is Mozy client any different? Is there any difference in installing, configuring, using and maintaining traditional backup client versus Mozy client on laptop/desktop? Nothing, I noticed after reading his posts.
My intention is not to pick on Mozy or Scott but there is nothing unique in most Online Backup Services that couldn’t be in traditional backup for laptop/desktop. At least traditional backup also come with peace of mind that all backups are stored on company’s own infrastructure. In last few years, I tried over a dozen online backup services in addition to putting up with traditional backup clients for laptop/desktop and I don’t see much difference among the two.
IMO, most online backup services are just taking existing on-premise backup strategy for laptops/desktops and repackaging it to run backups to somebody else’s infrastructure instead of your own. What do you think?
Originally posted at http://andirog.blogspot.com.
Storage Jobs @ Startups
Recently, Nathan Kaiser at nPost contacted me regarding his new widget displaying Startup Jobs on blogs. As sidebar on my blog is already too long, I decided to include his widget in a blog post. Try it out and let me know your feedback (positive and negative).
Startup Jobs at nPostP.S. If you are using a RSS reader like Google Reader and don’t see the widget, please visit my blog. While I am writing this post, I am not sure if widget will show up in the blog post either. In case it doesn’t, please visit nPost Startups Jobs site to check out the startup jobs. Use keyword “storage” to find storage jobs at startups.
Originally posted at http://andirog.blogspot.com.
Is number of objects true indicator of Amazon S3 growth?
In my last blog post, I estimated the data stored on Amazon S3 in exabyte range using 18 billion objects stored reported by Amazon CTO, Werner Vogels in his blog post.
In retrospect, it was an over-estimation by several order of magnitude (my bad) that was promptly corrected by MikeDoug using another data point AWS revenue. MikeDoug estimated (comment excerpts below) the data stored to be in 20PB (petabyte) range, way short of my estimates and may be more closer to reality.
No, doubt, it is still a significantly large number for a service that is only few years old. But, S3 growing up fast may not be as obvious from growth in stored objects as Vogels would like us to believe.
A recent report puts ALL of AWS at the 50 to 70 million in revenue for the year.Let us pretend that, of the 70 million, 40 million in revenue was attributed to S3 alone for last year. That would be $3,333,333 a month for S3. This converts to 22,222,222 gigabytes, or 0.02 exabytes.
Other interesting tidbits if S3 has 20PB of stored data, 18 billion objects and 330,000 registered developers:
On average, each object is only storing about a megabyte of data. This number seems quite low so either deleted objects are being included in the published number of objects or developers are keeping object size low to prevent transfer timeouts.
On average, each developer is only storing 54GB of data. Considering some services like SmugMug are storing terabytes of data on S3, most probably there are lot of registered developers either not using S3 actively for storing data or have services under development.
Originally posted at http://andirog.blogspot.com.
| M | T | W | T | F | S | S |
|---|---|---|---|---|---|---|
| « May | ||||||
| 1 | 2 | 3 | 4 | 5 | ||
| 6 | 7 | 8 | 9 | 10 | 11 | 12 |
| 13 | 14 | 15 | 16 | 17 | 18 | 19 |
| 20 | 21 | 22 | 23 | 24 | 25 | 26 |
| 27 | 28 | 29 | 30 | |||
