Scale Computing CTO: Big Data demands scale-out storage
By tightly integrating storage and virtualisation, midmarket companies can ensure high availability for their applications
By Sophie Curtis | Techworld | Published: 13:48, 05 November 2012
Scale-up storage remains so predominant in the enterprise only because most business applications were built to run on scale-up architecture, but as demand grows for big data applications, enterprises will increasingly look to adopt a scale-out model.
So says Jason Collier, CTO of Scale Computing, who was recently in London for the UK launch of the company's HC3 “datacentre-in-a-box” solution. According to Collier, applications that are written for the enterprise are not written to take advantage of big scale-out infrastructure.
“A lot of enterprises are still running Microsoft apps, the SAPs, the Oracles, that are engineered to run on what they're running on today, but that will change,” he said.
“Taking unstructured data in an organisation and running analytics on it – that cannot be done on a traditional scale-up model. The way the data structure's laid out, there's not enough compute horse power that you can hook up through a single array to pull off that kind of analytics.
“When those applications do get built for doing analysis of unstructured data, that's when you're going to see that application shift in the enterprise adoption of scale-out.”
Collier said that organisations working in the field of high-performance computing (HPC) have relied on scale-out storage for years, because they require a certain level of flexibility and cannot risk having a single point of failure.
Web 2.0 companies like Facebook and Google, as well as cloud providers like Amazon and Rackspace, also all have a scale-out architecture running on commodity hardware. This is because the data is distributed and therefore very safe and quick to access.
Meanwhile, enterprises are becoming technology laggards, because they continue to use traditional scale-up systems from the likes of EMC and Hitachi Data Systems (HDS) while massively growing their data.
“It is going to be very cost inefficient when you compare that to a stack of commodity hardware running the cheapest drives you can sling behind them, but still having the resiliency where you can lose racks and lose data centres and still have availability of that data,” said Collier.
Some companies are looking to the cloud as a way to handle their big data needs without having to invest in massive storage arrays, but Collier warned that the feasibility of cloud computing is entirely dependent upon the application.
“The access to the data, the security round the data, those are things that are still concerns for enterprises,” he said.
“The large cloud providers are not set up to provide large instances to run applications in. They're set up basically for test and development. So most of the things that go on inside Amazon, Rackspace and the other cloud providers are basically spinning up small instances to run test applications on and spinning them down.”
Collier said that large enterprises will continue to invest in their own storage arrays because most of them are not yet ready to fully virtualise their applications.
“It's one of the reasons why VMware and some of the other virtualisation companies aren't growing as fast as they have been. It's because they've saturated the test and dev in these enterprises but they haven't been able to move them to production.”
In the mid-market, however, organisations will buy virtualisation not for test and development but because they want high availability for their production apps.
Virtualisation, in combination with scale-out storage, is the easiest way to ensure high availability, but a lot of mid-market businesses do not have the in-house expertise to do the work themselves.
“In SMB and mid-market, less than 10 percent have done full-on production-level virtualisation, and almost none of them have actually done any virtualisation whatsoever, and that primary driver is the complexity,” said Collier.
“They have the same needs as the enterprise, in that they need high availability for their applications, they need to be able to dynamically grow their business when they need to, but they don't have the virtualisation experts, they don't have SAN administrators, they don't have networking guys with that level of expertise.”