Tag Archive for 'replication'

New Webinar Replay: The Future of Big Data for the Enterprise

You may have heard that we’ve just launched the world’s first production-ready Apache Hadoop 2 distro. This WANdisco Distro (WDD) is a fully tested, production-ready version of Apache Hadoop, based on the most recent Hadoop release. We’re particularly excited, as the release of WDD lays the foundation for our upcoming enterprise Hadoop solutions. If you want to find out more about WANdisco’s plans for big data, the replay of our ‘The Future of Big Data for the Enterprise’ webinar is now available.

This webinar is led by WANdisco’s Chief Architect of Big Data, Dr. Konstantin Shvachko, and Jagane Sundar, our Chief Technology Officer and Vice President of Engineering for Big Data. Jagane and Konstantin were part of the original Apache Hadoop team, and have unparalleled expertise in Big Data.

This 30 minute webinar replay covers:

  • The cross-industry growth of Hadoop in the enterprise.
  • The new “Active-Active Architecture” for Apache Hadoop that improves performance.
  • Solving the fundamental issues of Hadoop: usability, high availability, HDFS’s single-point of failure and disaster recovery.
  • How WANdisco’s active-active replication technology will alleviate these issues by adding high-availability, data replication and data security to Hadoop, taking a fundamentally different approach to Big Data.

You can watch the full ‘The Future of Big Data for the Enterprise’ replay, along with our other webinars, at our Webinar Replays page.

Putting The Cloud Into An Eyedropper

GeekPhilosopher: Instant download of free stock photos, images, backgrounds, and desktop wallpapers. Pictures can be used for personal and commercial web sites.

When you think of “the cloud”, what comes to mind? For some, it’s racks and racks of commodity hardware. Others may consider the benefits of dynamic scalability or worldwide availability.

There’s another side to the cloud on my mind, which is that many, and likely most, of the applications that enterprises use to run their business were not architected for cloud infrastructures. For example, SCM tools like Subversion, Perforce, and others eye an uneasy life in the cloud. Ten or more years ago, IT environments often centered around bare metal deploys of “big iron” servers. Lights flashed and cables snaked around massive refrigerated server farms.

Fast forward to today’s young high and not-so-high tech firms. On my first day at WANdisco, I found myself standing in the kitchen, wondering where the server room was. Turns out, I was looking at it: a few small boxes tacked up on the wall near the paper towels. Of course! Today’s companies have outsourced many previously in-house applications to cloud based services such as Gmail, Google Docs, Salesforce.com, WorkDay, and many more.

Legacy applications, often used to the high processor speeds, large amounts of RAM, and RamSans of traditional vertically scaled environments, are often fish-out-of-water in a typical horizontally-scaling, public cloud environments such as Amazon EC2.

The solution moving forward has to be grueling, ground-up rewrites of these applications for the multi-machine, multi-master environment of today’s cloud.

Or does it?

While learning about WANdisco’s patented replication technology and implementation called DConE, it occurred to me that it brings many of the benefits of the cloud to existing applications. DConE has effectively condensed many of the benefits of the cloud into a virtual eyedropper, where it can easily be dripped onto almost any existing database application.

For example, it converts a centralized, vanilla Subversion repository into a multi-machine, multi-master, robust replication group with no single point of failure. It adds scalability by easily configuring read-only nodes. HADR (High Availability Disaster Recovery) is built in: if a node loses connectivity, the others transparently take over the load. In many cases, users will not even be aware of a problem. When the previously isolated node comes back on line, it silently catches up with the others.

Importantly, DConE can easily and quickly be integrated with new applications. It’s not the cloud as most think of it, but could it be considered a form of virtualization that delivers some of the promise of the cloud to essential applications?

What applications are you dependent on that could benefit from cloudlike capabilities of HADR, global multi-site, and horizontal scalability? We’d love to hear about them.

We Just Acquired Big Data / Hadoop Company AltoStor.

I believe that the combination of AltoStor’s expertise and WANdisco’s patented active-active replication technology is the proverbial ‘marriage-made-in-heaven’.  The AltoStor acquisition will enable us to launch products into the highly lucrative Big Data / Hadoop market early next year.

So how lucrative is this market?  Well, I recently read an interesting article in Wikibon “Big Data: Hadoop, Business Analytics and Beyond” that Big data / hadoop market sizereiterated what we already knew.  Big Data isn’t a might happen next year thing.  No, it’s here today, to steal a quote from the excellent article: “Make no mistake: Big Data is the new definitive source of competitive advantage across all industries. Enterprises and technology vendors that dismiss Big Data as a passing fad do so at their peril and, in our opinion, will soon find themselves struggling to keep up with more forward-thinking rivals…. For those organizations that understand and embrace the new reality of Big Data, the possibilities for new innovation, improved agility, and increased profitability are nearly endless.”

So why did we acquire AltoStor?

First off, the founders (Dr. Konstantin Shvachko and Jagane Sundar) are really good guys.  This was an ‘old-school’ acquisition.  An initial deal was struck very quickly with a handshake.  Both sides could see very clear value – so doing the deal was incredibly simple.  I love the fact that they wanted stock as consideration – that’s real proof that they see significant long term-value creation rather than short-term gain.

For WANdisco Big Data is a Big Market.  We can see clear synergy between our unique / patented active-active replication technology and the hadoop logocreation of Hadoop high availability (HA) solutions.  This is one of the reasons why AltoStor was so attractive to us.  They have unique knowledge in the space:

•            The AltoStor founders have been working on Hadoop since its inception in 2006 at Yahoo.  Konstantin was part of Doug Cuttings team that created and implemented Hadoop.  His focus was massive scale, performance and availability of Hadoop – developing the Hadoop Distributed File System (HDFS).  He then went on to eBay where he implemented Hadoop.

•            The Founders are intimately aware of the problem WANdisco is planning to solve around Hadoop HA and hence understand the value of the solution in large scale Big Data replication over a Wide Area Network.

•            Finally, AltoStor are developing a product that is slated for release in Q1 2013, that will significantly simplify deployment of Hadoop / Big Data for enterprises.

Following the acquisition we now expect to have products available in the first quarter of 2013.  That’s very good news.

There’s going to be a lot of noise in this space over the coming months and years.  Many will jump on the ‘bandwagon’, making all sorts of lavish claims to be ‘the big data this’ and ‘the big data that’.  It always happens in hype-cycles like this.  In reality most are just companies repurposing existing legacy products and slapping a new label on it.  This is NOT one of those.  We are building from the ground-up with unique knowledge and information that only a few in the world have (the amount of brain-power in the room during some of the early design meeting was frightening!)

In 2005 when we founded WANdisco my peers would tell me that active-active replication over a Wide Area Network was impossible.  Well we’ve got hundreds-of-thousands of users using the technology for core development every day.  Applying this technology to Hadoop is groundbreaking and I think it will change the way the industry views network storage.  We like making the impossible possible at WANdisco.


About David Richards

David is CEO, President and co-founder of WANdisco and has quickly established WANdisco as one of the world’s most promising technology companies. Since co-founding the company in Silicon Valley in 2005, David has led WANdisco on a course for rapid international expansion, opening offices in the UK, Japan and China. David spearheaded the acquisition of Altostor, which accelerated the development of WANdisco’s first products for the Big Data market. The majority of WANdisco’s core technology is now produced out of the company’s flourishing software development base in David’s hometown of Sheffield, England and in Belfast, Northern Ireland. David has become recognised as a champion of British technology and entrepreneurship. In 2012, he led WANdisco to a hugely successful listing on London Stock Exchange (WAND:LSE), raising over £24m to drive business growth. With over 15 years' executive experience in the software industry, David sits on a number of advisory and executive boards of Silicon Valley start-up ventures. A passionate advocate of entrepreneurship, he has established many successful start-up companies in Enterprise Software and is recognised as an industry leader in Enterprise Application Integration and its standards. David is a frequent commentator on a range of business and technology issues, appearing regularly on Bloomberg and CNBC. Profiles of David have appeared in a range of leading publications including the Financial Times, The Daily Telegraph and the Daily Mail. Specialties:IPO's, Startups, Entrepreneurship, CEO, Visionary, Investor, ceo, board member, advisor, venture capital, offshore development, financing, M&A

How to Cut Development Time in Half, Improve Build Performance by 500% and Eliminate Downtime

SSP, a leader in software applications for insurance and financial services, with developers in the UK, Australia and South Africa did it by implementing Subversion WAN Clustering (MultiSite) .  As soon as developers at one site commit changes they’re available everywhere at LAN-speed. SSP’s developers in the UK, Australia and South Africa checkout and commit changes to the same files simultaneously and have immediate access to each other’s work.  Merge conflicts and other problems that weren’t discovered for days or weeks until it was time to create a build, are caught and fixed when they happen.  The best talent for a project regardless of location can work together as one agile, virtual development team to get the job done faster.  The net result is that the time SSP has to spend on QA and rework has gone down so dramatically that development cycles have been cut in half.

Now that every developer has instant access to the latest changes regardless of where they came from, builds can be created and tested at each site in less than a day, instead of waiting up to 5 days for a central team to complete their development work and schedule in builds for other sites.

SSP has also been able to go 24-by-7 with no downtime because Subversion WAN Clustering has turned their distributed Subversion servers into mirrors of each other. When one server goes down for either a planned or unplanned outage, users failover to another site and keep working. When a server comes back online it recovers automatically, grabbing all of the changes that happened at other sites while it was offline.

Get SSP’s full story to learn more about how all of this was accomplished.


About Jim Campigli