DevOps: Hidden risks and how to achieve results

Industry insights from experts including Gene Kim & the Dynatrace Innovation Lab

Executive Summary

The top performing organizations know that they must adopt a continuous and collaborative IT delivery model to meet the ever-increasing market demands for more features, faster.

DevOps is proven to produce faster deployments with fewer problems. And the leaders are seeing real business gains: they are 2.5x more likely to exceed profitability, market share and productivity goals and have 50% higher market capital growth over three years.

The market leaders’ results show us that DevOps is necessary to be competitive, but there are many challenges in the way. Fortunately, we have some experts to help. Recommendations in this eBook are based upon Gene Kim’s keynote presentation at the Dynatrace annual Perform conference, the book he co-authored, The Phoenix Project, and the expertise of the Dynatrace Innovation Lab.

Leaders are seeing real business gains; they are 2.5x more likely to exceed profitability, market share and productivity goals.

This eBook is for all Development Managers, Operations Managers and DevOps Managers. Whether you are just starting your DevOps journey or still considering how a DevOps transformation can help your organization, this eBook will give you insight into why it is important, what makes the leaders successful and provide guidance on how to accelerate your results.

Chapter 1

DevOps & High Performance

DevOps is Not Just Last Year’s Buzzword

High-performing DevOps is the continuous delivery of technology made possible through a collaborative IT culture and shared performance metrics tied to business results, supported by automated tools throughout the development lifecycle.

Google, Amazon, Netflix, Facebook, Etsy, Microsoft and countless tech start-ups are doing it. So are The Gap, Nordstrom, the World Bank, and Intuit Does it surprise you that even the UK Government and the U.S. Department of Homeland Security are doing DevOps?

All of these organizations know that today’s marketplace is driven by ever-increasing demands for features, speed and reliability; they must continuously deliver new value without disruption to their customers’ experience. And the way to get there is DevOps.

A little skeptical? Are you thinking: wasn’t DevOps just the buzzword of the year last year? Is there really value there for my organization and me?

There are many different views on what it means to have a high-performing DevOps organization. From our experience working with IT organizations across many industries, we have found certain qualities to be critical for today’s high-performing organization.

The principles that make DevOps work are far from new. Lean manufacturers embraced these ideas in the 1980s to build with unprecedented speed and quality. Other organizations like big-box retailers and service firms have gained a competitive advantage through the use of cross-functional teams, shared performance metrics and workflow automation for years. And, Agile software development has been around since 2001.

Today, Your IT Organization is Your Competitive Advantage:

  • Yesterday's IT
  • IT as a cost center
  • Developers blame Ops—Ops blames developers
  • Waterfall development
  • Performance considerations begin at code check-in
  • Deployments only 1-2x a year
  • Change approval board required to deploy
  • Version control for code only or no version control at all
  • Failures found by NOC or worse, by customers
VS.
  • Today’s High Performing DevOps
  • IT as a competitive advantage
  • High trust organization with shared goals
  • Continuous integration and delivery
  • Performance is a critical requirement and is considered throughout the development lifecycle
  • Multiple deployments a day
  • Peer review of code quality
  • Version control for code, system, application configurations and data
  • Proactive monitoring to find failures first

DevOps Results

Organizations that have truly adopted DevOps principles across the application development lifecycle are deploying faster, having fewer problems and are seeing real business gains.

Flickr was one of the first to announce they were following DevOps principles with 10 deployments a day way back in 2009. TurboTax recently made 165 production changes during peak tax season resulting in 50% increase in website conversion rate. Amazon deploys at an amazing pace: every 11.6s with 23,000 deployments a day. They have had 75% fewer outages since 2006, 90% fewer outage minutes, and only 0.001% deployments cause a problem.

Here are some more interesting facts about high-performing DevOps from the two groundbreaking PuppetLabs reports in 2013 and 2014:

They are Faster, More Reliable and are Winning the Marketplace.

Chapter 2

Hidden Risks

What Are the Biggest DevOps Obstacles You Will Face?

The experiences of market leaders make it clear. DevOps, if it is not now, will be a necessity in the near future just to keep up. But a culture change is never easy. Here are some of the top DevOps challenges that organizational teams face:

15 minute tasks take weeks to complete

“Improving daily work is even more important than doing daily work.”
- Gene Kim, Co-author of, The Phoenix Project: A Novel About IT, DevOps, and Helping Your Business Win

Chapter 3

“The Three Ways” - Principles That Enable High Performance

The Situation Seems Pretty Bleak, Doesn’t It?

It doesn’t have to be.

Some great input on how to build a true DevOps culture comes from The Phoenix Project by Gene Kim et al. In the book, the IT manager saves the day by using The Three Ways, a mysterious yodalike philosophy. It is a conceptual framework that can help drive any organization’s transformation to a high performing DevOps culture.

DevOps Results

“A must-read for anyone wanting to transform their IT to enable the business to win.”
- Mike Orzen, co-author of the the Shingo Prize winning book Lean IT: Enabling and Sustaining Your Lean Transformation

The First Way: Systems Thinking

This first principle of The Three Ways philosophy emphasizes the performance of the entire system and builds quality into the products and the software development lifecycle itself.

Your goal with this principle is to continuously increase the flow from development to operations and minimize any return flow. Operations should be involved early on in the development process so that they can plan for any necessary changes to the production environment (as described in more detail in Continuous Delivery by Jez Humble and Dave Farley).

Smaller, more frequent software releases reduce waste and improve your timeto- market. Automated testing should be used to uncover defects before they affect your business.

Facebook released their chat service in less than a year to 98 million users by deploying code directly to production every single day.

Facebook Chat Service

The Second Way: Amplify Feedback Loops

The second principle is about establishing a feedback process from operations back to development.

Infrastructure and application performance monitoring, combined with strong communication processes, can eliminate finger-pointing. You can recognize and resolve problems quickly, and then prevent them in the future.

“Having a developer add a monitoring metric shouldn’t feel like a schema change.”
- John Allspaw, SVP Tech Ops, Etsy

An example practice is to let development, testing and operations get together frequently to jointly analyze issues in production. After such a meeting:

The Third Way: Continual Experimentation and Learning

The third principle is about continuous improvement.

Having mastered the first two ways, you can now afford to experiment and take risks. You can even inject faults into your system, find out where it breaks and then fix the system to increase resilience. It is about getting out of your comfort zone and into the danger zone - and if it hurts, do it more often.

Aside from fault injection testing, you could monitor whether your business hypotheses hold: are your users using particular features and how are they using them? Also, keep working hard on improving your development, test and release processes.

“Our goal is to do painful things more frequently, breaking all of human nature.”
— Adrian Cockcroft, Technology Fellow, Battery Ventures; formerly Cloud Architect, Netflix

This is at the core of what saved Netflix during the EC2 outage in 2011. The site remained operational because of the Chaos Monkey, a piece of code that kills processes in production randomly requiring new code and environment changes to be strong. The result is Netflix was one of the very few businesses that did not go down during this failure.

Continual Experimentation Learning

Chapter 4

Your Real Life in Dev & Ops

Our Recommendations for Making DevOps Work for YOU!

Is a high-performing DevOps culture really possible for everyone? Yes, yes and yes!

It is a necessity, if the lean manufacturing revolution is to be our model. As W. Edwards Deming, the quality management guru of the ‘80s once said, “Learning is not compulsory… neither is survival.”

Here are a few recommendations from our collective experience that we think will help you along your way:

“It is only through automated testing that we can transform fear into boredom… The only way you can get people productive is to show them there is a safety net underneath them that will catch errors long before they get into production.”
— Eran Messeri, SCM, Google

Chapter 5

Start Small but Start Now!

A Good Place to Start - Application Performance Management

If all the recommendations in Section Four, Your Real Life, seem overwhelming, a great place to start is application performance monitoring.

It is an important capability for DevOps and one that can provide benefits right away. In fact, a recent survey of 1400 organizations ranked Performance Monitoring and Performance Testing as critical for DevOps.

Application performance management tools can provide insight into performance throughout the development lifecycle. It can also expedite collaboration between development and operations with each considering the same performance metrics from early on in the development lifecycle.

Performance monitoring shouldn’t start in production environment. Tracking performance of automated tests, together with analytics to identify outliers and performance problems, helps to keep performance problems from making it into production.

Nordstrom: An Application Performance Management Case Study

One organization that has integrated performance monitoring into its DevOps approach is Nordstrom. Nordstrom.com has had 30% revenue growth year-over-year and is hosted internally on hundreds of servers.

Nordstrom implemented application performance management tools across the application development lifecycle. These tools helped them design for performance and get features out more quickly.

Before they had the tools, they couldn’t do the level of testing necessary to find issues. The issues would end up in production and then it would take time to identify the root cause and ultimately, result in downtime.

“Before [performance management] tools existed it was like beating our heads against the wall. [With the new performance management tools], identifying issues went from days to hours. Our test cycles went from weeks to days.”
— Gupal Brugalette, Senior Applied Architect for Performance Engineering, Nordstrom

Chapter 6

Some Homework for You

DevOps Reading!

We know we’ve given you a lot to think about. A good to great DevOps transformation, like any major culture change, will involve some hard work. We hope that we have given you a few good steps in the right direction. Now, for some homework. Continual learning is a principle of The Three Ways, after all! Here we have identified some resources to support you on your journey.

Recommend Reading:

Blogs to Follow / Blog Posts to Read

DevOps Tools We Love :-)

Here is a collection of tools that we enjoy using (at Dynatrace) for our internal processes. They foster collaboration amongst our Product Management, Development, IT Operations and Technical Support teams, allow us to build more quality into our products, and support us in establishing better feedback loops.

Recommended Tools From Martin & Wolfgang:

Stop back once in a while, and we’ll gradually update this list with more interesting bits. Top DevOps Tools We Love

Chapter 7

Sources

About the Authors

Gene Kim

Gene Kim Co-Author of The Phoenix Project

Gene Kim is a multiple award-winning entrepreneur, the founder and former CTO of Tripwire and a researcher. He has written three books, including coauthoring The Phoenix Project.

Twitter: @RealGeneKim

Martin Etmajer

Martin Etmajer Senior Technology Strategist, Dynatrace Innovation Lab

Martin has 10+ years of experience as a developer and software architect, as well as maintaining highly available performance cluster environments.

Twitter: @metmajer

Wolfgang Gottesheim

Wolfgang Gottesheim Senior Technology Strategist, Dynatrace Innovation Lab

Wolfgang has over a decade of experience as a software engineer and research assistant in the Java enterprise space.

Twitter: @gottesheim

Take free trial     Download eBook