Big Data

Last-Minute Black Friday Rescue & Cyber Monday Readiness

In order to be ready for Christmas season, online retailers typically bring their shops into shape right before Black Friday. Together with Cyber Monday this is the most important day in the retailer’s year. Stilnest.com (@Stilnest) is a publishing house for designer jewelry, running their online shop on Magento. While the guys at Stilnest did a good job in preparing their environment, the interest in their products and, therefore, the traffic … read more

How to get Visibility into Docker Clusters running Kubernetes

Disclaimer: we hope you’ll enjoy this article! If you want to know more about our hot new capabilities around Kubernetes monitoring with Dynatrace, please make sure to visit our Kubernetes monitoring page. Google officially launched today the final version of their cluster manager project Kubernetes, which treats Docker containers as first-class citizens. In this blog post, I will show you how you can monitor the performance of your Kubernetes cluster using the Dynatrace Docker Monitor … read more

Unlocking Insight – How to Extract User Experience by Complementing Splunk

Splunk is a great Operational Intelligence solution capable of processing, searching and analyzing masses of machine-generated data from a multitude of disparate sources. By complementing it with an APM solution you can deliver insights that provide value beyond the traditional log analytics Splunk is built upon: Operational Intelligence: Let Your Data Drive Your Business In a nutshell, the purpose behind Operational Intelligence is the ability to make well-informed decisions quickly based … read more

Top Performance Problems discussed at the Hadoop and Cassandra Summits

In the last couple of weeks my colleagues and I attended the Hadoop and Cassandra Summits in the San Francisco Bay Area. It was rewarding to talk to so many experienced Big Data technologists in such a short time frame – thanks to our partners DataStax and Hortonworks for hosting these great events! It was also great to see that performance is becoming an important topic in the community at … read more

Eating our own dog food – 2x faster Hadoop MapReduce Jobs

For a while now I have been writing about how to analyze and optimize Hadoop jobs beyond just tweaking MapReduce options. The other day I took a look at some of our Outage Analyzer Hadoop jobs and put words into action. A simple analysis of the Outage Analyzer jobs with Compuware APM 5.5 identified three hotspots and two potential Hadoop problems in one of our biggest jobs. It took the … read more

So What? – Monitoring Hadoop beyond Ganglia

Over the last couple of months I have been talking to more and more customers who are either bringing their Hadoop clusters into production or that have already done so and are now getting serious about operations. This leads to some interesting discussions about how to monitor Hadoop properly and one thing pops up quite often: Do they need anything beyond Ganglia? If yes, what should they do beyond it? The Basics … read more

Top 8 Application Performance Landmines

Update October 2014: Follow the best practices discussed in this blog by downloading Dynatrace Free Trial. We have been blogging about the same problems and problem patterns we see while working with our customers over the past few of years. There have always been the classic application performance landmines in the areas of inefficient database access, misconfigured frameworks, excessive memory usage, bloated web pages and not following common web performance … read more

Speeding up a Pig+HBase MapReduce job by a factor of 15

The other day I ran a Pig script. Nothing fancy; I loaded some data into HBase and then ran a second Pig job to do some aggregations. I knew the data loading would take some time as it was multiple GB of data, but I expected the second aggregation job to run much faster. It ran for over 15 hours and was not done at that time. This was too … read more

How I Identified a MongoDB Performance Anti Pattern in 5 Minutes

The other day I was looking at a web application that was using MongoDB as its central database. We were analyzing the application for potential performance problems and inside 5 minutes I detected what I must consider to be a MongoDB anti pattern and had a 40% impact on response time. The funny thing: It was a Java best practice that triggered it! Analyzing the Application The first thing I … read more

Website’s Vulnerability to Third-Party Services Exposed

While preparing for a presentation at the Internet Summit in Raleigh this week, I was able to look at some unique data from a new free online service we recently launched called Outage Analyzer. Outage Analyzer showcases the value of “Big Data” performance analytics delivering answers instead of just more data with real-time visualizations and alerts of outages in third-party web services. Outage Analyzer harnesses the collective intelligence of the Compuware … read more