Believing in Numbers

26.8.2010 | 4 minutes reading time

What follows is in my view a typical scenario of one of the common curses of the performance tester. A new version of a test object finds its way into my hands. After some twiddling with it to get it running, I do a first quick test run. Somebody – development, architecture, project management, whoever – gets wind of it and is curious about the results. Defensively, I try a delaying tactic: “It’s the very first run. The test environment wasn’t completely built up. We have to do more tests. We must at first confirm the numbers before it makes any sense to discuss them.” My counterpart is getting more inquisitive. He insists to be aware of all of my objections and that he just wants to satisfy his curiosity. I finally give in and tell the numbers. Shortly thereafter, I regret it deeply. Like an avalanche, my little piece of information has gathered a gigantic mass in no time and like a boomerang it is coming full circle, heading straight back at me.

What happens in cases such as this is often the result of an excessive believe in numbers. When something has been tested and quantified, there is a tendency to trust and rely on the results regardless of the circumstances. A first quick measurement point is frequently discussed and given weight as if a complete and elaborate test series has been conducted. If the numbers are bad, this sometimes leads to hectic and strange activities like escalation, crisis meetings and maybe even management action. If they are unexpectedly good, a premature all-clear may be given.

Even the performance tester himself may be prone to this type of mistake. I can speak out of my own experience here… Either because he wishes for or expects a certain result or simply because of routine he might be seduced to accept his own numbers as valid too quickly. Application systems and their configuration are often very complex, so it is easy to overlook or forget little changes with possibly large effects on the performance test’s results.

The unshakable belief in the validity of numeric data can lead to many other inappropriate follow-ups. There is often a tendency to overanalyze a single data point, e.g. projecting possible future performance improvements or extrapolating the effects on a system n-times the size of the test system and so on. If later tests don’t confirm these projections, sometimes a frantic search for an assumed hidden cause of the “wrong” numbers is started, instead of realizing that incomparable things were compared in the first place. Also sometimes earlier numbers – correctly reported and documented – are rediscovered, but their context forgotten, resulting in similar wrong interpretations, conclusions and activities.

Most of the time it is very hard to argue against this firm belief in numbers, to state that other tests are necessary to confirm them and that rather large variations are typical. Performance measurements as part of QA in software development are definitely not an exact science. As “getting things done” is normally the topmost priority, the preconditions for a controlled experiment as in the academic world are rarely fulfilled. Yet the results of such experiments are often handled as if they are fulfilled.

As a performance tester, one has to be aware of the possible ramifications of the reported test results. Testing – especially performance testing – is done to replace uncertainty with knowledge, but careless handling of test results can easily lead to the opposite outcome. In my opinion, the foremost rule should be to only report data that one understands, can stick by and defend. One should always be aware of the reliability, scope, relevance and comparability of the numbers reported and should document these characteristics and attach them to the report. It is often a good idea to explicitly remark if a measurement is just a statement about one particular test-setup and not suitable for further projections.

If a misinterpretation of test results occurs, one should act swiftly, clearly and forcefully to present the tester’s view of the numbers and their context in order to contain the possible damage. Numbers, once reported, tend to acquire a life of their own. Like a genie, it’s practically impossible to get them back into the bottle. Therefore, one should be very careful what to report and when to report.

One final advice: As a performance tester, always remain skeptical about your own work and your own instincts. Other people should not blindly believe in test results and numbers and neither should the tester.

Was this post helpful?

Blog author

Raymond Georg Snatzke

Do you still have questions? Just send me a message.

fromRaymond Georg Snatzke

codecentric at the European Go Congress 2017

The European Go Congress (EGC) is by far the largest Go event in Europe. It has been held yearly since 1957, usually in a different country every year. In 2017, the EGC came to Germany once more, organized by the DGoB, the German Go Federation . The...

Community
AI
Game programming

29.8.2017 | 6 minutes reading time

Raymond Georg Snatzke

codecentric go challenge 2016

This Sunday, August 28th, the third codecentric go challenge is going to start. The challenge – organized by Prof. Ingo Althöfer of the University of Jena, Germany, and sponsored by codecentric – is a best-of-five match of a strong computer go program...

Game programming

27.8.2016 | 11 minutes reading time

Raymond Georg Snatzke

codecentric go challenge 2015

This Saturday, October 3rd, marks the start of the second installment of the codecentric go challenge. The challenge – organized by Prof. Ingo Althöfer of the University of Jena, Germany, and sponsored by codecentric – is a best-of-five match of a strong...

AI
Game programming

1.10.2015 | 6 minutes reading time

Raymond Georg Snatzke

What you have to deal with when you work with AppDynamics – or other APM...

For many years now I have been working with Application Performance Management (APM) tools in the Java realm. Compared to other performance analyzing tools as for example profilers, APM tools are monitoring as well as analyzing tools. They provide a ...

17.5.2015 | 6 minutes reading time

Raymond Georg Snatzke

codecentric go challenge 2014: Final Interviews

The codecentric go challenge 2014 is over. Franz-Josef Dickhut managed to defeat Crazy Stone, one of the two strongest go programs worldwide, in four games with three wins to one. You can replay and download the games at go.codecentric.de . Congratulations...

Game programming
Go

27.11.2014 | 10 minutes reading time

Raymond Georg Snatzke

codecentric go challenge 2014: Interviews with Franz-Josef Dickhut and...

This Saturday, October 4th, at 4 pm, the first game of the codecentric go challenge 2014 will be started. We will document the finished game – which will be played on the KGS Go Server – at http://go.codecentric.de . Before the start of the first game...

Game programming
AI

30.9.2014 | 6 minutes reading time

Raymond Georg Snatzke

codecentric challenge 2014

Man vs. Machine This fall will see a first in the realm of computer go – a match between a go program and a top European go player on even terms, i.e. without a handicap. And codecentric will be the sponsor of this event. So this will not be your typical...

Game programming
AI

11.9.2014 | 4 minutes reading time

Raymond Georg Snatzke

From a Mathematician’s Point of View: JMeter – Beloved Crap Tool

My company, codecentric AG, has a strong disposition towards open source tools and solutions. Therefore, it is quite natural that for me as a Java performance specialist, the load testing tool of choice is Apache’s JMeter . Let me just state something...

Open Source
APM

28.3.2013 | 5 minutes reading time

Raymond Georg Snatzke

From a Mathematician’s Point of View – Load and Performance Issues in ...

In between big news like the Irish financial crisis and the conflict on the Korean peninsula, two little news from the realm of the Bavarian school system went mostly unnoticed by the public. “Computer-Chaos an den Berufsschulen” from November 16th ...

29.11.2010 | 4 minutes reading time

Raymond Georg Snatzke

From a Mathematician’s Point of View – The Eyes of the Tester

My daily business is the performance of IT systems, java applications in particular. Performance testing and optimizing is done normally to assure compliance to the non functional requirements set for an application. Working on these topics day-in and...

Testing
APM

28.10.2010 | 4 minutes reading time

Raymond Georg Snatzke

From a Mathematician’s Point of View – Career Changers

Nearly all of my colleagues at codecentric are either IT scientists or trained IT specialists. Despite the still severe shortage of skilled IT people in general, as a mathematician and career changer I remain an exception in our company. I am an exception...

Agile
Testing
APM
Frontend
Training
Search
Spring

8.1.2009 | 3 minutes reading time

Raymond Georg Snatzke

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Hexagonal Architecture is just an island

Imagine an island called "Alistair Island." This island is a vibrant place with houses, fertile soil, and a well-coordinated community of residents who live by well-defined routines. Every activity on the island has significance and serves a specific...

Software architecture
Testing
Software development

22.1.2025 | 10 [Missing String "readingTime"]

Danny Steinbrecher

Charge your APIs Volume 33 - Definition-Based API Mocking, Simulation,...

Key TakeawaysThis article is the third and last one in a three-part series about definition-based API mocking, simulation, and testing with Microcks (make sure you have read the first and second article)The previous articles focused on (i) Microcks’ ...

Testing
API

23.10.2024 | 11 [Missing String "readingTime"]

Dr. Florian Rademacher

Charge your APIs Volume 32 - Definition-Based API Mocking, Simulation,...

Key TakeawaysThis article is the second one in a three-part series about definition-based API mocking, simulation, and testing with Microcks (make sure you have read the first article)While the previous article concentrated on Microcks’ architecture,...

API
Testing

16.10.2024 | 11 [Missing String "readingTime"]

Dr. Florian Rademacher

Charge your APIs Volume 31 - Definition-Based API Mocking, Simulation,...

Key TakeawaysAPI mocking used, e.g., for integration testing, is challenging as it assumes conformance to mocked API functionality, which can incur significant costs as mock complexity increases with API complexityDefinition-based API mocking can reduce...

API
Testing

9.10.2024 | 9 [Missing String "readingTime"]

Dr. Florian Rademacher

Playwright tests and API Mocking

Problem definition Playwright tests can sometimes depend on external services such as APIs, which might happen to be unavailable at times. In this case there are several options for executing these tests adequately, as described below. Actually call ...

Testing

10.5.2024 | 4 [Missing String "readingTime"]

Ege Inanc

Charge your APIs Volume 25: Contract Testing

I feel the way we do integration testing is sort of like setting your house on fire to test your smoke alarm. It is excessive, tiresome and way too costly. This is not a quote from myself. I typically don't come up with such good ideas when I need....

Testing
Software development
API

2.4.2024 | 11 [Missing String "readingTime"]

Pasquale Brunelli

A/B Testing: Tool support and testing GrowthBook

In the previous blog post we introduced some general concepts of A/B testing: we explored the main aspects, defined test types and explained the most common statistical methods. Now we want to explore the areas in which A/B testing tools can provide...

Testing
Python
Data
UX/UI
Analysis
JavaScript

18.3.2024 | 20 [Missing String "readingTime"]

Francesca Diana

A/B Testing: An introduction

This blog series aims to aid teams who are contemplating adding A/B testing to their toolkit but are unsure of which tool to use. In addition to helping with tool selection, the series also provides the entire team with a consistent initial understanding...

Testing
Data
UX/UI
Analysis

6.2.2024 | 29 [Missing String "readingTime"]

Francesca Diana

Count your queries! Repository integration tests with Hibernate Statistics

If you are using Spring Data JPA as a data access framework, Hibernate is almost certainly hiding under the hood. And although this setup takes a lot of work off your hands by doing a lot of awesome things, the final outcome should better be checked....

Java
Testing
Spring
Database

7.8.2023 | 6 [Missing String "readingTime"]

Kevin Peters

Charge your APIs Volume 6: Perfecting Your APIOps - Harnessing the Power...

Our journey through the expansive landscape of API Operations (APIOps) has led us through various territories. We've delved into Continuous Integration and Deployment, ensuring seamless transitions from coding to production-ready APIs with minimal friction...

API
Testing
GitHub

14.6.2023 | 2 [Missing String "readingTime"]

Daniel Kocot

Charge your APIs Volume 4: Streamlining API Operations with Continuous...

API operations refer to the maintenance and management of APIs (Application Programming Interfaces) throughout their lifecycle. This includes everything from design and development to testing, deployment, and ongoing maintenance. Continuous Integration...

Testing
API

31.5.2023 | 6 [Missing String "readingTime"]

Daniel Kocot

Charge your APIs Volume 3: Optimizing API Testing with Contract Testing

API testing is a crucial part of the development process that ensures the functionality, reliability, and performance of the API. Testing helps to identify and resolve errors early on, which translates into reduced development costs and improved customer...

API
Testing

24.5.2023 | 6 [Missing String "readingTime"]

Daniel Kocot

JavaScript test performance: getting the best out of Jest

In recent years Jest has established itself as the go-to testing framework for JavaScript and TypeScript development. It provides a complete toolkit (test runner, assertion library, mocking library, code coverage and more) out of the box, and requires...

Node.js
JavaScript
APM
Testing

12.11.2021 | 7 [Missing String "readingTime"]

APIOps – Automated processes for even better APIs

In my German Softwerker article (Vol. 14, p. 90) , I already dealt with the continuous design and development cycle of APIs. This was mainly about basic assumptions and tooling, including the introduction of API gateways or platforms into existing development...

DevOps
Cloud
Testing
API

28.1.2021 | 8 [Missing String "readingTime"]

Daniel Kocot

The how of monitoring your services

Lately, there has been a lot of discussion about SLAs, SLOs and SLIs. As this article states, it is hard to define the correct SLOs and SLIs. This discussion is about what part of your services you want to monitor. But it is also difficult to measure...

Infrastructure
APM

17.11.2020 | 5 [Missing String "readingTime"]

Green test pyramids with Cypress – UI testing of the future

Cypress is a young open-source testing framework for web-based user interfaces (UI). Cypress tests are written in JavaScript and, as is also common with Selenium-based technologies, are based on the Document Object Model (DOM) of the HTML of a web application...

Frontend
JavaScript
Testing

29.9.2020 | 8 [Missing String "readingTime"]

Performance optimization of a GraphQL app with Instana

“Works on my machine.” Okay, but we know quite well software never behaves the same when running on different machines… We knew that, but ran into unexpected performance issues when going live with a simple app. Here’s how we fixed the problem and improved...

Cloud
APM
API
JavaScript

21.7.2020 | 8 [Missing String "readingTime"]

Detox vs. Appium – a comparison of React Native testing frameworks

Currently, there are especially two end-to-end testing frameworks which are interesting for React Native developers: Detox and Appium. During my internship at codecentric, I analyzed and compared both frameworks in detail, writing tests with both frameworks...

React
Testing

16.7.2020 | 5 [Missing String "readingTime"]

Anja Bender

Code-based remote API mocking with Typescript and Webpack

IntroductionRemember how you have to set up a whole bunch of infrastructure locally just to be able to independently work on the frontend part of your Jamstack project? If you’re tired of doing so, maybe this short write-up on how to replace the infrastructure...

Frontend
Testing
JavaScript

13.7.2020 | 5 [Missing String "readingTime"]

Implementing and testing an Angular feature flag directive

IntroductionAn important goal of agile software development is to shorten the user feedback loop. To achieve that you want to release your changes as often as possible. This also includes releasing prototypes, e.g. to a smaller audience, gathering customer...

Frontend
Angular
JavaScript
Testing
Webdevelopment

18.5.2020 | 6 [Missing String "readingTime"]

Believing in Numbers

Was this post helpful?

Blog author

More articles

codecentric at the European Go Congress 2017

codecentric go challenge 2016

codecentric go challenge 2015

What you have to deal with when you work with AppDynamics – or other APM...

codecentric go challenge 2014: Final Interviews

codecentric go challenge 2014: Interviews with Franz-Josef Dickhut and...

codecentric challenge 2014

From a Mathematician’s Point of View: JMeter – Beloved Crap Tool

From a Mathematician’s Point of View – Load and Performance Issues in ...

From a Mathematician’s Point of View – The Eyes of the Tester

From a Mathematician’s Point of View – Career Changers

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

More articles in this subject area

Hexagonal Architecture is just an island

Charge your APIs Volume 33 - Definition-Based API Mocking, Simulation,...

Charge your APIs Volume 32 - Definition-Based API Mocking, Simulation,...

Charge your APIs Volume 31 - Definition-Based API Mocking, Simulation,...

Playwright tests and API Mocking

Charge your APIs Volume 25: Contract Testing

A/B Testing: Tool support and testing GrowthBook

A/B Testing: An introduction

Count your queries! Repository integration tests with Hibernate Statistics

Charge your APIs Volume 6: Perfecting Your APIOps - Harnessing the Power...

Charge your APIs Volume 4: Streamlining API Operations with Continuous...

Charge your APIs Volume 3: Optimizing API Testing with Contract Testing

JavaScript test performance: getting the best out of Jest

APIOps – Automated processes for even better APIs

The how of monitoring your services

Green test pyramids with Cypress – UI testing of the future

Performance optimization of a GraphQL app with Instana

Detox vs. Appium – a comparison of React Native testing frameworks

Code-based remote API mocking with Typescript and Webpack

Implementing and testing an Angular feature flag directive