Why every redesign breaks your Playwright project — and how three layers prevent it

3.7.2026 | 9 minutes reading time

TL;DR: We show what a structural separation of UI selectors and business logic can look like in Playwright by adapting the proven Robot Pattern into the Layered Robot Pattern. With it, browser automation no longer has to fear UI changes.

Why an Android pattern is relevant for Playwright
Getting started quickly with Playwright and AI
Growing efficiency in day-to-day projects
The Layered Robot Pattern in detail
Distinction from BDD
Benefits and limitations
Conclusion

New design, broken automation

We have used the Robot Pattern over the past two years in two fundamentally different projects: once for regression testing an Android app built with Jetpack Compose, and once for server-side control of a web application with Playwright on AWS Lambda. What surprised us was how directly the pattern transfers from one platform to the other. The timing matters: AI coding agents are increasingly taking over the creation and maintenance of browser automation. Without clear structural guidelines, they produce arbitrarily structured code that is difficult to control. A consistently followed pattern gives the agent a binding contract, and the more rigorously the pattern is maintained, the more reliable the AI assistance becomes. That is exactly what we demonstrate in this article.

A concrete example: a web portal for booking a badminton court at a sports hall is to be automated using Playwright. Login, court selection, time slot selection, booking confirmation — that is four pages with perhaps twenty selectors. What happens to an existing automation when the sports hall redesigns its portal? Alongside new colors, new IDs are assigned, or perhaps an entirely different UI framework is adopted. The Playwright implementation no longer works. Not because the booking process changed — but because the interface evolved.

Why an Android pattern is relevant for Playwright

Jake Wharton introduced the Robot Pattern in 2016 at Kotlin Night in San Francisco [1]. His talk "Testing Robots" addressed a concrete problem: UI tests with Google's Espresso framework that quickly became unmaintainable without structural discipline.

The core idea is simple: separate the what from the how. A Robot encapsulates all UI-specific interactions of a page — selectors, clicks, wait times — behind domain-named methods. The calling code — whether test case or automation workflow — sees only these methods and never a selector.

This sounds like Martin Fowler's Page Object Model [2], and indeed the two are related. The difference lies in ambition: a Page Object primarily encapsulates selectors and exposes them as properties.

A Robot, on the other hand, encapsulates complete actions and ensures that the calling code reads like a domain description.

Playwright explicitly documents the Page Object Model as a recommended pattern [4], but in practice the implementation often stops halfway: selectors are encapsulated but exposed as public Locator properties, rather than hidden behind domain-named methods.
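The difference can be sketched in a few lines. This is a minimal illustration, not code from our projects: `ElementLike` is a hypothetical stand-in for the small part of Playwright's Locator that is used here, so the sketch runs without a browser, and `LoginPageObject`, `LoginRobotSketch`, `recordingElement`, and `demoLogin` are names invented for the example.

```typescript
// Hypothetical stand-in for the small part of Playwright's Locator used here.
interface ElementLike {
  fill(value: string): Promise<void>;
  click(): Promise<void>;
}

// Half-way Page Object: selectors are encapsulated, but callers still drive them.
interface LoginPageObject {
  emailInput: ElementLike;
  passwordInput: ElementLike;
  submitButton: ElementLike;
}

// Robot: the same elements, hidden behind one domain-named method.
class LoginRobotSketch {
  private readonly elements: LoginPageObject;

  constructor(elements: LoginPageObject) {
    this.elements = elements;
  }

  async login(email: string, password: string): Promise<void> {
    await this.elements.emailInput.fill(email);
    await this.elements.passwordInput.fill(password);
    await this.elements.submitButton.click();
  }
}

// Test double that records every interaction instead of touching a browser.
function recordingElement(name: string, log: string[]): ElementLike {
  return {
    fill: async (value) => { log.push(`fill ${name}=${value}`); },
    click: async () => { log.push(`click ${name}`); },
  };
}

async function demoLogin(): Promise<string[]> {
  const log: string[] = [];
  const robot = new LoginRobotSketch({
    emailInput: recordingElement('email', log),
    passwordInput: recordingElement('password', log),
    submitButton: recordingElement('submit', log),
  });
  // The caller states intent only; no element is visible at this level.
  await robot.login('player@example.com', 'secret');
  return log;
}
```

With the plain Page Object, every caller would repeat the three element interactions itself. With the Robot, a caller only writes `login(...)`, and the order of the fills and the click is decided in one place.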

Our answer is the Layered Robot Pattern: a three-layer architecture that combines the selector isolation of the Page Object Model with the action encapsulation of the Robot Pattern, adding an explicit workflow layer.

Getting started quickly with Playwright and AI

Building a Playwright automation is no longer a multi-day project. With playwright codegen [3], a browser recorder automatically generates type-safe code with robust, semantic selectors, so no manual selector hunting is required. Playwright MCP [5] goes one step further: an AI agent navigates through the application itself, inspects the DOM, and directly generates a complete PageModel class. A coding agent then takes the generated code and creates a full Robot class following project conventions, including correct inheritance and project structure. What used to require manual refactoring now takes only seconds.

Growing efficiency in day-to-day projects

An effect that only becomes apparent over time: the more Robots exist in the codebase, the more effective the AI assistance becomes. With enough references, the agent recognizes the pattern from context and reliably generates new PageModels, Robots, and even domain workflows. The Layered Robot Pattern acts as a structural contract: it gives the agent clear rules for generated code. Without this pattern, an agent would produce arbitrarily structured Playwright code.

The Layered Robot Pattern in detail

Regardless of the platform, our approach consists of three clearly separated layers. We illustrate them using the badminton court booking example mentioned above.

Figure: Layered Robot Pattern, three-layer architecture

The BaseRobot: common foundation for all Robots

All concrete Robots inherit from a common base class. The BaseRobot abstracts Playwright's API: no concrete Robot calls page.fill() or page.click() directly; each uses the base class methods instead. This has two advantages. First, cross-cutting concerns such as logging or waiting automatically apply to all Robots. Second, the behavior of all interactions can be changed in exactly one place.

The following is a simplified view:

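import type { Locator, Page } from '@playwright/test';

export abstract class BaseRobot {
  // Assumption: console stands in for the project's logger in this simplified view.
  protected readonly logger: Pick<Console, 'info'> = console;

  // The concrete Robots pass the Playwright page and an artifact directory to super().
  constructor(protected readonly browserPage: Page, protected readonly artifactDir: string) {}

  // Every interaction goes through these wrappers, so shared behavior lives in one place.
  protected async click(target: Locator): Promise<void> {
    await target.click();
  }

  protected async fill(target: Locator, value: string): Promise<void> {
    await target.fill(value);
  }

  protected async waitForSelector(target: Locator): Promise<void> {
    await target.waitFor({ state: 'visible' });
  }
}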
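To make the "exactly one place" argument concrete, here is a minimal sketch of a cross-cutting concern added in the base class. It assumes we want every click to wait for visibility first and to be journaled afterwards. `LocatorLike` is a hypothetical stand-in for Playwright's Locator, the extra `name` parameter and the `journal` array exist only for this illustration, and `LoggingBaseRobot` and `CourtRobotSketch` are invented names.

```typescript
// Hypothetical stand-in for the Locator methods used by the base class.
interface LocatorLike {
  waitFor(options: { state: 'visible' }): Promise<void>;
  click(): Promise<void>;
}

abstract class LoggingBaseRobot {
  // Collected in memory for the sketch; a real project would use its logger.
  readonly journal: string[] = [];

  // Changed once, applied to every Robot: wait for visibility, click, then log.
  protected async click(name: string, target: LocatorLike): Promise<void> {
    await target.waitFor({ state: 'visible' });
    await target.click();
    this.journal.push(`clicked ${name}`);
  }
}

class CourtRobotSketch extends LoggingBaseRobot {
  private readonly courtSelect: LocatorLike;
  private readonly confirmButton: LocatorLike;

  constructor(courtSelect: LocatorLike, confirmButton: LocatorLike) {
    super();
    this.courtSelect = courtSelect;
    this.confirmButton = confirmButton;
  }

  // The concrete Robot never calls the element directly.
  async confirmCourt(): Promise<void> {
    await this.click('courtSelect', this.courtSelect);
    await this.click('confirmButton', this.confirmButton);
  }
}

// Fake element that records the order of low-level calls.
function fakeLocator(calls: string[]): LocatorLike {
  return {
    waitFor: async () => { calls.push('waitFor'); },
    click: async () => { calls.push('click'); },
  };
}
```

The concrete Robot contains no waiting and no logging code, yet both happen for each of its clicks. Adding, for example, a retry would again touch only the base class.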

The concrete Robot implementation: one class per page

The Robot encapsulates the domain actions of a page. It knows its PageModel; the workflow only sees domain-named methods — never a selector.

import type { Page } from '@playwright/test';
// BaseRobot and BookingPage are the project classes described in this article.

export class BookingRobot extends BaseRobot {
  // The PageModel stays private: selectors never leave the Robot.
  private readonly page: BookingPage;

  constructor(page: Page, artifactDir: string) {
    super(page, artifactDir);
    this.page = new BookingPage(page);
  }

  // Domain action: choose a court, however the portal renders the choice.
  async selectCourt(court: string): Promise<void> {
    this.logger.info('Selecting court: %s', court);
    await this.click(this.page.courtSelect);
    await this.click(this.page.courtOption(court));
  }

  // Domain action: enter the date, pick the time slot, and confirm.
  async bookSlot(date: string, time: string): Promise<void> {
    this.logger.info('Booking slot: %s %s', date, time);
    await this.fill(this.page.dateInput, date);
    await this.click(this.page.timeSlot(time));
    await this.click(this.page.confirmButton);
  }
}

What happens behind the scenes — which dropdown variant the court uses, whether the date is entered via calendar or text field — is an implementation detail that can change at any time without affecting the workflow.

The PageModel: separating selectors from behavior

The PageModel is a separate class per page that exclusively holds selectors: Locator fields and, for parameterized elements, small locator factories. It contains no actions and no logic. Only the Robot accesses this class; from the outside, it is invisible.

import type { Locator } from '@playwright/test';

// BasePageModel (not shown) is assumed to store the Playwright Page as this.page.
class BookingPage extends BasePageModel {
  readonly courtSelect: Locator = this.page.getByLabel('Court');
  // Parameterized elements are exposed as locator factories.
  readonly courtOption = (name: string): Locator =>
    this.page.getByRole('option', { name });
  readonly dateInput: Locator = this.page.getByLabel('Date');
  readonly timeSlot = (time: string): Locator =>
    this.page.getByRole('button', { name: time });
  readonly confirmButton: Locator = this.page.getByRole('button', { name: 'Book' });
}

This separation has a concrete advantage: if the target portal changes a form — a label is renamed, a button gets a new role — exactly one file is affected: only the PageModel. The Robot and the workflow remain untouched.
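A small sketch shows what such a redesign means in code. It is an illustration under assumptions, not our production code: `Clickable` is a hypothetical stand-in for a Locator, the selector strings are opaque labels invented for the example, and the two PageModel versions are written as factory functions to keep the sketch short.

```typescript
// Hypothetical stand-in for a Locator; only click() is needed here.
interface Clickable {
  click(): Promise<void>;
}

// The contract the Robot depends on; both portal versions fulfil it.
interface BookingPageModel {
  courtSelect: Clickable;
  confirmButton: Clickable;
}

// Fake lookup that records which (invented) selector was clicked.
function bySelector(selector: string, clicked: string[]): Clickable {
  return { click: async () => { clicked.push(selector); } };
}

// PageModel before the redesign.
function bookingPageV1(clicked: string[]): BookingPageModel {
  return {
    courtSelect: bySelector('label "Court"', clicked),
    confirmButton: bySelector('button "Book"', clicked),
  };
}

// PageModel after the redesign: new label and new button text, nothing else.
function bookingPageV2(clicked: string[]): BookingPageModel {
  return {
    courtSelect: bySelector('label "Playing field"', clicked),
    confirmButton: bySelector('button "Reserve now"', clicked),
  };
}

// Robot logic: identical for both versions of the portal.
async function confirmCourt(model: BookingPageModel): Promise<void> {
  await model.courtSelect.click();
  await model.confirmButton.click();
}
```

Calling `confirmCourt(bookingPageV1(log))` and `confirmCourt(bookingPageV2(log))` performs the same two domain steps against different selectors. The redesign is absorbed entirely by the second factory.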

The Workflow: business logic as code

At the top layer sits the workflow. It orchestrates the Robots and reads like a domain description of the booking process:

import type { Page } from '@playwright/test';

export class BadmintonBookingWorkflow {
  constructor(private readonly config: BadmintonBookingConfig) {}

  // Reads top to bottom like the booking process itself; no selector appears here.
  async execute(page: Page, artifactDir: string): Promise<void> {
    const { email, password, court, date, time } = this.config;

    const loginRobot = new LoginRobot(page, artifactDir);
    await loginRobot.login(email, password);

    const homeRobot = new HomeRobot(page, artifactDir);
    await homeRobot.navigateToBooking();

    const bookingRobot = new BookingRobot(page, artifactDir);
    await bookingRobot.selectCourt(court);
    await bookingRobot.bookSlot(date, time);

    const confirmRobot = new ConfirmationRobot(page, artifactDir);
    await confirmRobot.verifyBookingSuccess();
  }
}

Anyone reading this code understands the booking process — even without ever having seen the sports hall's portal.

Distinction from BDD

Behavior-Driven Development with Gherkin and Cucumber pursues a similar goal: making domain logic readable. The key difference lies in the overhead. BDD requires feature files, step definitions, and glue code, an additional layer in which the mapping between natural language and code relies on Cucumber Expressions or regular expressions. This indirection complicates debugging and makes refactoring fragile: if a step is reworded in the feature file, it no longer matches its step definition, and the IDE gives no warning.

The Robot Pattern works with pure TypeScript classes. The workflow code is simultaneously documentation and implementation — no separate format that must be kept in sync.

BDD has its place when product owners and testers collaborate on specifications and natural language is the common denominator. For technical automation where developers are the primary audience, the Robot Pattern is more lightweight, type-safe, and maintainable.

Anyone who still prefers BDD can always build it as a layer on top of the Robots — the clean separation of the Robots makes exactly that easy.
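How thin such a layer can be, and where its fragility comes from, is sketched below. The step registry is hand-rolled for illustration and is deliberately not Cucumber's API. `defineStep`, `runStep`, and `BookingRobotStub` are hypothetical names, and the stub's methods are synchronous to keep the example short, whereas real Robot methods are async.

```typescript
// Minimal hand-rolled step registry; NOT Cucumber's API, only an illustration.
type StepHandler = (...args: string[]) => void;
const steps: Array<{ pattern: RegExp; handler: StepHandler }> = [];

function defineStep(pattern: RegExp, handler: StepHandler): void {
  steps.push({ pattern, handler });
}

// Finds the first matching step and passes the captured groups to it.
function runStep(text: string): void {
  for (const { pattern, handler } of steps) {
    const match = pattern.exec(text);
    if (match) {
      handler(...match.slice(1));
      return;
    }
  }
  // A reworded feature line is only detected here, at runtime.
  throw new Error(`No step definition matches: "${text}"`);
}

// Hypothetical Robot reduced to recording its domain actions.
class BookingRobotStub {
  readonly actions: string[] = [];

  selectCourt(court: string): void {
    this.actions.push(`selectCourt(${court})`);
  }

  bookSlot(date: string, time: string): void {
    this.actions.push(`bookSlot(${date}, ${time})`);
  }
}

const bookingRobot = new BookingRobotStub();

// Glue code: each step delegates to exactly one Robot method.
defineStep(/^I select court "(.+)"$/, (court) => bookingRobot.selectCourt(court));
defineStep(/^I book the slot on (\S+) at (\S+)$/, (date, time) =>
  bookingRobot.bookSlot(date, time),
);

runStep('I select court "Court 3"');
runStep('I book the slot on 2026-07-10 at 18:00');
```

Because the Robots already expose domain-named methods, the glue code is a one-line delegation per step. The weak point is the text matching: `runStep('I choose court "Court 3"')` compiles without complaint and fails only when it is executed.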

Benefits and limitations

The most obvious benefit is maintainability through isolation: UI changes only affect the relevant PageModel (BookingPage) and possibly its associated Robot (BookingRobot). The actual workflow and other Robots remain untouched.

An effect that sets in almost unnoticed is readability. The workflow code reads like a domain description. New team members understand the flow without needing to know the UI details.

At the same time, inheritance from BaseRobot ensures consistency across all Robots. Logging, wait times, error handling — all of this is defined once and reused.

On the other hand, there is the initial complexity. For a flow with three clicks, the Robot Pattern is likely overengineering. The abstraction only pays off when the workflow spans more than two or three pages, when multiple workflows visit the same pages, or when the team consists of more than one person. And even with the pattern, the selector dependency remains — the pattern does not eliminate it, it encapsulates it.

Conclusion

The Layered Robot Pattern is not a new framework or library. It is a structural decision that can be summarized in three rules: one PageModel per page — exclusively selectors. One Robot per page — exclusively domain actions, building on the common base class. Workflows call only Robot methods.

These rules originate from the Android world but work in any UI technology. The investment pays off as soon as a workflow spans more than two or three pages; below that, the overhead is rarely justified. But once complexity grows, the clear separation between what and how pays dividends with every UI change, every new team member, and every debugging session.

What helped us most in day-to-day project work: Playwright records videos of every run on demand — and precisely this has accelerated production debugging the most. A failed run can be traced back to the exact moment in the video, without re-execution, without additional logs.
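A minimal sketch of how a run could be wrapped so that a video exists even when the workflow fails. In Playwright, recording is enabled per browser context through the `recordVideo` option, and the file is complete once the context is closed. Everything else here is an assumption for illustration: `runWithVideo` and `RunResult` are hypothetical names, and the `...Like` interfaces are structural stand-ins for the few Playwright members the sketch touches, so that it can be exercised without a browser.

```typescript
// Structural types covering only the Playwright members this sketch touches.
interface VideoLike {
  path(): Promise<string>;
}
interface PageLike {
  video(): VideoLike | null;
}
interface ContextLike<P extends PageLike> {
  newPage(): Promise<P>;
  close(): Promise<void>;
}
interface BrowserLike<P extends PageLike> {
  newContext(options: { recordVideo: { dir: string } }): Promise<ContextLike<P>>;
}

interface RunResult {
  videoPath: string | null;
  error: unknown;
}

// Runs one workflow in a fresh context and reports the video path and any failure.
async function runWithVideo<P extends PageLike>(
  browser: BrowserLike<P>,
  artifactDir: string,
  workflow: (page: P) => Promise<void>,
): Promise<RunResult> {
  // Recording is switched on per context via the recordVideo option.
  const context = await browser.newContext({ recordVideo: { dir: artifactDir } });
  const page = await context.newPage();
  let error: unknown = null;
  try {
    await workflow(page);
  } catch (caught) {
    error = caught; // keep the failure, but still finalize the video
  }
  await context.close(); // the video file is complete once the context is closed
  const video = page.video();
  return { videoPath: video ? await video.path() : null, error };
}
```

In a real project the structural types would simply be replaced by Playwright's own `Browser`, `BrowserContext`, and `Page`, and the workflow argument could be something like `(page) => workflow.execute(page, artifactDir)`. The design choice worth noting is that the error is captured rather than rethrown immediately, so the caller always receives the path of the video that shows the failing moment.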

With today's tools — Playwright MCP and Codegen for selector discovery, AI agents for code generation, headed mode and video recording for debugging and demos — a Robot-based workflow can be built in hours. And the more Robots exist in the project, the more effective the AI assistance becomes: the pattern is the structural contract the agent needs to produce consistent code.

Anyone automating a system via browser — whether for testing or process automation — will find in the Layered Robot Pattern a structure that still works after the next redesign.

References

[1] Jake Wharton, "Testing Robots", Kotlin Night, San Francisco, May 17, 2016 — https://jakewharton.com/testing-robots/

[2] Martin Fowler, "PageObject" — https://martinfowler.com/bliki/PageObject.html

[3] Playwright Documentation, "Codegen" — https://playwright.dev/docs/codegen

[4] Playwright Documentation, "Page Object Model" — https://playwright.dev/docs/pom

[5] Playwright MCP — https://github.com/microsoft/playwright-mcp

Blog authors

Lars Jouon

IT Consultant

Rebecca Jox

Fullstack Developer & IT Consultant
