Where Vibe Coding helps—and where it doesn't: A field report

20.10.2025 | 10 minutes reading time

A robot attempts to build an application from prompts. The application appears very unstable and threatens to crash.

Vibe Coding is a programming approach that delegates virtually every task involved in working with source code—from understanding to creation to modification—to a GenAI, placing almost complete trust in the output of these kinds of AI. Based on a recent architecture and code review of a Space-as-a-Service platform (SPaaS platform) that was built from scratch with Vibe Coding, we provide an assessment of the suitability of this programming approach for the implementation of complex software products.

Spoiler: Without context-based structuring and orchestration of the GenAI used, the quality deficits of the generated code outweigh the supposed productivity gains.

The lure of speed—Anyone can develop now(?)

The promise of Vibe Coding is huge: with simple instructions in natural language (prompting), entire applications can be created in no time at all and without any knowledge of software development. This is made possible by increasingly powerful GenAIs based on Large Language Models such as Claude, Gemini, or GPT. With manageable costs and a growing selection of agents, tools, and platforms, the question arises: Can anyone now implement productive software, and will developers have to look for new jobs sooner or later?

We were confronted with this question when a customer recently tasked us with reviewing a prototype of an SPaaS platform created entirely with Vibe Coding. The platform was developed within a few weeks without prior programming experience, using a small number of prompts and a specialized Vibe Coding portal. In addition to its core functionality of offering and booking locations, the platform integrates other complex functions such as user management, chat, and payment in a modern UI. Despite its prototypical nature and relatively small codebase (approx. 50,000 lines of TypeScript), initial tests showed that the application is fundamentally executable.

The aim of the commissioned architecture and code review was to assess whether the source code generated by GenAI represents a high-quality basis for bringing the prototype of the SPaaS platform to market maturity in the medium term.

In the following, we report on our findings from the review and discuss the opportunities and risks of implementing software products with Vibe Coding, which persist despite the large number of specialized providers.

The analysis—What our review revealed

For the review of the AI-generated SPaaS platform, we drew on established architectural evaluation methods and used analysis tools for the Node.js ecosystem on which the application is ultimately based. The following paragraphs describe our findings for each quality deficit identified.

Vendor lock-in: For the review, we downloaded the source code of the SPaaS platform from the Vibe Coding portal and first checked it for possible malicious code. We then attempted to run the application locally. We started with the AI-generated setup documentation, which proved insufficient. After minor adjustments, the platform could be started locally, but registration and login did not work without errors. It also remained unclear how to connect the required database management system to the platform, because the setup documentation lacks the necessary information. Since the Vibe Coding portal operates the platform without errors or additional configuration, there is a significant risk of vendor lock-in with the portal.

Risky dependencies: For the vast majority of security-critical functions, such as user management and payment, the GenAI followed the best practice of integrating with proven external libraries and frameworks. However, some of these were redundant or competing, such as the use of different hash methods for the same purpose, which can lead to runtime problems that are difficult to locate. In other cases, the AI referenced versions of dependencies that are outdated or whose production readiness is unclear. In one particularly serious case, certain user management functions relied on an external dependency that had not undergone any significant further development since 2014. This was only noticed because the version number appeared suspicious (less than 1.0) and we consequently checked the associated GitHub repository. The integration of outdated dependencies and the associated high risk of security vulnerabilities is particularly problematic for security-critical aspects of a web application that processes user sessions and customer data.
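
The version check that exposed the outdated dependency can be automated in a rough form. The following TypeScript sketch is an illustration only, not the tooling used in the review: it assumes the dependencies map of a package.json has already been parsed, and the function name findPreOneZeroDependencies and the sample package names are hypothetical. It flags every dependency whose declared major version is 0, which is a prompt to inspect the upstream repository, not proof that a package is abandoned.

```typescript
// Hypothetical helper: flags dependencies whose declared major version is 0.
// A pre-1.0 version is only a hint to check the upstream repository manually.
type DependencyMap = Record<string, string>;

function findPreOneZeroDependencies(dependencies: DependencyMap): string[] {
  const flagged: string[] = [];
  for (const [packageName, range] of Object.entries(dependencies)) {
    // Skip common range prefixes such as ^, ~, >= or v before reading the major version.
    const match = /^[\^~>=<v\s]*(\d+)\./.exec(range);
    if (match !== null && Number(match[1]) === 0) {
      flagged.push(packageName);
    }
  }
  return flagged.sort();
}

// Sample input with made-up package names (assumption, not from the reviewed code).
const sampleDependencies: DependencyMap = {
  "example-session-lib": "^0.3.1",
  "example-web-framework": "^4.18.2",
  "example-hash-lib": "~0.9.0",
  "example-payment-sdk": "14.2.0",
};

const suspicious = findPreOneZeroDependencies(sampleDependencies);
console.log(suspicious);
```

A check like this only narrows down candidates; the date of the last release and the activity of the repository still have to be verified by a person or by a dedicated audit tool.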

Lack of architecture-related security: While certain security features such as password hashing were successfully coded by the GenAI, we found them completely absent at the architectural level. For example, communication between the frontend and backend, which are essential components of the SPaaS platform's three-tier architecture, is not encrypted. All calls to the backend API by the frontend are made in plain text via HTTP. This means they can be relatively easily intercepted and manipulated, opening the door to man-in-the-middle attacks, session hijacking, and credential theft.
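
One inexpensive safeguard against this class of problem is to have the frontend refuse any backend base URL that is not served over HTTPS. The sketch below is an assumption about how such a guard could look, not code from the reviewed platform: requireHttpsBaseUrl and the example URLs are hypothetical, and the exception for localhost is a design choice for local development. It does not replace configuring TLS on the server.

```typescript
// Hypothetical guard: accept a backend base URL only if it uses HTTPS.
// Plain HTTP is tolerated for localhost to keep local development simple.
function requireHttpsBaseUrl(rawUrl: string): string {
  const url = new URL(rawUrl); // throws a TypeError for malformed input
  const isLocal = url.hostname === "localhost" || url.hostname === "127.0.0.1";
  if (url.protocol !== "https:" && !isLocal) {
    throw new Error(`Insecure API base URL rejected: ${url.origin}`);
  }
  return url.origin;
}

// Example URL is a placeholder.
const apiBase = requireHttpsBaseUrl("https://api.example.com/v1");
console.log(apiBase);
```

Failing fast at startup makes an unencrypted configuration visible immediately instead of letting credentials travel in plain text unnoticed.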

Improvable code quality: We used established static analysis tools to assess the quality of the AI-generated code. Unlike the security risks described above, the deficiencies identified in this process are initially less serious for a prototypical application such as the SPaaS platform under review. Nevertheless, they will become relevant at the latest after successful market validation, when the focus shifts to ensuring medium- to long-term software quality, especially maintainability. Some of the code-related deficiencies may also come into play earlier if they cause runtime errors that affect platform operation in unforeseen ways. The reviewed source code leaves considerable room for improvement: the tools uncovered a large number of code smells, ranging from high cognitive complexity of individual methods to unused or incorrect import statements and incorrectly implemented concurrency. In addition, we encountered strong coupling due to a lack of separation of concerns, a significant amount of unused code, and a division of modules into folders that did not follow standard best practices. Taken together, these findings suggest that the GenAI employed, in conjunction with unstructured Vibe Coding, exhibits weaknesses when it comes to extending the codebase: the integration of new features does not seem to be sufficiently accompanied by quality-oriented refactoring, which includes, for example, purposeful modularization, establishing reusability, and the removal of obsolete code.

Inadequate test coverage: The risk of significantly reduced maintainability is increased by inadequate test coverage. Although the GenAI produced tests during Vibe Coding sessions even without explicit prompting, these tests do not validate business processes such as payment processing or the creation and booking of advertisements. Instead, they focus on the technical connection to the database by issuing INSERT statements and verifying the correct execution of these statements.
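
To make the contrast concrete, the following sketch shows what a test of a business rule, as opposed to a database connectivity check, might look like. Everything in it is hypothetical: the BookingService class, its in-memory storage, and the rule that a location cannot be booked twice for the same date are assumptions made for illustration and are not taken from the reviewed platform.

```typescript
// Hypothetical, in-memory booking service used only to illustrate a business-level test.
class BookingService {
  private readonly bookings = new Set<string>();

  // Books a location for a date (ISO string) and rejects double bookings.
  book(locationId: string, date: string): void {
    const key = `${locationId}@${date}`;
    if (this.bookings.has(key)) {
      throw new Error(`Location ${locationId} is already booked on ${date}`);
    }
    this.bookings.add(key);
  }

  isBooked(locationId: string, date: string): boolean {
    return this.bookings.has(`${locationId}@${date}`);
  }
}

// Business-level check: a second booking for the same slot must fail,
// and the first booking must still be in place afterwards.
function doubleBookingIsRejected(): boolean {
  const service = new BookingService();
  service.book("loft-1", "2025-11-03");
  try {
    service.book("loft-1", "2025-11-03");
    return false; // the rule was not enforced
  } catch (error) {
    return service.isBooked("loft-1", "2025-11-03");
  }
}

console.log(doubleBookingIsRejected());
```

A test of this kind fails when the booking rule is broken, whereas a test that only verifies an INSERT statement keeps passing as long as the database is reachable.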

The bottom line—Consequences of unstructured Vibe Coding

In summary, the use of Vibe Coding in the case of the reviewed SPaaS platform led to a number of quality deficits of varying criticality. Despite the prototype stage, the security issues identified should be resolved in the short term, and in particular before the application is validated on the market. If market validation is successful, vendor lock-in and poor maintainability should be addressed during further development, as both deficits represent technical debt that can lead to significant follow-up costs in the medium term and thus impair competitiveness.

The development of software systems is often compared to building a house. If we stick with this analogy, unstructured Vibe Coding likely results in a house with fragile foundations: the façade may look good and some rooms may be usable, but others may not be, and the entire building may be at risk of collapse after moving in. Initial time savings then turn into a mortgage with unacceptably high interest rates, so that in the worst case the costs of renovation exceed those of rebuilding from scratch. These costs are usually very high for supposedly finished houses and for software applications already in production, and in both cases experts are needed to gradually uncover and fix the problems. Consequently, new job profiles such as “Vibe Coding Cleanup Specialist” are currently emerging.

In conclusion, based on our review of the AI-generated SPaaS platform, we can say that Vibe Coding and specialized portals enable people with little to no experience in software development to quickly generate usable prototypes. However, without special precautions, these applications do not constitute sustainable, secure, and maintainable products. The dream of the “Citizen Developer” will not automatically come true in the AI age because it neglects the complexity of professional software development.

Our recommendation—Competitive advantages through structured Vibe Coding

Setting our findings aside for a moment, the use of GenAI does accelerate the development of software systems, at least in the short term. This raises the question of how this speed advantage can be sustained in the medium to long term by ensuring that generated code is and remains of high quality. The answer to this question lies in combining AI tools with human expertise and structured methods.

Instead of relying solely on the current vibe, GenAI should be given guidelines, for example with approaches such as Product Requirements Prompts, Context Engineering, or BMAD (Breakthrough Method of Agile AI-Driven Development). These structured methods have in common that the product idea is defined together with coarse-grained technical specifications, such as the desired architecture style, and precise requirements for technical implementation. Together, this structured information provides the context and a plan for development that GenAI and its agents can follow. This can be illustrated very well by the BMAD method and the following mnemonic:

Big Picture: The Big Picture explains the overall goal of the project to GenAI (“We are building a web-based SPaaS platform for...”).
Methodology: The Methodology sets clear rules and guidelines (“Use React in the frontend, Express.js in the backend, and write all code in TypeScript.”).
Action: The Action formulates precise, atomic instructions (“Create an Express route in the routes.ts file that...”).
Details: The Details provide further information on implementing the action (“Define the API endpoints using OpenAPI”).
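
The four parts of the mnemonic can be thought of as fields of a single structured context. The TypeScript sketch below is one possible way to assemble them into a prompt and is not part of any tool mentioned here: the StructuredContext type and the buildPrompt function are hypothetical, and the field values reuse the example phrases from the list above, including their elisions.

```typescript
// Hypothetical shape for the four parts of the mnemonic described above.
interface StructuredContext {
  bigPicture: string;
  methodology: string[];
  action: string;
  details: string[];
}

// Renders the context as plain text, one labeled section per part.
function buildPrompt(context: StructuredContext): string {
  const lines = [
    `Big Picture: ${context.bigPicture}`,
    "Methodology:",
    ...context.methodology.map((rule) => `- ${rule}`),
    `Action: ${context.action}`,
    "Details:",
    ...context.details.map((detail) => `- ${detail}`),
  ];
  return lines.join("\n");
}

const renderedPrompt = buildPrompt({
  bigPicture: "We are building a web-based SPaaS platform for...",
  methodology: [
    "Use React in the frontend and Express.js in the backend.",
    "Write all code in TypeScript.",
  ],
  action: "Create an Express route in the routes.ts file that...",
  details: ["Define the API endpoints using OpenAPI."],
});

console.log(renderedPrompt);
```

Keeping the big picture and methodology fixed while only the action and details change per request mirrors the idea that stable context should not have to be restated for every generation step.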

This information is usually stored in dedicated Markdown files in the repository of the application in question. The files follow a specific naming scheme so that GenAI can automatically identify them as context-relevant. As a result, information about the application's intended use, its architecture, and the frameworks for its implementation does not have to be repeated for each code generation, and it is versioned along with the code when a revision control system such as Git is used. For prompts such as “Integrate a login button,” GenAI takes the provided context into account. Unstructured Vibe Coding lacks such precise guidelines to limit the degrees of freedom of GenAI, so the probability of producing deficient code is much higher than with a structured method.

The benefits of structured methods for AI-supported coding are obvious: with a clear target vision and unambiguous rules, GenAI can generate high-quality, secure, and maintainable code at high speed from natural language prompts, while technical debt remains relatively low. This is especially true when combined with deterministic approaches to ensuring high software quality, such as test automation and Infrastructure as Code, as well as the involvement of human expertise, which can assess, edit, and contextualize AI decisions in case of doubt.

Getting ready for the AI-supported future of software development

What does this mean concretely for our SPaaS customer? In the next step, we empower them with our AI-Assisted Coding Workshop, in which we teach the practical basics of structured AI-supported software development so that they can independently increase the quality of their platform with GenAI and ensure its long-term success when entering the market. For those who are still unsure how to find the right use cases for GenAI, we recommend an AI Use Case Workshop to identify and strategically prioritize promising AI use cases.

Conclusion

AI is not an autopilot that can generate complex, production-ready applications from simple instructions in natural language. Rather, in the right hands, AI is an extremely powerful co-pilot that can in fact accelerate software development. The Vibe Coding approach is useful for quick experiments such as clickable prototypes. However, building robust platform products that meet common security requirements and are maintainable in the long term requires more: a structured approach to the use of AI tools and trained developers. Therefore, it is important to invest not only in AI tools, but also in building knowledge to use them effectively. This makes the difference between an expensive experiment and a real competitive advantage.


Blog authors

Patrick Krings

IT Consultant & Developer


Dr. Florian Rademacher

Service Lead "Software Modernization" & People Lead



