Talk to your Data Part 1: How to generate Insights with MotherDuck MCP Server and OpenCode

12.2.2026 | 5 minutes reading time

MotherDuck's new MCP server gives us the opportunity to have a conversation with AI models like Claude or ChatGPT and ask questions about our data that are directly translated into SQL. The queries are executed against the actual data in our cloud data warehouse. This removes one more layer of indirection between intent and insight. In this article, we explore the initial setup and run first queries to establish a baseline for further investigations.

The Tool Chain

MCP, the Model Context Protocol, is Anthropic's open standard for connecting AI assistants to external data sources and tools. MotherDuck offers the MCP server, and we use opencode as an MCP client. opencode is an open-source AI coding agent with a terminal user interface (TUI). It lets you switch between different models and sessions and integrate multiple tools and agents.

Its easy-to-use MCP support made opencode a good fit for inspecting the performance of different models and for quickly creating agents that could help us achieve better results. The client discovers the available tools and uses them to create and execute SQL queries against the MotherDuck databases. The connection is read-only, so the agent cannot change any data. The big advantage: no custom integration, no proprietary APIs, no coding.

Test Data and Context

We decided to use two different datasets that could eventually be combined to look for insights: one dataset that we know well, and one about which we only have meta information. This approach tests how well the system handles initial knowledge retrieval and whether the MotherDuck MCP server works for inspecting unknown data.

The known dataset contains 15 years of historical weather data for Munich. The unknown dataset contains bike traffic measurements in Munich from 2008 onward, sourced from public records. The bike data comes with inconsistent column names, schema evolution, missing data, and a few other artifacts.

Initial Setup

The initial setup is straightforward. You need a MotherDuck account, opencode installed, and the MCP connection settings in your config file. After running the authentication command (`opencode mcp auth <name-of-mcp-server>`) and allowing the connection via the MotherDuck website in your browser, the connection is established and you can execute prompts against your data in MotherDuck.
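
For orientation, here is a minimal sketch of what such a config entry might look like in opencode's JSON config. The server name and the URL are placeholders, not the official values; check the opencode and MotherDuck MCP documentation for the exact keys:

```json
{
  "mcp": {
    "motherduck": {
      "type": "remote",
      "url": "https://<motherduck-mcp-endpoint>"
    }
  }
}
```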

Context Management

The agent starts stateless, with no knowledge of your database, schema, or contents. Initial context priming is necessary to create a baseline before asking specific questions about the data, like "which month has the most days with temperatures above 30 degrees Celsius?". MotherDuck mentions this specifically in the documentation. Good first prompts create context that supports more specific queries later. Additionally, well-documented tables with COMMENT ON descriptions should help the model interpret the data based on the user's request.
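
Such descriptions can be attached directly in DuckDB/MotherDuck. A small sketch, with hypothetical table and column names:

```sql
-- Table and column names are hypothetical, for illustration only
COMMENT ON TABLE weather IS 'Daily weather observations for Munich, 2010 onward';
COMMENT ON COLUMN weather.temp_max IS 'Daily maximum air temperature in degrees Celsius';
```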

The initial prompts inspired by MotherDuck's examples were "What tables and databases do I have access to in MotherDuck?", "Can you give more details about my tables in my_db?" and "Do all bicycle tables in my_db have the same schema?" We wanted to create an overview of the tables in my_db and highlight the schema changes in the tables containing bicycle data. Without any prior knowledge, the agent understood the assignment, inspected the schema and targeted the appropriate tables with tool calls to the MotherDuck MCP server.
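
Behind these prompts, the tool calls boil down to standard introspection SQL. A sketch of the kind of statements the agent runs (my_db is our database from the prompts above; the bicycle table name is hypothetical):

```sql
-- List all databases and tables visible to the connection
SHOW ALL TABLES;

-- Inspect the schema of one of the bicycle tables (name hypothetical)
DESCRIBE my_db.bicycle_counts;
```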

Besides seeing the responses, we can also observe the agent's thinking and the SQL it executes. This almost always follows the "thought, tool call, observation" pattern. When we export the session contents, we can also see the data returned from the MCP server, although in the Markdown export it is often truncated due to its size.

Deeper Analysis

With context established, we can test the real potential of this approach. We know that months in the middle of the year are almost always the hottest in Munich. To find out which months are the hottest and to unravel some trends, it is essential to frame our question with context. The documentation suggests including time ranges, filters, metrics, and an output format.

We created the following prompt: "Use the MotherDuck query tool to first identify the weather table schema. Then find the hottest month for every year by returning the year, month name, and maximum temperature. Additionally, analyze if the timing of the warmest month is shifting earlier or later in the year and calculate the year over year temperature change to see if peak heat is rising. Provide the results in a table followed by a brief summary of the climate trends."
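
We did not write any SQL ourselves, but the queries the agent generated followed a pattern roughly like the sketch below. The table and column names (weather, obs_date, temp_max) are assumptions for illustration; only the DuckDB functions are standard:

```sql
-- Sketch of the generated analysis; table and column names
-- (weather, obs_date, temp_max) are hypothetical.
WITH monthly AS (
    SELECT year(obs_date)  AS yr,
           month(obs_date) AS mon,
           max(temp_max)   AS max_temp
    FROM weather
    GROUP BY 1, 2
),
hottest AS (
    SELECT *,
           row_number() OVER (PARTITION BY yr ORDER BY max_temp DESC) AS rn
    FROM monthly
)
SELECT yr,
       monthname(make_date(yr, mon, 1)) AS month_name,
       max_temp,
       round(max_temp - lag(max_temp) OVER (ORDER BY yr), 1) AS yoy_change
FROM hottest
WHERE rn = 1
ORDER BY yr;
```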

We received results in 15 seconds. Writing these queries manually would have taken several minutes at minimum. We could see the different queries being executed: grouping the data by year and month, finding the maximum temperature for each month, and mapping the month number to its name. This worked not only for the initial gathering of data points but also for the comparison that was part of the task.

The query returned detailed year-over-year trends:

| Year | Month  | Max Temperature (°C) | YoY Change (°C) | Timing Shift |
|------|--------|----------------------|-----------------|--------------|
| 2015 | July   | 36.5                 |                 |              |
| 2016 | July   | 34.6                 | -1.9            | Same         |
| 2017 | June   | 34.8                 | +0.2            | Earlier      |
| 2018 | August | 36.2                 | +1.4            | Later        |
| 2019 | July   | 40.0                 | +3.8            | Earlier      |
| 2020 | August | 36.2                 | -3.8            | Later        |
| 2021 | June   | 33.5                 | -2.7            | Earlier      |
| 2022 | July   | 37.7                 | +4.2            | Later        |
| 2023 | July   | 34.1                 | -3.6            | Same         |
| 2024 | August | 33.6                 | -0.5            | Later        |
| 2025 | July   | 35.1                 | +1.5            | Earlier      |

Additionally, we received an explanation that there is no clear shift in the timing of the hottest month, and no clear trend in peak temperatures. This makes sense given the limited time frame of only a few years, which is also supported by the average temperature not changing drastically from 2015 to 2025. With a time frame reaching back to 1950 or 1900, the answer would likely look very different.

Conclusion and Outlook

The setup proved straightforward, and creating context for analysis took minimal effort. More sophisticated prompts worked well, delivering complete answers to our questions. Asking clarifying questions after receiving the initial response was also seamless.

This concludes the first part of our multi-part series on MotherDuck's MCP server. In the next part, we will see how far we can push the natural language interface in terms of retrieval performance and accuracy, and find out where hallucinations can occur.
