MotherDuck: Access Management and Scalable Analytics Overview

8.12.2025 | 6 minutes reading time

MotherDuck's architecture for storage management and user access is built on several key design principles that shape how data is organized and shared. To understand how MotherDuck manages access control, you need to understand three key concepts: organizations, shares and Ducklings. First of all, MotherDuck uses organizations to group users. This is the top-level entity that handles administrations, data sharing, billing, and security, to name a few. Users can create organizations or join an existing one when signing up. Databases can be shared inside the organization with other users or outside via share URLs.

Creating the share works via the UI or SQL commands. Below is an SQL code snippet that explains the necessary steps.

1ATTACH "md:";
2
3USE flights;
4CREATE SHARE IF NOT EXISTS flights FROM flights
5   (ACCESS ORGANIZATION , VISIBILITY DISCOVERABLE, UPDATE MANUAL);

In this example, everyone in the user’s organization can see the share and read the data. The ACCESS parameter controls who can see the share, VISIBILITY determines if it appears in the UI, and UPDATE controls how data refreshes are handled. The other scopes for sharing are restricted and unrestricted, allowing either read access to a limited group of users or unlimited read access for every user who is signed into MotherDuck and has the shared URL. Share URLs are constructed as follows md:_share/<source_database_name>/<share_token>.

To consume shared data a user must attach the share to their workspace. This creates a read-only zero-copy clone of the source database. Attaching the database is possible in two ways. The UI offers all available shares in an overview when the creators make their shares discoverable. The share can easily be attached via the press of a button. However, hidden shares must be attached via the share URL, which must be sent to the consumer by the data provider. Public shares can also be attached via the respective share URL if the consumer is not using the UI.

The UPDATE keyword determines if adding new data is reflected for the share consumers automatically. If it is set to MANUAL, the data provider must refresh the data via an SQL statement

1UPDATE SHARE flights;

Consumers on the other site can use the REFRESH statement to check for new data manually. Auto-updating shares are refreshed periodically every minute.

Read-Only only?

This sharing model naturally leads to an important question: why are so many of these connections read-only? This is a limitation of the underlying technology for MotherDuck, which is DuckDB. DuckDB allows multiple parallel read-only connections but only one read-write connection at a time per database file. Each user has their own Duckling, a compute instance that has a very fast cold-start and shuts down as quickly when idle. To allow shared storage, the DuckDB instance (the Duckling) must be in read-only mode to realize the access. Consumers can decide to create a full replica of shared data and use the write access for their own data to manipulate the copy, however updates from the original share can no longer be reflected in this dataset automatically.

There are actually two types of Ducklings available. One is the standard read-write Duckling that is assigned to every user by default, providing full access to create tables, modify structures, and perform data transformations. The second type is Read Scaling Ducklings, which are used specifically for read operations that could cause a bottleneck if only one Duckling served requests from multiple users.

Read Scaling

The challenge MotherDuck identified was that Business Intelligence tools typically use a single shared database connection across an entire organization, making all queries appear as if they come from one user. Within Motherduck this would mean all requests route to the same Duckling, potentially causing performance issues when dozens or hundreds of people use dashboards simultaneously. For instance, if your marketing team of 50 people all opened the same dashboard at 9 AM Monday morning, without Read Scaling that single Duckling could be overwhelmed. Read Scaling solves this by automatically spinning up additional Ducklings as needed when clients connect using read-only tokens. These replicas are clones of the original Duckling that have access to the same data and distribute the query load across multiple instances.

Read Scaling can be enabled when creating an access token in MotherDuck. The other available token type offers read-write access and does not scale since it is using the default Duckling and multiple read-write connections are not available. This bundling of scaling capabilities with access management is an unconventional approach. In most data warehouses, scaling is controlled through separate compute configuration or warehouse sizing settings, independent of the authentication. MotherDuck ties these concerns together: choosing read-only access automatically enables the scaling mechanism, while write access inherently limits users to a single instance. When a client connects using a Read Scaling token, it is directed to one of the read scaling replicas, each powered by its own dedicated duckling. As more users connect via read scaling tokens, the flock of ducklings expands, aiming to give each user their own dedicated duckling up to the configured limit, which defaults to 16 replicas but can be adjusted by contacting MotherDuck support. If this limit is exceeded, new connections share existing ducklings while maintaining user-to-duckling affinity where possible.

MotherDuck intelligently routes queries to maintain session affinity. Applications can provide a session_hint parameter in the connection string to ensure all queries from a specific end user route to the same duckling replica. This improves caching effectiveness and provides a more consistent view of data across queries for that user. The session hint can be set to a user session ID, user ID, or hashed value for privacy. Additionally, DuckDB integrations now support instance caching with a configurable time-to-live parameter (dbinstance_inactivity_ttl) that helps maintain session affinity even across separate queries or short connection gaps. Read scaling replicas are eventually consistent, meaning read operations might see data that slightly lags behind the very latest writes made to the primary instance, typically syncing within a few minutes. For applications requiring stricter synchronization, users can manually create snapshots on the writer connection and refresh databases on read-scaling connections. This architecture has delivered significant performance improvements, with MotherDuck's own BI dashboards loading much faster after switching to Read Scaling tokens, making it a lightweight solution compared to traditional data warehouse scaling approaches.

Conclusion

MotherDuck's access management centers on understanding the interplay between shares, Ducklings, and tokens. Shares enable zero-copy data distribution across your organization but lock you into read-only access due to DuckDB's single-writer constraint. In practice, this means only one user, often a technical service account, can provide and update data while all the other users consume it read-only. This aligns with common data warehouse patterns where most data remains immutable to prevent users from breaking shared datasets. Standard read-write Ducklings work well for individual users, but shared BI connections can bottleneck without Read Scaling tokens. The key consideration is eventual consistency: Read Scaling replicas may lag behind primary instances by a few minutes, which matters for real-time scenarios but is acceptable for most analytics workflows. Plan your architecture around these limitations, particularly if your use case requires immediate data synchronization or frequent write operations on shared datasets.

A look into the future: DuckLake-based storage will mitigate the limitation of single-account write access in a future release. Currently, write permissions are limited to one account per database, which can perform multiple concurrent writes as long as they are append-only.

To see these concepts in action, enroll in our on-demand Hands-on Workshop: Introduction to MotherDuck for a complete practical walkthrough.

Was this post helpful?

Blog author

Hendrik Kamp

IT Consultant

Do you still have questions? Just send me a message.

Narwhals: Building Dataframe-Agnostic Libraries with Zero Dependencies

After the publication of our article about Ibis, Dr André Schemaitat pointed us to a similar tool with growing popularity – Narwhals. Narwhals describes itself as an "extremely lightweight and extensible compatibility layer between dataframe libraries...

Data
Python
Software development

3.3.2026 | 11 minutes reading time

Niklas Niggemann

Talk to your Data Part 3: The Potential of Natural Language

This is the last and final part of our article series covering the new MCP server by MotherDuck. We have already presented the basics and challenges in previous parts. Now, we want to conclude with our findings and comments on the current state and give...

MotherDuck
Data

27.2.2026 | 7 minutes reading time

Hendrik Kamp

Niklas Niggemann

Talk to your Data Part 2: Limits and Performance Enhancements

In part one of this series, we introduced the MotherDuck MCP server in combination with opencode and showcased initial context engineering. We also showed deeper knowledge retrieval using natural language instead of SQL. In this article we will dive ...

MotherDuck
Data

19.2.2026 | 8 minutes reading time

Niklas Niggemann

Hendrik Kamp

Talk to your Data Part 1: How to generate Insights with MotherDuck MCP...

MotherDuck's new MCP server gives us the opportunity to have a conversation with an AI models like Claude or ChatGPT and ask questions about our data that are directly transformed into SQL. The queries are executed against the actual data in our cloud...

MotherDuck
Data

12.2.2026 | 6 minutes reading time

Niklas Niggemann

Hendrik Kamp

Ibis: Selecting the Right Execution Engine Without Rewriting Your Logic

In our previous benchmarks, DuckDB consistently outperformed Polars and Pandas on large analytical workloads, but performance comparisons miss a critical question: what happens when you need to move from local DuckDB development to a BigQuery production...

MotherDuck
Data
Big Data
Data Science

10.2.2026 | 6 minutes reading time

Niklas Niggemann

DuckDB vs. Polars: Performance & Memory on Massive Parquet Data

Update 02.02.26 – After helpful insights from the Polars team on LinkedIn, we enhanced our benchmark setup with a configuration of Polars where async is forced. This is elaborated in the article. Our previous benchmark compared DuckDB, Polars, and Pandas...

MotherDuck
Data Science
Data

20.1.2026 | 15 minutes reading time

Niklas Niggemann

DuckDB vs. DataFrame Libraries

Update 10.12.25 – After helpful insights from Polars Engineer Thijs Nieuwdorp following the initial posting of this article, we were able to refactor our use of the deprecated .count() function in Polars, replacing it with the correct .len() function...

MotherDuck
Data
Data Science
Python
Database

1.12.2025 | 10 minutes reading time

Niklas Niggemann

ODPS: The Standard for Data Products

The data landscape in an organization often looks like this: teams gather and produce data everyday. Each team develops their own metadata models and documentation, if there is any at all. Governance policies exist in scattered documentation (spreadsheets...

Data

7.11.2025 | 4 minutes reading time

DuckDB and MotherDuck for customer facing analytics

MotherDuck
Data

21.10.2025 | 5 minutes reading time

Matthias Niehoff

DuckDB’s friendly SQL is a game changer for developer experience

I don’t think anyone will be surprised when I say that SQL is not the nicest language to work with. Some might even say that it has terrible ergonomics, especially for larger and more complex queries. Still, there are very good reasons why SQL is the...

Data
MotherDuck

14.10.2025 | 12 minutes reading time

Zero-ETL with MotherDuck: A Technical Deep Dive

MotherDuck, the cloud-native service built on DuckDB, fundamentally transforms how organizations interact with data stored in cloud blob storage. By eliminating the traditional ETL/ELT pipeline, MotherDuck enables direct SQL analytics on Parquet, JSON...

MotherDuck
Data

7.10.2025 | 6 minutes reading time

Hendrik Kamp

Your First Data Analysis with MotherDuck and DuckDB: From CSV to Insights...

In this post, we'll explore how MotherDuck, powered by DuckDB, revolutionizes the way you interact with your data, particularly when dealing with CSV files. You'll learn how to quickly parse and filter even large datasets directly from your local machine...

Data
Database
MotherDuck
Big Data

30.9.2025 | 8 minutes reading time

5 Reasons Why We’re Excited About MotherDuck Launch in AWS Frankfurt

5 Reasons We’re Excited About MotherDuck’s Launch in AWS Frankfurt For some time, a key challenge for European data teams has been balancing innovation with strict regulation. We’ve often seen powerful tools launch first in the US, while our need for...

Data
Big Data
Database
News
MotherDuck

24.9.2025 | 6 minutes reading time

Marcel Mikl

Using Dagster with DuckDB

DuckDB has rapidly emerged as a popular in-process analytics database. Dagster, on the other hand, is a modern data orchestration framework that makes it easy to build and manage data pipelines. Combining Dagster with DuckDB allows data engineers to ...

Data

16.5.2025 | 4 minutes reading time

Hendrik Kamp

Querying Databricks Delta Tables in Motherduck

Intro In a previous article, my colleague Matthias Niehoff demonstrated how duckdb can serve as a viable alternative to Spark for processing data stored in Databricks, specifically by directly accessing the Unity Catalog. Building upon that, a next ...

Data

25.4.2025 | 4 minutes reading time

Hendrik Kamp

Introducing Data Interface Quadrants (DIQs)

In today’s rapidly evolving, data-driven world, organisations face an increasingly complex challenge: how to design, implement, and manage data interfaces that meet both immediate operational demands and long-term strategic business objectives. A data...

API
Data

30.1.2025 | 8 minutes reading time

Daniel Kocot

Miriam Greis

Access Databricks UnityCatalog from duckdb

Databricks is a great platform when it comes to data management and governance, mostly due to the unity catalog. But Spark as an engine for processing the data is just ok'ish, especially when data is not really big. New engines like polars, datafusion...

Data

20.1.2025 | 5 minutes reading time

Matthias Niehoff

Charge your APIs Volume 36 - Trends for 2025

As 2025 approaches, new trends are emerging in the world of APIs. After 2024 was user-centric, the focus is now shifting back to developer needs and increasing productivity. APIs are evolving and the technologies surrounding them are becoming more powerful...

Integration
API
Data
Software architecture

11.12.2024 | 5 minutes reading time

Daniel Kocot

When Business Meets Technology: From Data Product to Data Architecture...

Abstract The Data Product Canvas (DPC) is a tool for the lightweight and iterative definition of data products. It increases the efficiency of product definition by clearly presenting the key impact areas on data products. Additionally, the DPC motivates...

Software architecture
Data
DDD
Digital product developement

6.8.2024 | 24 minutes reading time

Dr. Florian Rademacher

Charge your APIs Volume 28: Empowering application and data integration...

In today's fast-paced world, seamless application and data integration is crucial for organisational success. This blog explores how frameworks like Maslow's Pyramid, Team Topologies, Evolutionary Architectures, API Federation, and API Marketplaces, ...

API
Data
Integration

25.7.2024 | 8 minutes reading time

Daniel Kocot

MotherDuck: Access Management and Scalable Analytics Overview

Read-Only only?

Read Scaling

Conclusion

Was this post helpful?

Blog author

More articles in this subject area

Narwhals: Building Dataframe-Agnostic Libraries with Zero Dependencies

Talk to your Data Part 3: The Potential of Natural Language

Talk to your Data Part 2: Limits and Performance Enhancements

Talk to your Data Part 1: How to generate Insights with MotherDuck MCP...

Ibis: Selecting the Right Execution Engine Without Rewriting Your Logic

DuckDB vs. Polars: Performance & Memory on Massive Parquet Data

DuckDB vs. DataFrame Libraries

ODPS: The Standard for Data Products

DuckDB and MotherDuck for customer facing analytics

DuckDB’s friendly SQL is a game changer for developer experience

Zero-ETL with MotherDuck: A Technical Deep Dive

Your First Data Analysis with MotherDuck and DuckDB: From CSV to Insights...

5 Reasons Why We’re Excited About MotherDuck Launch in AWS Frankfurt

Using Dagster with DuckDB

Querying Databricks Delta Tables in Motherduck

Introducing Data Interface Quadrants (DIQs)

Access Databricks UnityCatalog from duckdb

Charge your APIs Volume 36 - Trends for 2025

When Business Meets Technology: From Data Product to Data Architecture...

Charge your APIs Volume 28: Empowering application and data integration...