Data Mesh
- Anand Nerurkar
- Mar 14
- 2 min read
What Data Mesh is
Why it is needed
When to use it
How governance is maintained
Let’s structure it clearly so you can explain confidently in an interview.
1️⃣ What is Data Mesh?
Data Mesh is a decentralized data architecture where business domains own and manage their data as products, instead of relying on a centralized data team.
Traditional model:
All data → Central Data Team → Data Warehouse → ConsumersThis creates bottlenecks as the organization grows.
Data Mesh model:
Domain A → Data ProductDomain B → Data ProductDomain C → Data Product ↓Shared Data Platform + GovernanceEach domain manages its own data pipelines and datasets.
2️⃣ Core Principles of Data Mesh
There are four key principles.
Domain-Oriented Data Ownership
Each business domain owns its data.
Examples in a bank:
Domain | Data Product |
Payments | Transaction dataset |
Lending | Loan data |
Customer | Customer profile data |
Fraud | Fraud detection dataset |
This reduces dependency on a central data team.
Data as a Product
Each dataset is treated like a product with clear ownership and quality standards.
A data product includes:
well-defined schema
documentation
access policies
service-level expectations.
Example:
Customer 360 Data ProductOwner: Customer DomainConsumers: Marketing, Risk, AnalyticsSelf-Service Data Platform
A central platform team provides tools that allow domains to build data products easily.
Examples:
data pipelines
data catalog
data governance tools
security policies
ML platforms.
This prevents every domain from reinventing infrastructure.
Federated Governance
Governance is shared between:
central governance teams
domain teams.
This ensures:
compliance
security
data standards.
3️⃣ When Would You Use Data Mesh?
Data Mesh is useful in large enterprises with many domains and large data volumes.
Typical situations include:
1. Many independent business domains
Example in banking:
payments
lending
wealth
digital banking
risk.
A central data team cannot scale effectively.
2. Data bottlenecks
If every team must wait for the central data engineering team, innovation slows.
Data Mesh allows domains to move faster.
3. Large-scale analytics and AI initiatives
AI teams require high-quality domain data.
Data Mesh ensures:
domain experts own the data
better data quality.
4. Multi-cloud or distributed architectures
Large organizations often run data platforms across:
on-prem systems
cloud data lakes
multiple cloud providers.
Data Mesh helps manage distributed data ownership.
4️⃣ Example in a Banking Transformation
In a bank implementing Data Mesh:
Domain | Data Product |
Payments | Real-time transaction data |
Lending | Loan portfolio analytics |
Customer | Customer 360 profile |
Fraud | Fraud detection datasets |
Each domain:
owns pipelines
publishes datasets
manages quality.
These datasets are then used by:
AI fraud models
customer analytics
marketing insights.
5️⃣ How Governance Still Works
Even though ownership is decentralized, governance is centralized through:
data catalog
data classification
security policies
data quality rules.
The platform enforces these automatically.
Data Mesh is a decentralized data architecture where business domains own and manage their data as products while a central platform provides shared infrastructure and governance. It is typically used in large organizations where centralized data teams become bottlenecks and domain ownership improves scalability, data quality, and agility.
Data Mesh shifts the organization from centralized data management to domain-driven data ownership, enabling scalable analytics and AI capabilities across large enterprises.
.png)

Comments