100% Open Source • Apache V2 License

Open Kubernetes Data Platform

A free, open-source and cloud-native data platform built on Kubernetes. Modular, sovereign, and community-driven.

7+ Contributors
~20 Weekly Downloads
Project Started: 2024

What is OKDP?

OKDP (Open Kubernetes Data Platform) is a comprehensive platform of data management services, assembled from containerized open-source software and running on Kubernetes infrastructure.

OKDP addresses the complete data lifecycle: collection, storage, processing, analysis, and exposure. It is modular by design, so you can deploy all components or select only what you need.

Why OKDP?

📊

Data Centric

Comprehensive platform covering the entire data lifecycle with built-in governance, facilitating data sharing and minimizing duplication.

☁️

Cloud Native

Kubernetes-native platform designed for modern cloud environments with high availability, scalability, and multi-cloud support.

🔓

True Open Source

100% open source under the Apache License 2.0. Complete control of the technology lifecycle from build to deployment, with no vendor lock-in.

🏛️

Sovereignty & Cost Control

Maintain full autonomy over your data infrastructure while eliminating licensing costs. Free to use, modify, and deploy at any scale.

🎛️

Modular & Future-Proof

A flexible architecture that adapts to your needs. Building on Kubernetes helps limit technical debt and supports sustainability and continuous modernization.

🌍

Community-Driven

Built by the community, for the community. Open collaboration between public and private organizations, with a growing ecosystem.

Modular Architecture

OKDP's architecture is built around two major layers: an ecosystem of reference Data & AI Modules and a unified Control Plane.

Data & AI Modules

A catalog of reference open source tools. Usable individually or combined, without depending on the OKDP Control Plane.

Ingestion & Streaming (Future)

Connectivity, ETL ingestion and real-time processing.

Lakehouse & Analytics

High-performance SQL engines and modern lakehouse.

Data Science

Interactive environments for exploration and analysis.

AI & MLOps (Future)

MLOps platform and language model inference.

Kubeflow, MLflow, LLM Serving

Visualization & BI

Dashboards and visual data exploration.

Orchestration & Governance

Workflow automation and metadata cataloging.

Apache Airflow, OpenMetadata

OKDP Control Plane

OKDP's automation layer: orchestration, multi-tenant isolation, and governance for a turnkey experience across the entire stack.

🖥️

Server / UI / CLI

Unified web portal and interfaces for administrators and users.

🗂️

Project & Quota Management

Secure multi-tenant isolation and per-project resource limit management.

🔒

Auth & Secrets Management

End-to-end OIDC authentication, RBAC, and secure secrets management.

📈

Observability

Centralized collection of metrics, logs and traces across the entire platform.

Kubernetes cluster required.
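
To make the idea of per-project resource limits concrete, here is a minimal sketch of how such limits are commonly expressed on plain Kubernetes, using the standard ResourceQuota object applied to a project's namespace. Whether the OKDP Control Plane relies on ResourceQuota internally is an assumption, and the helper name, namespace, and limit values below are hypothetical and only for illustration.

```python
# Hypothetical helper: builds a standard Kubernetes ResourceQuota manifest
# for one project namespace. The names and limits are illustrative and are
# not taken from the OKDP code base.
import json


def project_quota(namespace: str, cpu: str, memory: str, pods: int) -> dict:
    """Return a v1 ResourceQuota manifest capping a namespace's total requests."""
    return {
        "apiVersion": "v1",
        "kind": "ResourceQuota",
        "metadata": {"name": f"{namespace}-quota", "namespace": namespace},
        "spec": {
            "hard": {
                "requests.cpu": cpu,        # total CPU requested by all pods
                "requests.memory": memory,  # total memory requested by all pods
                "pods": str(pods),          # maximum number of pods
            }
        },
    }


if __name__ == "__main__":
    # kubectl accepts JSON manifests, so this output can be applied directly.
    print(json.dumps(project_quota("analytics-team", "8", "32Gi", 20), indent=2))
```

Saving the printed manifest to a file and applying it with `kubectl apply -f` would cap the example namespace at 8 CPUs, 32Gi of memory, and 20 pods in total.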

Roadmap

First release v1.0.0 planned for June 2026.

View the full roadmap

Join Our Community

OKDP is built by the community, for the community. Join us in shaping the future of open-source data platforms.

Weekly Technical Meeting

Every Wednesday at 10:00 AM (CET) - Contact us to receive the meeting details.

Contact Us to Join

Call for Contributions

Help us build the Open Kubernetes Data Platform. Whether you enjoy infrastructure, data engineering, or docs, there is room for you.

Contribute on GitHub

About TOSIT

OKDP is a project initiated by DGFiP, joined by Orange and other organizations, within the TOSIT association (The Open Source I Trust). The goal is to provide a sovereign, powerful, and fully open-source data & AI technology stack accessible to all.

The association brings together numerous companies and administrations, including BPCE (Banque Populaire, Caisse d'Epargne, and Natixis) and Société Générale. It also hosts the TDP project, initiated by DGFiP and EDF.

Participation in TOSIT projects is open to all.