100% Open Source • Apache V2 License

Open Kubernetes Data Platform

A free, open-source and cloud-native data platform built on Kubernetes. Modular, sovereign, and community-driven.

7+
Contributors
~20
Weekly Downloads
2024
Project Started

What is OKDP?

OKDP (Open Kubernetes Data Platform) is a comprehensive data management services platform composed of containerized open source software and products, running on Kubernetes infrastructure.

OKDP addresses the complete data lifecycle: collection, storage, processing, analysis, and data exposition. It's modular by design—deploy all components or select only what you need.

Why OKDP?

📊

Data Centric

Comprehensive platform covering the entire data lifecycle with built-in governance, facilitating data sharing and minimizing duplication.

☁️

Cloud Native

Kubernetes-native platform designed for modern cloud environments with high availability, scalability, and multi-cloud support.

🔓

True Open Source

100% open source with Apache V2 license. Complete control of the technology lifecycle from build to deployment, with no vendor lock-in.

🏛️

Sovereignty & Cost Control

Maintain full autonomy over your data infrastructure while eliminating licensing costs. Free to use, modify, and deploy at any scale.

🎛️

Modular & Future-Proof

Flexible architecture adapts to your needs. Built on Kubernetes to prevent technical debt, ensuring sustainability and continuous modernization.

🌍

Community-Driven

Built by the community, for the community. Open collaboration from public and private organizations with a growing ecosystem.

Modular Architecture

OKDP offers a modular architecture that allows you to select and deploy only the components you need for your specific use case.

Query Engine

High-performance SQL query engines for distributed data.

ML/AI

Complete MLOps platform for data science workflows.

Processing & Orchestration

Data processing engines and workflow orchestration.

Storage & Catalog

Modern lakehouse with transactional tables and metadata management.

Visualization & Database

Built on Kubernetes

Running on any Kubernetes distribution (RKE, EKS, AKS, GKE) with comprehensive security, observability, and resource management.

Security & RBAC TLS/Certificates SSO & LDAP Monitoring Backup & DRP Load Balancing Ingress Control Resource Scheduling

Each component can be deployed independently, allowing you to build a customized data platform that meets your specific needs.

Roadmap

2026: Evolution & Scale

🔨

Technology Builds

Ongoing

Continuous building of source code and Docker images for all platform technologies.

🧊

Apache Polaris & Iceberg

Q1-Q2

Integration of Apache Iceberg and Polaris catalog with STS S3 support and RBAC.

💻

New Frontend & Server

Q2-Q3

Overhaul of the user interface and server for a unified and high-performance experience.

🔄

Apache Airflow

Q3-Q4

Implementation of Apache Airflow for workflow automation and orchestration.

Cross-Cutting Themes

🔒
Security

End-to-end OIDC authentication

⚖️
Resources

Queue management system

🤖
MLOps

Kubeflow & MLflow

📊
Observability

Logs, monitoring, audit

2024-25: Foundation
View previous milestones

Initial Data Technologies

Successfully integrated core data technologies: JupyterHub, Apache Spark, Trino, Hive Metastore, and Superset.

Spark & Jupyter Images

Provision and support of official base images for Apache Spark and JupyterLab.

OKDP Server/UI

Published the first version of OKDP Server and user interface for platform management.

Sandbox & Documentation

Released comprehensive sandbox environment with user guide and end-to-end test application.

Join Our Community

OKDP is built by the community, for the community. Join us in shaping the future of open-source data platforms.

Weekly Technical Meeting

Every Wednesday at 10:00 AM (CET) - Contact us to receive the meeting details.

Contact Us to Join

Call for Contributions

Help us build the Open Kubernetes Data Platform. Whether you enjoy infrastructure, data engineering, or docs, there is room for you.

Contribute on GitHub

About TOSIT

TOSIT is an association that promotes community-driven initiatives to create truly open-source technologies and platforms. It hosts the TDP project, which was initiated by DGFiP and EDF.

Since January 2024, DGFiP started the OKDP adventure, which was later joined by Orange. The association brings together numerous companies and administrations including BPCE (Banque Populaire, Caisse d'Epargne et Natixis), Société Générale, among others.

Participation in TOSIT projects is open to all, with the aim of ensuring that the technology stack is accessible, efficient, and powerful for everyone.