Open Kubernetes Data Platform
A free, open-source and cloud-native data platform built on Kubernetes. Modular, sovereign, and community-driven.
What is OKDP?
OKDP (Open Kubernetes Data Platform) is a comprehensive data management services platform composed of containerized open source software and products, running on Kubernetes infrastructure.
OKDP addresses the complete data lifecycle: collection, storage, processing, analysis, and data exposition. It's modular by design—deploy all components or select only what you need.
Why OKDP?
Data Centric
Comprehensive platform covering the entire data lifecycle with built-in governance, facilitating data sharing and minimizing duplication.
Cloud Native
Kubernetes-native platform designed for modern cloud environments with high availability, scalability, and multi-cloud support.
True Open Source
100% open source with Apache V2 license. Complete control of the technology lifecycle from build to deployment, with no vendor lock-in.
Sovereignty & Cost Control
Maintain full autonomy over your data infrastructure while eliminating licensing costs. Free to use, modify, and deploy at any scale.
Modular & Future-Proof
Flexible architecture adapts to your needs. Built on Kubernetes to prevent technical debt, ensuring sustainability and continuous modernization.
Community-Driven
Built by the community, for the community. Open collaboration from public and private organizations with a growing ecosystem.
Modular Architecture
OKDP offers a modular architecture that allows you to select and deploy only the components you need for your specific use case.
Visualization & Database
Built on Kubernetes
Running on any Kubernetes distribution (RKE, EKS, AKS, GKE) with comprehensive security, observability, and resource management.
Each component can be deployed independently, allowing you to build a customized data platform that meets your specific needs.
Roadmap
2026: Evolution & Scale
Technology Builds
OngoingContinuous building of source code and Docker images for all platform technologies.
Apache Polaris & Iceberg
Q1-Q2Integration of Apache Iceberg and Polaris catalog with STS S3 support and RBAC.
New Frontend & Server
Q2-Q3Overhaul of the user interface and server for a unified and high-performance experience.
Apache Airflow
Q3-Q4Implementation of Apache Airflow for workflow automation and orchestration.
Cross-Cutting Themes
Security
End-to-end OIDC authentication
Resources
Queue management system
MLOps
Kubeflow & MLflow
Observability
Logs, monitoring, audit
2024-25: Foundation
View previous milestones
Initial Data Technologies
Successfully integrated core data technologies: JupyterHub, Apache Spark, Trino, Hive Metastore, and Superset.
Spark & Jupyter Images
Provision and support of official base images for Apache Spark and JupyterLab.
OKDP Server/UI
Published the first version of OKDP Server and user interface for platform management.
Sandbox & Documentation
Released comprehensive sandbox environment with user guide and end-to-end test application.
Join Our Community
OKDP is built by the community, for the community. Join us in shaping the future of open-source data platforms.
Weekly Technical Meeting
Every Wednesday at 10:00 AM (CET) - Contact us to receive the meeting details.
Contact Us to JoinCall for Contributions
Help us build the Open Kubernetes Data Platform. Whether you enjoy infrastructure, data engineering, or docs, there is room for you.
Contribute on GitHub