The Modern Open Source Data Platform

A free, open-source, and cloud-native data platform designed for Kubernetes. Built by the community, for the community.

Key Features

OKDP provides a comprehensive suite of data technologies designed for modern cloud-native environments.

Data Centric

Empowering data-driven decisions with modern architecture patterns like data mesh and data fabric.

Cloud Native

Kubernetes-native platform for hosting and managing data services with high availability and scalability.

Community Driven

Open-source platform built by the community, for the community, with no licensing costs.

About TOSIT

TOSIT is an association that promotes community-driven initiatives to create truly open-source technologies and platforms. It brings together numerous companies and public administrations, including DGFiP (Direction Générale des Finances Publiques), BPCE (Banque Populaire, Caisse d'Epargne, and Natixis), and Société Générale.

OKDP is currently implemented and managed mainly by DGFiP. Participation in OKDP is open to all, with the aim of keeping the technology stack accessible, efficient, and powerful for everyone.

Modular Architecture

OKDP offers a modular architecture that allows you to select and deploy only the components you need for your specific use case.

Query Engine

Choose the query engine that best fits your data access patterns and performance requirements.

ML/AI

Select the machine learning and AI tools that align with your data science workflows.

Processing

Deploy the data processing tools and engines that match your workload requirements.

Storage

Choose the storage solutions that best fit your data types and access patterns.

Each component can be deployed independently, allowing you to build a customized data platform that meets your specific needs.

Roadmap

Our planned milestones and upcoming features.

JupyterHub: On-Demand Notebooks

  • Automatically build JupyterLab images via GitHub Actions.
  • Provide a customized Helm chart based on the Jupyter community's chart.

Apache Spark: A large-scale data analytics engine

  • Develop an authentication module for the Spark History Server and Spark UI, including release management and publication to Maven Central.
  • Provide a customized Helm chart for the Spark History Server.
  • Provide customized OKDP images.

Trino & Superset

  • Provide customized Helm charts.
  • Provide OKDP images.

OKDP Sandbox with User Guide

  • Provide a sandbox to deploy OKDP components locally.
  • Provide a user guide for deploying OKDP components on a Kubernetes cluster.

Events

Join us at these upcoming events to learn more about OKDP.

DINUM BlueHats Workshop

Join us at the DINUM BlueHats Workshop where we'll showcase OKDP's capabilities.

TOSIT Day

A special day dedicated to open-source technologies and OKDP's role in the ecosystem.

Data and AI Leaders Paris 2024

Join us at Paris Porte de Versailles for Data and AI Leaders Paris 2024.

BlueHats

Join us at BlueHats for an insightful presentation of OKDP. Time: 11:00 AM - 12:30 PM

BercyInnov

OKDP will be showcased at BercyInnov, demonstrating innovative data solutions for the public sector.

Big Data AI Paris 2025

Join us at Big Data AI Paris 2025 where we'll present OKDP's latest features and use cases.

Join Our Community

OKDP is built by the community, for the community. Join us in shaping the future of open-source data platforms.

Contribute

Join our development efforts and help shape the future of OKDP.

Discuss

Join our community discussions and share your ideas with other members.

Deploy

Get started with OKDP in your environment and share your experience.