Datadex

Datadex is a fully open-source, serverless, and local-first Data Platform that improves how communities collaborate on Open Data. Datadex is not a new tool, it is a pattern showing an opinionated bridge between existing ones.

Principles

Open: Code, standards, infrastructure, and data, all public and open source. Rely on open source tools, standards, public infrastructure, and accessible data formats.
Modular and Interoperable: Easy to replace, extend or remove components of the pattern. Environment flexibility (laptop, cluster, browser) when running and when deploying (S3 + GH Pages, IPFS, Hugging Face).
Permissionless: Any improvement is one Pull Request away. Update pipelines, add datasets, or improve documentation. No API limits, just plain open files.
Simple: Static assets, batch jobs.
Data as Code: Reproducible datasets with declarative stateless transformations tracked in git. Data is versioned alongside the code. Models are reusable, packaged, and versioned.
Glue: Be a bridge between tools and approaches. Follow UNIX philosophy.