Datadex is a fully open-source, serverless, and local-first Data Platform that improves how communities collaborate on Open Data. Datadex is not a new tool; it is a pattern showing an opinionated bridge between existing ones.
Principles
- Open
- Code, standards, infrastructure, and data, all public and open source. Rely on open source tools, standards, public infrastructure, and accessible data formats.
- Modular and Interoperable
- Easy to replace, extend, or remove components of the pattern. Flexible environments both when running (laptop, cluster, browser) and when deploying (S3 + GH Pages, IPFS, Hugging Face).
- Permissionless
- Any improvement is one Pull Request away. Update pipelines, add datasets, or improve documentation. No API limits, just plain open files.
- Simple
- Static assets, batch jobs.
- Data as Code
- Reproducible datasets with declarative stateless transformations tracked in git. Data is versioned alongside the code. Models are reusable, packaged, and versioned.
- Glue
- Be a bridge between tools and approaches. Follow UNIX philosophy.
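
As a rough illustration of the "Data as Code" principle, here is a minimal sketch of a stateless transformation whose output depends only on its input, so re-running it reproduces the same dataset. The sample data and the names `RAW_CSV`, `transform`, and `fingerprint` are hypothetical and not part of Datadex; the pattern itself may rely on other tools for this step.

```python
import csv
import hashlib
import io

# Hypothetical raw input; in practice this could be an open file referenced from git.
RAW_CSV = "country,year,population\nPT,2020,10300000\nES,2020,47350000\n"


def transform(raw_csv: str) -> str:
    """Stateless transformation: the output depends only on the input text."""
    rows = csv.DictReader(io.StringIO(raw_csv))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["country", "year", "population_millions"])
    # Sort rows so the output does not depend on input ordering.
    for row in sorted(rows, key=lambda r: (r["country"], r["year"])):
        millions = int(row["population"]) / 1e6
        writer.writerow([row["country"], row["year"], f"{millions:.2f}"])
    return out.getvalue()


def fingerprint(text: str) -> str:
    """Content hash that can be committed alongside the code to detect drift."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    result = transform(RAW_CSV)
    print(result)
    print(fingerprint(result))
```

Because the function keeps no state and reads nothing but its argument, anyone can check out the same commit, run it on the same open file, and compare fingerprints.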