Datadex
The Open Data Platform for your Community's Data.
Datadex is a fully open-source, serverless, and local-first Data
Platform that improves how communities collaborate on Open Data. Datadex
is not a new tool; it is a pattern that shows an opinionated way to bridge
existing ones.
Principles
- Open: Code, standards, infrastructure, and data are
all public and open source. The pattern relies on open source tools,
open standards, public infrastructure, and accessible data formats.
- Modular and Interoperable: Components of the pattern are easy to
replace, extend, or remove. It runs in flexible environments (your
laptop, a cluster, or the browser) and deploys to different targets
(S3 + GH Pages, IPFS, Hugging Face).
- Permissionless: Any improvement is one Pull Request
away. Update pipelines, add datasets, or improve documentation. When
consuming, there are no API limits, just plain files.
- Data as Code: Reproducible datasets with
declarative, stateless transformations tracked in git.
Data is versioned alongside the code. Models are reusable, packaged,
and versioned.
- Glue: Be a bridge between tools and approaches.
For example, apply good software engineering practices such as types,
tests, and materialized views.
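To make the "Data as Code" and "Glue" principles concrete, here is a minimal sketch of a stateless, typed transformation that turns one plain file into another. It is not taken from the Datadex codebase: the function names (`transform`, `fingerprint`), the column names (`country`, `value`), and the sample rows are all hypothetical, and only the Python standard library is used.

```python
import csv
import hashlib
import io


def transform(raw_csv: str) -> str:
    """Pure function: the same input text always yields the same output text.

    Hypothetical cleaning step: normalize country codes, drop rows with an
    empty value, and sort rows so the output does not depend on input order.
    """
    reader = csv.DictReader(io.StringIO(raw_csv))
    rows = [
        {"country": r["country"].strip().upper(), "value": float(r["value"])}
        for r in reader
        if r["value"].strip() != ""
    ]
    rows.sort(key=lambda r: r["country"])

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=["country", "value"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


def fingerprint(text: str) -> str:
    """Content hash that can be compared across runs to check reproducibility."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


# Made-up sample input standing in for a raw dataset file.
RAW = "country,value\n es ,1.5\nfr,\nde,2\n"
CLEAN = transform(RAW)

# A test that lives next to the transformation and is versioned with it.
assert CLEAN == "country,value\nDE,2.0\nES,1.5\n"
assert fingerprint(CLEAN) == fingerprint(transform(RAW))
```

Because the transformation keeps no state and its output is a plain file, anyone can rerun it from the same commit and compare fingerprints, and consumers only need to read the resulting file.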