published Published 22 days ago
KeemenaPreprocessing.jl: Unicode-Robust Cleaning, Multi-Level Tokenisation and Streaming Offset Bundling for Julia NLP
Julia
published Published 9 months ago
cellular_raza: Cellular Agent-based Modeling from a Clean Slate
Rust Python Cuda C
published Published over 1 year ago
harmonize-wq: Standardize, clean and wrangle Water Quality Portal data into more analytic-ready formats
Python
published Published over 2 years ago
CRE: An R package for interpretable discovery and inference of heterogeneous treatment effects
R
published Published over 3 years ago
CleanX: A Python library for data cleaning of large sets of radiology images
Roff Jupyter Notebook Python
published Published over 5 years ago
Akmedoids R package for generating directionally-homogeneous clusters of longitudinal data sets
R
published Published over 5 years ago

