published Published 19 days ago
KeemenaPreprocessing.jl: Unicode-Robust Cleaning, Multi-Level Tokenisation and Streaming Offset Bundling for Julia NLP
Julia
published Published over 5 years ago
Talisman: a JavaScript archive of fuzzy matching, information retrieval and record linkage building blocks
JavaScript
published Published about 6 years ago

