published Published 4 months ago
Talisman: a JavaScript archive of fuzzy matching, information retrieval and record linkage building blocks
JavaScript
published Published 7 months ago
htmldate: A Python package to extract publication dates from web pages
Python
published Published 8 months ago
ldaPrototype: A method in R to get a Prototype of multiple Latent Dirichlet Allocations
R
published Published about 1 year ago
WordTokenizers.jl: Basic tools for tokenizing natural language in Julia
Julia
published Published about 1 year ago
Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset XML Dataset
Python
published Published about 1 year ago