Fast, Consistent Tokenization of Natural Language Text

R C++ Submitted 27 March 2018Published 28 March 2018
Review

Editor: @arfon (all papers)
Reviewers: @arfon (all reviews)

Authors

Lincoln A. Mullen (0000-0001-5103-6917), Kenneth Benoit (0000-0002-0797-564X), Os Keyes (0000-0001-5196-609X), Dmitry Selivanov, Jeffrey Arnold (0000-0001-9953-3904)

Citation

Mullen et al., (2018). Fast, Consistent Tokenization of Natural Language Text. Journal of Open Source Software, 3(23), 655, https://doi.org/10.21105/joss.00655

Copy citation string · Copy BibTeX  
Tags

text mining tokenization natural language processing

Altmetrics
Markdown badge

 

License

Authors of JOSS papers retain copyright.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License

Public user content licensed CC BY 4.0 unless otherwise specified.
ISSN 2475-9066