Fast, Consistent Tokenization of Natural Language Text

R, C++
Submitted 27 March 2018
Published 28 March 2018

Editor: @arfon
Reviewers: @arfon

Authors

Lincoln A. Mullen (0000-0001-5103-6917), Kenneth Benoit (0000-0002-0797-564X), Os Keyes (0000-0001-5196-609X), Dmitry Selivanov, Jeffrey Arnold (0000-0001-9953-3904)

Citation

Mullen et al., (2018). Fast, Consistent Tokenization of Natural Language Text. Journal of Open Source Software, 3(23), 655, https://doi.org/10.21105/joss.00655

@article{Mullen2018,
  doi = {10.21105/joss.00655},
  url = {https://doi.org/10.21105/joss.00655},
  year = {2018},
  publisher = {The Open Journal},
  volume = {3},
  number = {23},
  pages = {655},
  author = {Lincoln A. Mullen and Kenneth Benoit and Os Keyes and Dmitry Selivanov and Jeffrey Arnold},
  title = {Fast, Consistent Tokenization of Natural Language Text},
  journal = {Journal of Open Source Software}
}
Tags

text mining, tokenization, natural language processing

License

Authors of JOSS papers retain copyright.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Public user content licensed CC BY 4.0 unless otherwise specified.
ISSN 2475-9066