Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web

Python Submitted 12 July 2021Published 16 October 2021
Review

Editor: @sbenthall (all papers)
Reviewers: @reality (all reviews), @rlskoeser (all reviews)

Authors

Albert Weichselbraun (0000-0001-6399-045X)

Citation

Weichselbraun, A., (2021). Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web. Journal of Open Source Software, 6(66), 3557, https://doi.org/10.21105/joss.03557

@article{Weichselbraun2021, doi = {10.21105/joss.03557}, url = {https://doi.org/10.21105/joss.03557}, year = {2021}, publisher = {The Open Journal}, volume = {6}, number = {66}, pages = {3557}, author = {Albert Weichselbraun}, title = {Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web}, journal = {Journal of Open Source Software} }
Copy citation string · Copy BibTeX  
Tags

web mining knowledge extraction text conversion gold standard creation annotated text output

Altmetrics
Markdown badge

 

License

Authors of JOSS papers retain copyright.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License

Table of Contents
Public user content licensed CC BY 4.0 unless otherwise specified.
ISSN 2475-9066