Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web

Python · Submitted 12 July 2021 · Published 16 October 2021

Editor: @sbenthall
Reviewers: @reality, @rlskoeser

Authors

Albert Weichselbraun (0000-0001-6399-045X)

Citation

Weichselbraun, A., (2021). Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web. Journal of Open Source Software, 6(66), 3557, https://doi.org/10.21105/joss.03557

Tags

web mining, knowledge extraction, text conversion, gold standard creation, annotated text output

License

Authors of JOSS papers retain copyright.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Public user content licensed CC BY 4.0 unless otherwise specified.
ISSN 2475-9066