Feed: Journal of Open Source Software (https://joss.theoj.org)
Feed ID: tag:joss.theoj.org,2005:/papers/tagged/gold%20standard%20creation
Feed updated: 2021-10-16T07:59:11Z

Entry ID: tag:joss.theoj.org,2005:Paper/2841
Title: Inscriptis - A Python-based HTML to text conversion library optimized for knowledge extraction from the Web
Author: Albert Weichselbraun
Affiliation: Swiss Institute for Information Science, University of Applied Sciences of the Grisons, Pulvermühlestrasse 57, Chur, Switzerland
ORCID: 0000-0001-6399-045X
State: accepted
Software version: v2.0.0
Submitted: 2021-07-12 09:20:23 UTC
Published: 2021-10-16 07:59:11 UTC
Entry updated: 2021-10-17T00:03:10Z
Volume: 6
Issue: 66
Year: 2021
Page: 3557
DOI: 10.21105/joss.03557
Archive DOI: https://doi.org/10.5281/zenodo.5562417
Languages: Python
PDF: https://joss.theoj.org/papers/10.21105/joss.03557.pdf
Tags: web mining, knowledge extraction, text conversion, gold standard creation, annotated text output