htmldate: A Python package to extract publication dates from web pages

Python Submitted 17 June 2020Published 30 July 2020
Review

Editor: @danielskatz (all papers)
Reviewers: @geoffbacon (all reviews), @proycon (all reviews)

Authors

Adrien Barbaresi (0000-0002-8079-8694)

Citation

Barbaresi, A., (2020). htmldate: A Python package to extract publication dates from web pages. Journal of Open Source Software, 5(51), 2439, https://doi.org/10.21105/joss.02439

@article{Barbaresi2020, doi = {10.21105/joss.02439}, url = {https://doi.org/10.21105/joss.02439}, year = {2020}, publisher = {The Open Journal}, volume = {5}, number = {51}, pages = {2439}, author = {Adrien Barbaresi}, title = {htmldate: A Python package to extract publication dates from web pages}, journal = {Journal of Open Source Software} }
Copy citation string · Copy BibTeX  
Tags

metadata extraction date parsing web scraping natural language processing

Altmetrics
Markdown badge

 

License

Authors of JOSS papers retain copyright.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License

Table of Contents
Public user content licensed CC BY 4.0 unless otherwise specified.
ISSN 2475-9066