Thanks for your support! Your sponsorship helps fund ongoing maintenance, bug fixes, and new features.
CLI tool to parse document files into Markdown. Supports 18 formats. Office formats (docx, pptx, xlsx) are ZIP archives containing XML. The parsers use Python's stdlib zipfile + xml.etree.ElementTree ...
Ben Butcher is the Data Journalism Editor at The Telegraph, where he leads a team transforming complex data into news stories, personalised tools and analysis. Ben Butcher is the Data Journalism ...