Thanks to some awesome contributors, there is a new release for invoice2data. This Python package allows you to get structured data from PDF invoices. Major enhancements:
- powerful Yaml-based template format for new invoice issuers.
- improved date-parseing thanks to dateparser.
- improved PDF conversion thanks to new feature in xpdf
- better testing and CI
- option to add multiple keywords and regex to each field
- option to define currency and date format (day or month first?)
All details and download on Github.