Parsr (GitHub Repo)
Parsr is a document parsing and extraction tool that generates usable data for data scientists and developers. It can perform document hierarchy regression, page number detection, whitespace removal, link detection, and more. It takes an image or PDF as input and outputs JSON, Markdown, text, CSV, or PDF.