A PDF file parser that converts PDF binaries to text based JSON, powered by porting a fork of PDF.JS to Node.js
github.com/modesty/pdf2json
modesty/pdf2json