PDF to HTML or Text conversion using Apache Tika. Also generate PDF thumbnail using Apache PDFBox.
github.com/shebinleo/pdf2html
shebinleo/pdf2html