Apache Tika bridge. Text extraction, metadata extraction, mimetype detection and language detection.
github.com/ICIJ/node-tika
ICIJ/node-tika