a non-commercial, fair-use subset of the penn-treebank, in JSON
github.com/nlp-compromise/penn-treebank
nlp-compromise/penn-treebank