Takes in a document, spits out a tokenised and stemmed array of terms.
github.com/waltervascarvalho/stm
waltervascarvalho/stm