buffer utf8 characters that would otherwise span chunk boundaries
github.com/substack/utf8-stream
substack/utf8-stream