Automatically extracts structured information from webpages
github.com/indix/web-auto-extractor
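
As a rough illustration of what extracting structured information from a webpage can involve, the sketch below pulls JSON-LD blocks out of an HTML string using only the Python standard library. It is not the web-auto-extractor API. The choice of JSON-LD as the format and the names `JsonLdExtractor` and `extract_jsonld` are assumptions made for this example.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Hypothetical helper: collects parsed <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False   # True while inside a JSON-LD script tag
        self._buffer = []         # text chunks of the current script body
        self.items = []           # successfully parsed JSON-LD documents

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks instead of failing the whole page


def extract_jsonld(html):
    """Return a list of JSON-LD documents embedded in the given HTML string."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


if __name__ == "__main__":
    page = (
        '<html><head><script type="application/ld+json">'
        '{"@type": "Product", "name": "Widget"}'
        "</script></head><body></body></html>"
    )
    print(extract_jsonld(page))
```

A fuller extractor would presumably also handle other embedded formats such as Microdata and RDFa, which this sketch leaves out.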