url-metadata
Request a url and scrape the metadata from its HTML using Node.js or the browser.
5.0+ Roadmap
- [ ] basic PDF support
  - [ ] /Info fields
  - [ ] XMP fields
- [X] use `node-fetch` v2 for now to get SSRF protection support
- [X] README: clearly demarcate browser vs. node-only options
- [X] request headers:
  - [X] check our default vs. custom headers against https://www.npmjs.com/package/node-fetch/v/2.7.0#default-headers
  - [X] use `options.compress = true`; expose this node-fetch option to our users
  - [X] set custom `User-Agent` and keep `From`; otherwise some hosts in the test suite respond with status 400+
- [X] expose node-fetch's max `size` option to our users
- [X] pass `useAgent` through to node-fetch
- [X] remove `decode` from request obj in `main.js`
- [X] handle all errors properly, including cleanup of fetch requests
- [X] split index.js into index.js & browser.js; test in both /example-* dirs with the package.json "browser" field
- [X] add `agent` option
- [X] issue #97: prevent SSRF attacks
- [X] issue #97: prevent infinite redirect loops
- [X] fix issue #90: temporarily remove itemprop meta support
- [X] fix issue #90: implement itemprop support correctly for meta tags in head
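
Several of the completed items above (custom request headers, `compress`, `size`, `agent`, and a cap on redirects) come down to building an options object for node-fetch v2. The sketch below is one way that merge could look. It is not taken from this package's source: `buildFetchOptions`, `requestHeaders`, `maxRedirects`, and the default header values are hypothetical names and placeholders chosen for illustration. Only the returned keys (`method`, `headers`, `compress`, `size`, `agent`, `redirect`, `follow`) are documented node-fetch v2 options.

```javascript
// Hypothetical helper, not part of this package's actual source.
// It merges caller-supplied options over illustrative defaults and returns
// an object in the shape that node-fetch v2 accepts as its second argument.

// Placeholder defaults; the real package's header values may differ.
const DEFAULT_HEADERS = {
  'User-Agent': 'example-scraper/1.0',
  From: 'example@example.com'
};

function buildFetchOptions(userOptions = {}) {
  const {
    requestHeaders = {}, // assumed option name for custom headers
    compress = true,     // node-fetch v2: support gzip/deflate encoding
    size = 0,            // node-fetch v2: max response body bytes, 0 = no limit
    agent = undefined,   // node-fetch v2: custom http(s).Agent, if any
    maxRedirects = 10    // assumed option name; mapped to `follow` below
  } = userOptions;

  return {
    method: 'GET',
    // Custom headers override the defaults; defaults not overridden are kept.
    headers: { ...DEFAULT_HEADERS, ...requestHeaders },
    compress,
    size,
    agent,
    redirect: 'follow',
    // node-fetch v2 rejects once this redirect count is exceeded,
    // which bounds redirect loops.
    follow: maxRedirects
  };
}

// Usage: override one header and cap the response body at 1 MiB.
const options = buildFetchOptions({
  requestHeaders: { 'User-Agent': 'my-bot/2.0' },
  size: 1024 * 1024
});
console.log(options);
```

The resulting object could then be passed as `fetch(url, options)` with node-fetch v2. An SSRF-filtering `http.Agent`, if one is used, would be supplied through the `agent` key.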