js-harvester

Harvester is a lightweight, highly optimized JavaScript library for extracting data from the DOM tree. It supports extracting tag text with specified types and attributes. It is tiny, has no dependencies, and also works with Puppeteer.

15 lines (12 loc) 487 B
import { harvestPageAll } from 'js-harvester/playwright.js'
import { open } from './utils.js'

const NEWS_QUERY = '.container_sub_news_list div.article_news_list'
const TPL = `
  div{time}
  div
    a{title}`

const page = await open()
await page.goto('https://www.pravda.com.ua/news/', { waitUntil: 'load' })
await page.waitForSelector(NEWS_QUERY)
const news = await harvestPageAll(page, TPL, NEWS_QUERY, { inject: true, dataOnly: true })
console.log(news, '\nPress Ctrl-C to stop...')
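
The script above logs the raw `news` value. As a rough sketch of what could be done with it next, the helper below turns harvested items into printable lines. It rests on an assumption the source does not confirm: that with `dataOnly: true` each item is a plain object keyed by the template field names, here `time` and `title`. `formatNews` and the sample data are hypothetical and not part of js-harvester.

```javascript
// Hypothetical helper, not part of js-harvester.
// Assumes each harvested item looks like { time, title },
// mirroring the {time} and {title} fields in TPL.
function formatNews(items) {
  return items
    // Skip entries where no title was captured.
    .filter((item) => item && item.title)
    // Join time and title, trimming stray whitespace from the DOM text.
    .map((item) => {
      const time = String(item.time ?? '').trim()
      const title = String(item.title).trim()
      return `${time} ${title}`.trim()
    })
}

// Invented sample standing in for the harvested result.
const sample = [
  { time: '10:15', title: ' First headline ' },
  { time: '10:02', title: 'Second headline' },
  { time: '09:58' }, // no title, so it is dropped
]

console.log(formatNews(sample).join('\n'))
```

In the script itself, the equivalent call would be `console.log(formatNews(news).join('\n'))`, provided the result really has the assumed shape.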