spiking
Version:
A lightweight web crawler.
25 lines (13 loc) • 980 B
Markdown
## Usage
First select the crawl mode, `Multiple` for multi-line concurrent crawling, `DynamicMultiple` for dynamic web page concurrent crawling.
`URL` Enter the link you want to crawl.
`Local Path` Select local storage path.
`Concurrency` Enter the maximum number of concurrency. (This option is related to the crawl speed and is limited by the performance of the computer.)
`Proxy` Enter the proxy address.
`Chormo Path` Select the Chormo browser executable file.
`Display` Select whether to display the browser window.
`Login` Select whether you want to log in (please select the display first).
If 'Login' is selected, the program will enter the wait and click 'Continue' to begin crawling.
Crawling is done automatically, and when everything is over, you can retry the failed link.
## More
When you choose the `DynamicMultiple` mode, you not only need to enter the correct proxy address in the program, but also need to change the IE proxy.