Search results
155 packages found
Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously
Real transparent HTTP-Proxy-Server. Upstream your requests whatever you want!
- proxy
- tunnel
- ssl
- http-proxy
- mitm
- pinning
- proxy-authentication
- transparent
- upstream
- server
- squid
- privoxy
- tcp
- intercept
- View more
Dependency free module for scraping and crawling websites using [Crawlbase](https://crawlbase.com) API
- scraping
- crawling
- scraper
- scrape
- crawler
- crawlbase
- scraping-websites
- scraping-framework
- crawlbase-api
- leads
- leads-api
Distributed web crawler powered by Headless Chrome
Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously
Crawler (spider) of site web pages by domain name
A version of goldwasher that runs as a module on AWS Lambda.
🤛🏻 Regular Expression Data Grabber
- typescript
- pattern
- pattern-matching
- regex
- regexp
- regexp-match
- regular-expression
- string
- crawling
- parse
- parser
- scraping
- grab
- grabber
An `URL` parser for crawling purpose.
- crawler-url-parser
- url-parser
- extract-url
- url-parse
- is-parent-url
- is-child-url
- url
- parser
- parse
- crawler
- extract
- extractor
- absolute
- relative
- View more
A library to recursively retrieve and serialize Notion pages with customization for machine learning applications.
Build web scraping agents using AI to auto-extract the data from websites
A set of shared utilities that can be used by crawlers
Lightweight crawler written in TypeScript using ES6 generators.
EasyScrape is a NodeJS module designed to be integrated into your web scraping project. With it, you can more easily get information from the web from a JSON object to organized data, as a REST API could give you!
- EasyScrape
- Module
- Dynamic
- Puppeteer
- Jsdom
- Cheerio
- Static
- Easy
- Scrape
- Powerful
- Simple
- Fast
- Easy to Use
- Embed
- View more
Fast asynchronous NodeJS module for crawling/scraping a web through worker_threads.
Priority based Semantic Web Crawler.
- semantic-crawler
- pdf-crawler
- text-crawler
- priority
- priority-crawler
- scraper
- crawling
- spider
- scraping
- simplecrawler
- crawler
- osmosis
- js-crawler
- supercrawler
- View more
Gracefully handle timeout and network error with auto retry.
- graceful
- retry
- retries
- error
- errors
- handling
- timeout
- ERR_NETWORK
- ERR_CONNECTION
- ERR_SOCKET
- page
- crashed
- goto
- playwright
- View more
Helper to extract confessions from webpages
Extraction of text and related metadata.
Package to find style links from the site you want