Search results

448 packages found

A specification compliant robots.txt parser with wildcard (*) matching support.

published version 3.0.1, 2 years ago85 dependents licensed under $MIT
5,475,967

Very straightforward, event driven web crawler. Features a flexible queue interface and a basic cache mechanism with extensible backend.

published version 1.1.9, 5 years ago77 dependents licensed under $BSD-2-Clause
194,182

ECMAScript parser that produces a Shift format AST

published version 8.0.0, 3 years ago30 dependents licensed under $Apache-2.0
123,865

JavaScript module detecting bots/crawlers/spiders via user-agent

published version 1.2.0, 6 years ago11 dependents licensed under $MIT
88,712

A tiny node module to detect spiders/crawlers quickly and comes with optional middleware for ExpressJS

published version 2.1.0, 10 months ago6 dependents licensed under $MIT
20,108

This is an ES6 adaptation of the original PHP library CrawlerDetect, this library will help you detect bots/crawlers/spiders vie the useragent.

published version 4.0.2, 2 months ago7 dependents licensed under $MIT
28,380

Crawler is a ready-to-use web spider that works with proxies, asynchrony, rate limit, configurable request pools, jQuery, and HTTP/2 support.

published version 2.0.2, a year ago123 dependents licensed under $MIT
17,765

Get a list of local URL links from a root URL.

published version 3.0.0, 4 years ago1 dependents licensed under $MIT
11,444

A lightweight robots.txt parser for Node.js with support for wildcards, caching and promises.

published version 2.0.3, 3 years ago5 dependents licensed under $MIT
8,025

Isomorphic Javascript SDK for Spider Cloud services

published version 0.1.38, 6 days ago0 dependents licensed under $MIT
6,558

A tiny node module to detect spiders/crawlers quickly and comes with optional middleware for ExpressJS

published version 2.0.3, a year ago0 dependents licensed under $MIT
5,483

Node.js SDK for Crawlab

published version 0.6.0-12, 3 years ago0 dependents licensed under $BSD-3-Clause
6,251

Parses the wget spider output into an object

published version 2.0.0, 9 years ago0 dependents licensed under $MIT
4,023

A high-performance charting library.

published version 7.1.1, 2 months ago2 dependents licensed under $SEE LICENSE IN LICENSE
3,971

Generic web crawler powered by Node.js

published version 1.4.1, 9 years ago5 dependents licensed under $BSD-2-Clause
868

Crawler (spider) of site web pages by domain name

published version 1.2.3, 4 years ago0 dependents licensed under $MIT
772

A web crawler for Nodejs.

published version 0.8.2, 11 years ago1 dependents licensed under $MIT
826

Ananse is a lightweight NodeJs framework with batteries included for building efficient, scalable and maintainable USSD applications.

published version 1.9.11, a month ago0 dependents licensed under $MIT
654

Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously

published version 0.8.0, 9 years ago6 dependents licensed under $ISC
628

a reliable web crawling & scraping framework for Node.js.

published version 1.12.4, 3 months ago0 dependents licensed under $GPL-3.0
565
OSZAR »