TypeScript · apify/crawlee

crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

93.2/100
Stars: 22.1K · Forks: 1.2K

Similar Projects

firecrawl

92

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript · 89.6K stars

maxun

91

🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥

TypeScript · 15.2K stars

openbrowser

75

Let AI agents browse the web. An autonomous toolkit for browser-based AI agents.

TypeScript · 9.0K stars

crawlee-python

91

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

Python · 8.3K stars