Open
Labels
t-tooling: Issues with this label are in the ownership of the tooling team.
Description
Summary
Our current documentation guides cover traditional web scraping libraries (BeautifulSoup, HTTPX, Playwright, Selenium, Scrapy, etc.), but we're missing guides for modern LLM-based web scraping frameworks that are becoming increasingly popular.
We should add new guides showing how to use these frameworks with the Apify SDK.
Guides to add
- Crawl4AI – LLM-friendly web crawler and scraper with built-in support for structured extraction
- Scrapling – high-performance, adaptive web scraping library with intelligent content extraction
- Browser Use – AI agent framework for browser automation using LLMs
- ScrapeGraphAI – LLM-powered scraping pipelines using graph-based logic
Notes
- Each guide should follow the same structure and conventions as the existing guides (e.g. BeautifulSoup, Scrapy)
- Include a practical example demonstrating integration with the Apify SDK (Actor input, storage, etc.)
- Related issue for actor templates: "Add Python actor templates for LLM-based web scraping frameworks" (actor-templates#739)
🤖 Generated with Claude Code