aioscraper

High-performance asynchronous Python framework for large-scale API data collection.

Warning

Beta status: APIs and behavior may change, so pin versions and expect occasional breakage while things stabilize.

What is aioscraper?

aioscraper is an async Python framework for collecting data from APIs and external services at scale.

Built for:

NOT built for:

Think: “I need to fetch data from 10,000 product API endpoints” or “I need to poll 50 microservices every minute” → aioscraper is for you.

Async-first core with pluggable HTTP backends (aiohttp/httpx) and aiojobs scheduling
Declarative flow: requests → callbacks → pipelines, with middleware hooks at each stage
Priority queueing plus configurable concurrency limits per group
Adaptive rate limiting (EWMA + AIMD) that automatically backs off when a server is overloaded
Small, explicit API that is easy to test and compose with existing async applications
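
To make the EWMA + AIMD idea above concrete, here is a minimal, standalone sketch of the general technique. It is not aioscraper's implementation or API: the class name `AdaptiveLimiter`, its fields, and the default values are all hypothetical and chosen only for illustration. It smooths observed latency with an exponentially weighted moving average (EWMA), then applies additive increase / multiplicative decrease (AIMD) to a concurrency limit.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AdaptiveLimiter:
    """Hypothetical AIMD limit controller driven by an EWMA of latency."""

    limit: float = 10.0             # current concurrency limit
    min_limit: float = 1.0          # never drop below this
    max_limit: float = 100.0        # never grow above this
    alpha: float = 0.2              # EWMA smoothing factor (0 < alpha <= 1)
    increase_step: float = 1.0      # additive increase per healthy response
    decrease_factor: float = 0.5    # multiplicative decrease on overload
    latency_threshold: float = 1.0  # seconds; smoothed latency above this means overload
    ewma_latency: Optional[float] = None

    def record(self, latency: float, overloaded: bool = False) -> float:
        """Feed one response observation and return the updated limit."""
        # EWMA: blend the new sample into the running average.
        if self.ewma_latency is None:
            self.ewma_latency = latency
        else:
            self.ewma_latency = (
                self.alpha * latency + (1 - self.alpha) * self.ewma_latency
            )

        # AIMD: cut the limit sharply on overload, otherwise grow it slowly.
        if overloaded or self.ewma_latency > self.latency_threshold:
            self.limit = max(self.min_limit, self.limit * self.decrease_factor)
        else:
            self.limit = min(self.max_limit, self.limit + self.increase_step)
        return self.limit


limiter = AdaptiveLimiter()
limiter.record(0.1)                   # healthy response: limit grows 10 -> 11
limiter.record(0.1)                   # healthy response: limit grows 11 -> 12
limiter.record(0.2, overloaded=True)  # e.g. an HTTP 429: limit halves 12 -> 6
```

In this sketch, `overloaded` stands in for whatever signal the caller treats as server pushback (for example an HTTP 429 or 503 response). The asymmetry is the point of AIMD: recovery is gradual, while backing off is immediate.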
