The ability of a system to handle growing amounts of load or traffic by scaling horizontally or vertically.
Store frequently accessed data in-memory to reduce latency. Tools: Redis, Memcached.
Distribute requests across servers to ensure no server is overwhelmed. Ex: NGINX, HAProxy.
Splitting large databases into smaller, faster, and more manageable parts called shards.
Controls how many requests a user can make. Prevents abuse and DDoS attacks.
if (requests > limit) return 429;