/// documentation
Docs
How Wide Area Intelligence works, and how to run it.
[ how it works ]Routing & failover→
How the gateway decides where a request runs — edge cache, your own GPUs, then capability-aware cloud failover. Substitution mode, vision models, cross-provider backups.
[ patterns ]Recipes: multi-model pipelines→
Mix a fast, cheap model and a strong one per step — classify-then-generate, extract-then-write — over one endpoint. Run the cheap step on your own GPUs for free.
[ node ops ]wai CLI reference→
Every command for the node CLI: status, logs, start/stop/restart, update, models-dir, uninstall, plus config and auto-update.