sass-maker module — v1.0

Call 40+ free LLMs
from one OpenAI key

Drop-in OpenAI-compatible gateway across 8 free providers — Groq, Gemini, Cerebras, SambaNova, NVIDIA, OpenRouter, Voyage, and Workers AI. Auto-failover when one provider throttles.

8
Free Providers
40+
Chat Models
$0
Per Request
SSE
Streaming
Free requests routed
$0 in provider API costs
8
Provider fallback chain
Auto-skips throttled or failed providers
$0
vs. GPT-4o mini pricing
Paid APIs charge ~$0.30/1K requests
Capabilities
One endpoint. 8 providers. Auto-failover when one breaks.

Multi-Provider Routing

Routes across all 8 providers — Groq, Gemini, SambaNova, NVIDIA, Cerebras, OpenRouter, Voyage, and Workers AI — with health-aware fallback.

Health-Aware Selection

Tracks success rates, latency, and cooldowns per model. Unhealthy providers are automatically skipped.

Streaming SSE

Full Server-Sent Events streaming for chat completions and responses API. Drop-in OpenAI SDK compatible.

Embeddings

Generate embeddings via Workers AI, Gemini, or Voyage AI. Standard OpenAI embeddings format.

Built-in Analytics

Request logging, per-provider stats, and a live usage dashboard. Monitor everything from /v1/analytics.

Rate Limiting

IP-based rate limiting on public endpoints. Per-provider daily limits with configurable thresholds.

Up and running in one curl

Point your existing OpenAI SDK or curl commands at the gateway. Set model: "auto" and the gateway handles provider selection, retries, and failover.

Heads up: this aggregates free provider tiers — best-effort, no SLA. Per-IP limit is ~10 burst / ~20 rpm, each model has a daily cap, and the gateway returns 503 only when every capable model is rate-limited — rate-limit windows reset within minutes. For production throughput, pin a model and bring your own keys.

Full API reference
terminal openai sdk
# 1. Check which providers are live right now (no key needed) curl https://free-ai-gateway.sarthakagrawal927.workers.dev/v1/routing/status # 2. Send your first chat completion curl https://free-ai-gateway.sarthakagrawal927.workers.dev/v1/chat/completions \ -H "Authorization: Bearer $KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "auto", "messages": [{ "role": "user", "content": "Hello!" }] }' # Embeddings curl https://free-ai-gateway.sarthakagrawal927.workers.dev/v1/embeddings \ -H "Authorization: Bearer $KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "@cf/baai/bge-base-en-v1.5", "input": "Hello world" }'