Free LLM Inference for Research
OpenAI-compatible API powered by frontier open models. Built at Harvard SEAS.
Why freeinference.org
Everything you need to build and ship LLM-powered applications.
Free to use
No credit card required. Generous quota for research and prototyping.
Drop-in OpenAI replacement
Point your existing OpenAI client at our base_url. No code changes required.
Frontier models
GLM, Minimax, Qwen, Kimi, and Anthropic models — all behind a single unified API.
Streaming and tool calls
Server-sent streaming, tool calls, and structured output supported end-to-end.
Live usage and keys
Track token usage, manage API keys, and monitor quotas from your dashboard.
Get started in three steps
- 1
Sign up
Create a free account with your email — no credit card needed.
- 2
Create an API key
Generate a key from your dashboard in one click.
- 3
Call the API
Use any OpenAI-compatible client. Just change the base URL.
One curl away
Use the same OpenAI client libraries you already know.
curl https://freeinference.org/v1/chat/completions \
-H "Authorization: Bearer $FREEINFERENCE_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "glm-4.7",
"messages": [{"role": "user", "content": "Hello!"}]
}'Sponsors