FreeInferenceHarvard SEAS
Status

Free LLM Inference for Research

OpenAI-compatible API powered by frontier open models. Built at Harvard SEAS.

Why freeinference.org

Everything you need to build and ship LLM-powered applications.

Free to use

No credit card required. Generous quota for research and prototyping.

Drop-in OpenAI replacement

Point your existing OpenAI client at our base_url. No code changes required.

Frontier models

GLM, Minimax, Qwen, Kimi, and Anthropic models — all behind a single unified API.

Streaming and tool calls

Server-sent streaming, tool calls, and structured output supported end-to-end.

Live usage and keys

Track token usage, manage API keys, and monitor quotas from your dashboard.

Get started in three steps

  1. 1

    Sign up

    Create a free account with your email — no credit card needed.

  2. 2

    Create an API key

    Generate a key from your dashboard in one click.

  3. 3

    Call the API

    Use any OpenAI-compatible client. Just change the base URL.

One curl away

Use the same OpenAI client libraries you already know.

bash
curl https://freeinference.org/v1/chat/completions \
  -H "Authorization: Bearer $FREEINFERENCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-4.7",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Sponsors

NVIDIA logoHarvard SEAS logo

Service is provided without guarantee.

All prompts and responses are logged for research purposes.