Nimblesite Docs

Nimblesite is Agents as a Service. You POST a JSON config once and get back a stable HTTP endpoint that behaves like a stateful agent — with memory, tools, multi-tenancy, and your choice of model — all already wired up.

What problem this solves

Raw LLM APIs are stateless. To turn one into a real assistant, you have to:

Store every user and assistant message in your database
Re-send the entire history on every call
Parse the model's tool-call responses and dispatch them
Loop until the model returns a final answer
Repeat all of the above per tenant, per conversation, per agent variant

That stack is the same in every AI product on Earth. Nimblesite runs it for you.

The whole product in two HTTP calls

# 1. Define an agent (once)
curl -X POST https://api.nimblesite.dev/api/v1/configs \
  -H "X-API-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Site Editor",
    "system_prompt": "You edit websites.",
    "model_config": {"provider": "anthropic", "model": "claude-sonnet-4-6"},
    "tools_config": ["read_file", "write_file"]
  }'

# 2. Talk to it (forever)
curl -X POST https://api.nimblesite.dev/api/v1/chat/$CONFIG_ID \
  -H "X-API-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "message": "Update my phone number"
  }'

That's the entire integration.

Two ways to run tools

Pick per agent config:

Client-side tools — agent emits tool calls, your app runs them. Same shape as OpenAI function calling or Anthropic tool use, but stateful.
Sandboxed agents — we provision a managed sandbox (filesystem, shell, build pipeline) and run the tools inside it. Same idea as Code Interpreter or E2B, but with persistent memory and multi-tenancy.

Read Agent execution modes for the full picture.

Where to next

Quickstart — From zero to a working agent in under 10 minutes.
Agent configs — How to define an agent with one JSON POST.
Agent execution modes — Client-side tools vs sandboxed agents.
Chat & conversations — How conversation memory works.
Tools — How tools are declared, surfaced, and executed.
Models — Switch providers with a JSON edit.
Multi-tenancy — Hard isolation, built in.