API Claw - Smart Hardware & Third-Party Developer Integration

Use Buda's OpenAPI to give your hardware or software product cloud-based AI Agent capabilities, no inference infrastructure required

Smartwatches, IoT devices, chat plugins - they all want built-in AI Agents. But running a large model on-device isn't viable: not enough compute, too much power draw, poor results.

API Claw is built for this. With a few API calls to Buda's OpenAPI, third-party developers can give their products a cloud-based OpenClaw AI Agent with conversation capability and a private knowledge base, waking on demand and consuming zero resources at rest.

In practice, you are not integrating a raw model API. You are integrating a managed Agent capability layer that already includes model access, runtime, knowledge base, session management, and operational scaling.

Who it's for

Smart hardware makers - watches, earbuds, IoT devices that want a built-in AI assistant
Software developers - mini-programs, apps, browser extensions that want to embed AI chat
SaaS providers - who want to offer customers an "AI-powered private space"

Two common buyer types

1. Chat apps, SaaS tools, and third-party software

If you already have a product with users, conversations, forms, or workflows, API Claw lets you add an AI Agent without building the whole backend stack yourself.

Examples:

A chat app that wants to add an AI assistant into every conversation
A customer service SaaS that wants each customer to have a private AI support agent
A browser extension or internal enterprise tool that needs a cloud-based AI worker

2. Hardware manufacturers

If you are building a smartwatch, earbuds, a voice terminal, or another connected device, API Claw lets the device connect to a cloud AI brain instead of trying to run the intelligence locally.

That means:

The device handles input and output
The Agent runs in the cloud
Upgrades happen server-side instead of through constant firmware complexity

How it works

Your product (hardware / software)
        ↓  API call
  Buda OpenAPI
        ↓
  Space (tenant)
        ↓
  Agent (AI assistant) + Drive (knowledge base)
        ↓
  Chat Session (conversation)

Each end user gets an independent Chat Session with isolated context. The Agent sleeps when idle and wakes automatically when a message arrives.

What you do not need to build yourself

With API Claw, the developer does not need to separately build or operate:

Model configuration and provider switching
Inference machines or GPU infrastructure
Agent runtime and tool sandboxing
File ingestion and knowledge base processing
Session and context management
Tenant isolation for different users or customers
Wake/sleep orchestration for idle agents

This is the key value: your team can focus on the product experience, while Buda provides the Agent infrastructure layer behind it.

Why not just call an LLM API directly?

Calling a model API gives you model output. API Claw gives you an operational AI Agent.

	Raw model API	API Claw
Output	Text or structured completion	Agent reply with runtime context
Sessions	You manage them	Built-in session model
Knowledge base	You build retrieval	Drive-based knowledge included
Runtime	You build orchestration	Managed Agent runtime
Multi-tenant support	You design it	Space-based isolation built in
Ops	You run it	Managed by Buda

If you are a product company, this difference matters more than model quality alone.

API Claw vs OpenClaw

OpenClaw is the Agent capability and runtime layer. API Claw is the productized API surface that lets external hardware and software connect to that capability.

In simple terms:

OpenClaw = the underlying Agent runtime / capability model
API Claw = the developer-facing integration layer built on top of it

API surface at a glance

API Claw is easiest to understand as five cooperating API groups:

API group	Main endpoints	What it controls	Use it when
Space API	`/api/v1/spaces`	Tenant boundary, customer boundary, or device fleet boundary	You need isolation between customers, teams, devices, or paid accounts
Agent API	`/api/v1/spaces/{spaceId}/agents`	The managed Claw Agent inside a Space	You want each tenant or product line to have its own assistant role and instructions
Drive API	`/api/v1/spaces/{spaceId}/agents/{agentId}/drive/files`	Durable knowledge, manuals, policies, state, and long-lived memory	The Agent should keep improving from files instead of receiving the same giant prompt every time
Chat Session API	`/api/v1/spaces/{spaceId}/agents/{agentId}/chat-sessions`, `/api/v1/agent-sessions/{sessionId}`	User conversations, async Agent runs, status, and messages	Your backend can safely hold the main API Key
Embed API	`/api/v1/spaces/{spaceId}/agents/{agentId}/embed-urls`, `/api/v1/spaces/{spaceId}/agents/{agentId}/embed-sessions`, `/api/v1/embed/chat-sessions/...`	Short-lived iframe URL or frontend-safe chat access	A browser, mobile app, mini program, or extension needs to talk to Buda without seeing your main API Key

API Claw has its own hosted iframe surface. Your backend creates a short-lived embed URL:

POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-urls
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "externalUserId": "customer-123",
  "displayName": "Kelly",
  "ttlSeconds": 3600,
  "mode": "chat"
}

The response includes:

{
  "embedUrl": "https://buda.im/embed/api-claw/tsk_abc123#token=SHORT_LIVED_TOKEN",
  "sessionId": "tsk_abc123",
  "expiresAt": "2026-05-26T12:00:00.000Z"
}

Then iframe the returned embedUrl:

<iframe
  src="https://buda.im/embed/api-claw/tsk_abc123#token=SHORT_LIVED_TOKEN"
  title="API Claw Agent"
  allow="microphone"
  style="width: 400px; height: 600px; border: 0;"
></iframe>

The token lives in the URL hash fragment so it is not sent to the server as part of the initial page request. The page reads the token in the browser and uses it only for /api/v1/embed/... JSON calls. When the token expires, create a fresh embed URL from your backend.

How application scenarios combine the APIs

Most products use the same lifecycle: provision, seed, run, read, learn, and update.

Scenario	Space strategy	Agent strategy	Drive strategy	Chat strategy	Embed strategy
Smart hardware fleet	One Space per customer, fleet, or premium device group	One Agent per device model or assistant role	Manuals, firmware notes, troubleshooting guides, device state snapshots	Device backend starts sessions when the user speaks or a device event occurs	Native app or mini program uses embed tokens; websites use API Claw embed URLs
WeChat mini program	One Space per merchant, course, customer, or end-user account	One Agent per business function, such as support, tutor, or shopping assistant	FAQs, product catalog notes, policies, lesson plans, user progress	Your backend mints an embed session for the user's `openid`	Mini program calls `/api/v1/embed/chat-sessions/{sessionId}` with the short-lived token
SaaS copilot	One Space per customer tenant	One Agent per workflow, such as onboarding, reporting, or support	Tenant docs, integration settings, playbooks, historical summaries	SaaS backend starts sessions and stores returned session IDs	Usually server-to-server; iframe only if you want a Buda-hosted chat surface
Customer support widget	One Space per customer company or brand	One support Agent per brand or queue	Support articles, refund policy, product docs, escalation rules	Your backend creates one embed URL per visitor or account	Use API Claw embed URLs for hosted iframe chat
Internal enterprise Agent	One Space per department or project	Multiple Agents for ops, sales, research, or engineering	Internal SOPs, meeting notes, runbooks, shared decisions	Internal tools start sessions from buttons, forms, or scheduled jobs	Use server-to-server for tools; use API Claw embed URLs for a quick internal chat page

Rule of thumb:

Put durable knowledge in Drive.
Put transient user input in a Chat Session.
Put customer or device isolation in Spaces.
Put role, behavior, and runtime identity in Agents.
Use Embed API only when the caller is a frontend that cannot safely hold your main API Key.

Integration steps

Register a developer account and get an API Key

Create a Space (tenant)

Each of your customers (or each device) maps to one Space.

POST /api/v1/spaces
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "name": "Acme Device Fleet",
  "slug": "acme-device-fleet"
}

Save the returned spaceId.

Create an Agent inside the Space

POST /api/v1/spaces/{spaceId}/agents
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "name": "Watch Assistant",
  "emoji": "AI",
  "instructions": "You are a smartwatch assistant. Use Drive files as source of truth. Keep replies short enough for a small screen."
}

Save the returned agentId and driveId. Buda also writes the instructions into the Agent's Drive as durable context, so the Agent can keep evolving from files instead of relying only on one prompt.

Add durable knowledge to Drive

If your Agent needs to answer based on product manuals, FAQs, policies, or device-specific state, write those files into its Drive:

PUT /api/v1/spaces/{spaceId}/agents/{agentId}/drive/files
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "path": "manuals/reset-device.md",
  "mimeType": "text/markdown",
  "content": "# Resetting the device\n\nHold the crown for 8 seconds, then confirm on screen."
}

Files persist across sessions. Updating a file updates what the Agent can use on the next run.

Start a conversation

POST /api/v1/spaces/{spaceId}/agents/{agentId}/chat-sessions
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "message": "How do I reset my device?",
  "mode": "chat"
}

Buda returns a session.id and starts the Agent run asynchronously. Store that session.id if you want to continue the same user's conversation later.

Poll the session result

GET /api/v1/agent-sessions/{sessionId}
Authorization: Bearer <your-api-key>

The response includes the session status and messages. Poll until the status becomes completed, waiting_for_input, failed, or cancelled.

OpenAPI endpoints for API Claw

These endpoints are available in the OpenAPI JSON at /api/v1/openapi.json and the interactive Swagger UI at /api/v1/doc.

Endpoint	Purpose
`GET /api/v1/spaces`	List Spaces owned by the API key user
`POST /api/v1/spaces`	Create a tenant Space
`GET /api/v1/spaces/{spaceId}/agents`	List Claw Agents in a Space
`POST /api/v1/spaces/{spaceId}/agents`	Create a managed Claw Agent with its own Drive
`GET /api/v1/spaces/{spaceId}/agents/{agentId}/drive/files`	Browse the Agent's durable Drive
`PUT /api/v1/spaces/{spaceId}/agents/{agentId}/drive/files`	Create or overwrite a Drive file
`POST /api/v1/spaces/{spaceId}/agents/{agentId}/chat-sessions`	Send a message and start or resume an Agent session
`GET /api/v1/agent-sessions`	List Agent Sessions visible to the API key user
`GET /api/v1/agent-sessions/{sessionId}`	Read session status and messages
`POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-urls`	Create a short-lived hosted iframe URL for API Claw
`POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-sessions`	Mint a short-lived frontend-safe embed token for native UI
`POST /api/v1/embed/chat-sessions/{sessionId}/messages`	Send a frontend message using an embed token
`GET /api/v1/embed/chat-sessions/{sessionId}`	Poll frontend-visible session status and messages using an embed token

Which URL should I iframe?

For API Claw, your backend should call this OpenAPI endpoint first:

POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-urls

It returns an embedUrl like:

https://buda.im/embed/api-claw/{sessionId}#token={shortLivedToken}

That exact returned URL is the iframe src.

Use this when:

You want a ready-made Buda-hosted API Claw chat UI.
You need a link that expires on a schedule.
You are embedding into a website, admin panel, customer portal, or partner console.
Your product backend can refresh the URL before or after expiration.

Do not iframe /api/v1/embed/.... Those routes are JSON APIs used by the hosted iframe page and by native frontends. For WeChat mini programs, mobile apps, and browser extensions, build your own chat UI and call:

POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-sessions
POST /api/v1/embed/chat-sessions/{sessionId}/messages
GET  /api/v1/embed/chat-sessions/{sessionId}

That gives the frontend a scoped, short-lived token without exposing your main Buda API Key.

Embedded chat for mini programs

For WeChat mini programs, mobile apps, browser extensions, or any frontend you do not fully control, do not ship your main Buda API Key to the client.

Use the embed flow instead:

Your backend calls Buda with the real API Key and mints a short-lived embed token.
Your mini program stores only that short-lived token.
The mini program sends messages and polls status through /api/v1/embed/....
When the token expires, your backend mints a fresh one.

Backend: mint an embed session

POST /api/v1/spaces/{spaceId}/agents/{agentId}/embed-sessions
Authorization: Bearer <your-api-key>
Content-Type: application/json

{
  "externalUserId": "wechat-openid-oABC123",
  "displayName": "Kelly",
  "ttlSeconds": 3600,
  "mode": "chat",
  "metadata": {
    "miniProgram": "wechat",
    "scene": "device-support"
  }
}

Return only token, sessionId, and the required API URLs to the mini program. For an iframe, call /embed-urls instead and return embedUrl.

Mini program: send a message

POST /api/v1/embed/chat-sessions/{sessionId}/messages
Authorization: Bearer <embed-token>
Content-Type: application/json

{
  "message": "How do I reset my device?",
  "mode": "chat"
}

Mini program: poll for replies

GET /api/v1/embed/chat-sessions/{sessionId}
Authorization: Bearer <embed-token>

This gives mini programs a clean native integration path without iframe constraints, third-party cookie issues, or leaked server credentials.

Minimal Node.js example

const API_BASE = "https://buda.im/api/v1";
const headers = {
  Authorization: `Bearer ${process.env.BUDA_API_KEY}`,
  "Content-Type": "application/json",
};

const space = await fetch(`${API_BASE}/spaces`, {
  method: "POST",
  headers,
  body: JSON.stringify({ name: "Acme Device Fleet" }),
}).then((res) => res.json());

const agent = await fetch(`${API_BASE}/spaces/${space.id}/agents`, {
  method: "POST",
  headers,
  body: JSON.stringify({
    name: "Watch Assistant",
    instructions: "Answer from Drive first. Keep replies concise.",
  }),
}).then((res) => res.json());

await fetch(`${API_BASE}/spaces/${space.id}/agents/${agent.id}/drive/files`, {
  method: "PUT",
  headers,
  body: JSON.stringify({
    path: "manuals/reset-device.md",
    content: "# Resetting\n\nHold the crown for 8 seconds.",
    mimeType: "text/markdown",
  }),
});

const started = await fetch(`${API_BASE}/spaces/${space.id}/agents/${agent.id}/chat-sessions`, {
  method: "POST",
  headers,
  body: JSON.stringify({ message: "How do I reset my device?", mode: "chat" }),
}).then((res) => res.json());

const status = await fetch(`${API_BASE}/agent-sessions/${started.session.id}`, {
  headers: { Authorization: headers.Authorization },
}).then((res) => res.json());

Building a continuously evolving Drive-based Agent

The useful pattern is not "send everything in every prompt." Let your product treat Drive as the Agent's durable brain:

Write product manuals, policies, release notes, and user-specific state into Drive.
Start each conversation with a small runtime message, such as the user's question or latest device event.
Poll the Chat Session for status and messages.
When your product learns something durable, write it back to Drive as a file.
Let the next run inherit that improved context automatically.

This is how a device or third-party app gets an Agent that improves over time: the API call wakes the Agent, the session captures the immediate conversation, and Drive carries forward the durable knowledge.

Billing model

Buda charges per Space, not per end user. You purchase Spaces as the developer; how you charge your end users is entirely up to you.

Typical model:

You buy Spaces from Buda (volume pricing available)
You charge your end users a subscription or activation fee
The margin is yours

Why hardware teams care

For hardware companies, this changes the cost structure:

No need to ship a device powerful enough to run a full AI stack locally
No need to keep a user's laptop or phone acting as the primary runtime
No need to maintain a separate AI backend team just to support one device line

Your hardware can stay lightweight while the AI capability keeps improving in the cloud.

Analogy: ChatGPT vs OpenAI

	OpenAI	Buda
Consumer product	ChatGPT	Buda App
Developer API	OpenAI API	Buda OpenAPI
What developers build	Their own AI products	Complete AI spaces with Agent + Drive

The difference: Buda OpenAPI isn't just model inference — it provides a complete Agent runtime including knowledge base, session management, and tool-calling capabilities.

Why not OpenClaw?

OpenClaw is a great open-source AI Agent project, perfect for individuals who want to self-host an assistant on their own machine. But for hardware or software developers who need to scale, it's the wrong tool.

For a full comparison, see: Buda vs OpenClaw

	OpenClaw	Buda
Purpose	Open-source personal assistant, self-hosted	Commercial enterprise platform, managed service
Infrastructure	Runs on user's local machine	Self-built Kubernetes cluster (Claw Computer), elastic scaling
Gateway	Heavy Gateway layer, single-machine bottleneck	No traditional Gateway, lightweight API layer, horizontal scaling by design
Multi-tenancy	Not supported, each instance is independent	Native multi-tenancy, one API Key manages millions of Spaces
Token management	User configures their own model	Commercial Token management, predictable, controllable costs
Isolation	Shares host OS	Each Agent in its own sandbox, container-level isolation
SLA & support	Community support, no SLA	Enterprise SLA, commercial support
Best for	Personal use, developer tinkering	Hardware at scale, SaaS integration, commercial deployment

In one line: OpenClaw is a tool for personal use. Buda is infrastructure developers sell to their users.

If you need to give 100,000 smartwatches each their own AI assistant, you need Buda, not a separate OpenClaw instance running behind every device.

FAQ

Do end users need a Buda account? No. End users have no visibility into Buda. Your product proxies all interactions through the API.

Does the Agent consume resources when idle? No. The Agent sleeps when there are no active conversations and wakes automatically when a message arrives.

Can I configure different Agent instructions for different users? Yes. Each Space has its own independent Agent configuration. Create separate Spaces and Agents per user as needed.

API Claw - Smart Hardware & Third-Party Developer Integration

On this page