Is RESTai self-hosted and open-source?

Yes. RESTai is fully self-hosted and released under the Apache 2.0 open-source license. Install via PyPI, Docker, or Kubernetes Helm chart. No vendor lock-in, no telemetry phone-home, your data stays on your infrastructure.

RESTai — Open-Source AIaaS Platform · Self-Hosted AI as a Service

Q: What is AIaaS (AI as a Service)?

AIaaS — AI as a Service — is a delivery model in which AI capabilities (LLM inference, RAG, agents, image generation, embeddings) are exposed as managed services consumable via APIs. RESTai is an open-source, self-hosted AIaaS platform: you run the same kind of platform that hosted AI vendors offer, but on your own infrastructure.

Q: Which LLM providers does RESTai support?

OpenAI, Anthropic, Ollama, Google Gemini, Groq, Grok (xAI), LiteLLM, vLLM, Azure OpenAI, AWS Bedrock, and any OpenAI-compatible endpoint. You can mix multiple providers per team.

Why RESTai

The open-source AIaaS platform built to be self-hosted

A complete AI-as-a-Service stack — not just another wrapper. Full Web UI, analytics, security, and enterprise features out of the box, on your infrastructure, under Apache 2.0.

Multi-project platform

RAG with SQL-to-NL and auto-sync, Agents with MCP tools, Block visual logic, App builder, and direct Inference — all in one platform.

Full Web UI + analytics

React dashboard with token tracking, cost analytics, latency monitoring, per-project usage charts, and model fleet view. Not just an API.

Any LLM provider

OpenAI, Anthropic, Ollama, Gemini, Groq, Grok, LiteLLM, vLLM, Azure, AWS Bedrock, and any OpenAI-compatible endpoint.

Enterprise security

Teams, RBAC, OAuth/LDAP/OIDC, TOTP 2FA, input/output guardrails, audit logging, per-project rate limits, and budget caps.

Extensible tools (MCP)

Model Context Protocol support for unlimited agent integrations. Connect any MCP server via HTTP/SSE or stdio.

White-label branding

Custom logos, colors, and app names per team. Built-in knowledge sync from S3, Confluence, SharePoint, and Google Drive.

Analytics

Dashboard & Analytics

Track token usage, costs, latency, and project activity from a centralised dashboard. Daily charts for tokens, costs, and response latency per project — identify performance regressions at a glance.

Knowledge

RAG — Retrieval-Augmented Generation

Upload documents and query them with LLM-powered retrieval. Multiple vector stores, ColBERT and LLM-based reranking, and natural language to SQL.

ChromaDB, PGVector, Weaviate, Pinecone
Auto-sync from S3, Confluence, SharePoint, Google Drive
Smart chunking analytics with recommendations
Built-in evaluation framework with DeepEval

Agents

Agents + MCP

Zero-shot ReAct agents with built-in tools and MCP (Model Context Protocol) server support. Connect any MCP-compatible server via HTTP/SSE or stdio for unlimited tool access.

Playground

Projects & Chat Playground

Create and manage AI projects with their own LLM, system prompt, tools, and configuration. Test instantly in the built-in chat playground with streaming responses and multimodal support.

Visual Logic

Block — Visual Logic Builder

Build processing logic visually using a Blockly-based IDE — no LLM required. Drag-and-drop blocks to define how input is transformed into output. Use the "Call Project" block to compose AI pipelines.

Quality

Evaluation Framework

Built-in evaluation system to measure and track AI project quality over time. Create test datasets, run evaluations with multiple metrics, and visualise score trends.

Answer Relevancy — is the answer relevant?
Faithfulness — is it grounded in context? (RAG)
Correctness — does it match expected output?

Versioning

Prompt Versioning

Every system prompt change is automatically versioned. Browse history, compare versions, and restore any previous prompt. Eval runs link to prompt versions for A/B comparison.

GPU

Image Generation

Local and remote image generators loaded dynamically. Supports Stable Diffusion, Flux, DALL-E, RMBG2, and more. Auto-detects NVIDIA GPUs with detailed hardware monitoring.

Enterprise

Teams, Branding & Security

Multi-tenant with teams, RBAC, custom branding per team (white-labelling), TOTP 2FA, input/output guardrails, and a full audit log.

Custom logos, colors, and app names per team
OAuth, LDAP, OIDC single sign-on
TOTP 2FA with admin enforcement
Full audit trail for all mutations

Auto-Sync

Knowledge Base Sync

Automatically keep your RAG knowledge base up-to-date by syncing from external sources on a schedule. Configure per project with independent settings.

Web URLs — re-scrape pages automatically
Amazon S3 — sync documents from buckets
Confluence — sync space pages
SharePoint — sync via Microsoft Graph
Google Drive — sync via service account

Embed

Embeddable Chat Widget

Add an AI chat bubble to any website with a single <script> tag. Streams responses in real-time, maintains conversation context, and works on any domain.

Fully customisable — colors, title, avatar, position, welcome message
Shadow DOM isolation — won't conflict with host page styles
Live preview in the admin UI before deploying
Secured via read-only, project-scoped API key

WordPress

WordPress Plugin

Drop a single zip into any WordPress site and turn your RESTai instance into the AI engine behind it. Each capability maps to its own RESTai project, so models, prompts and budgets stay tunable per task — and the plugin auto-provisions the starter projects on first connect.

Generate content, excerpts and SEO meta from Gutenberg — writes into Yoast and Rank Math fields
One-click featured image generation using whichever generator your team has access to
Translate any post to N languages as drafts — Polylang/WPML compatible
AI comment moderation, WooCommerce product descriptions, knowledge sync, AI site search and email personalisation
Embeddable chat widget with the right widget key auto-provisioned, no copy-paste required

RESTai WordPress plugin in the Gutenberg editor

OpenAI-Compatible

Direct Access API

Use LLMs, image generators, and audio transcription directly via OpenAI-compatible endpoints — no project required. Team-level permissions control access, and all usage counts toward budgets.

POST /v1/chat/completions — Chat with any LLM (streaming supported)
POST /v1/images/generations — Generate images via DALL-E, Flux, SD, etc.
POST /v1/audio/transcriptions — Transcribe audio files

Works with any OpenAI-compatible SDK — just point base_url to your RESTai instance.

Quick start

Self-host your AIaaS in 30 seconds

Install from PyPI

# Install
pip install restai-core

# Setup database
restai init
restai migrate

# Start server
restai serve

# Open http://localhost:9000/admin
# Login: admin / admin

Docker (prebuilt image)

# Pull and run — multi-arch (amd64/arm64)
docker run -p 9000:9000 apocas/restai:latest

# Or pin a release (with env file)
docker run -p 9000:9000 \
  --env-file .env \
  apocas/restai:6.2.13

# Also on GHCR: ghcr.io/apocas/restai

From source

# Clone and install
git clone https://github.com/apocas/restai
cd restai && make install

# Open http://localhost:9000/admin
# Login: admin / admin

With env file: restai serve -e .env -p 8080 -w 4 · Compose: docker compose --env-file .env up --build

PyPI Package View on GitHub

FAQ

Open-source AIaaS, self-hosted — answered

What is AIaaS (AI as a Service)?

AIaaS — AI as a Service — is a delivery model where AI capabilities (LLM inference, RAG, agents, image generation, embeddings) are exposed as managed services consumable via APIs. RESTai is an open-source, self-hosted AIaaS platform: you run the same kind of platform that hosted AI vendors offer, but on your own infrastructure.

Is RESTai really self-hosted and open-source?

Yes. RESTai is fully self-hosted and released under the Apache 2.0 open-source license. Install via PyPI, Docker, or a Helm chart on Kubernetes. No vendor lock-in, no telemetry phone-home, your data and your models stay on your infrastructure.

Which LLM providers does this AIaaS platform support?

OpenAI, Anthropic (Claude), Ollama (local), Google Gemini, Groq, Grok (xAI), LiteLLM, vLLM, Azure OpenAI, AWS Bedrock, and any OpenAI-compatible endpoint. Mix multiple providers per team, with per-project budgets and fallbacks.

How is RESTai different from a hosted AIaaS provider?

Hosted AIaaS providers run AI workloads on their cloud and bill per token. RESTai gives you the same product surface — RAG projects, agents, MCP tools, eval, analytics, branding — as open-source software you self-host. You keep full control of data, model selection, prompts, and budgets.

What can I build with the RESTai AIaaS platform?

RAG over your own documents, MCP-powered agents, visual logic pipelines (Block IDE), full apps via the App Builder, embeddable chat widgets, WordPress AI integrations, and direct OpenAI-compatible LLM/image/audio endpoints — all behind one self-hosted REST API.

RESTai — Open-source AIaaS Platform · Self-hosted AI as a Service

See it in action

The open-source AIaaS platform built to be self-hosted

Multi-project platform

Full Web UI + analytics

Any LLM provider

Enterprise security

Extensible tools (MCP)

White-label branding

Dashboard & Analytics

RAG — Retrieval-Augmented Generation

Agents + MCP

Projects & Chat Playground

Block — Visual Logic Builder

Evaluation Framework

Prompt Versioning

Image Generation

Teams, Branding & Security

Knowledge Base Sync

Embeddable Chat Widget

WordPress Plugin

Direct Access API

Any LLM provider

Self-host your AIaaS in 30 seconds

Open-source AIaaS, self-hosted — answered