A themeable, pluggable AI chat widget for websites — built in Typescript with zero framework dependencies. It renders using Vanilla JS.
Persona gives you a drop-in UI for your AI assistant that works on basically any site or product on the web. It ships with support for streaming responses, voice I/O, multi-modal content, tool call visualization, artifact rendering, and a plugin system so you can customize every layer of the UI.
Persona works with any SSE-capable backend. It's pre-integrated with Runtype out of the box, so you can go from install to live assistant with zero configuration.
persona-chat.dev hosts the interactive gallery (35+ pages): streaming chat, voice, docked and fullscreen layouts, themes, tool calls, artifacts, and more. It mirrors examples/embedded-app. To run the same pages on your machine with hot reload while you edit code, run pnpm dev from the repository root: the Vite dev server reloads the demo, and the app resolves @runtypelabs/persona from the workspace (packages/widget), so widget changes apply without publishing to npm.
| Package | npm | Description |
|---|---|---|
packages/widget |
@runtypelabs/persona |
The installable chat widget |
packages/proxy |
@runtypelabs/persona-proxy |
Optional Hono-based proxy server for flow configuration |
| Example | Platform | Description |
|---|---|---|
examples/embedded-app |
Vite | Vanilla JS demo with runtime configuration (live) |
examples/vercel-edge |
Vercel / Railway / Fly.io | Node.js proxy server |
examples/cloudflare-workers |
Cloudflare Workers | Edge proxy server |
corepack enable
pnpm install
pnpm devThis starts the proxy on http://localhost:43111 and the demo app at http://localhost:5173. Both depend on the local widget package via workspace linking, so changes hot-reload without publishing.
Note: Requires Node.js 20+ (
nvm usereads.nvmrc). Corepack manages pnpm for you.
npm install @runtypelabs/persona # widget
npm install @runtypelabs/persona-proxy # proxy (optional)Everything below is opt-in and configurable via the widget config, feature flags, or the plugin system.
SSE-based message streaming with pluggable parsers (plain text, JSON, XML, regex). Bring your own stream parser or use the built-ins. Supports partial JSON parsing for incomplete chunks.
Text, images (PNG, JPEG, GIF, WebP, SVG), and documents (PDF, DOCX, TXT, CSV, JSON, Excel). Configure allowed file types, size limits, and previews through the attachments config.
Optional speech-to-text via the Web Speech API or Runtype's WebSocket voice service with barge-in interruption and voice activity detection. Text-to-speech playback for assistant responses. Enable via the voice config.
Collapsible reasoning bubbles that display model chain-of-thought with duration tracking and streaming. Controlled by features.showReasoning — on by default, or override the renderer with a plugin hook.
Expandable tool call bubbles showing name, status, arguments, and results. Optional human-in-the-loop approval system with configurable timeout. Controlled by features.showToolCalls with a plugin hook for custom rendering.
Optional side-panel for rendering markdown and component content. Desktop split layout (resizable) or mobile drawer. Enable via features.artifacts, configure toolbar presets, copy behavior, and appearance.
Optional real-time event capture with search/filter, badge coloring, timestamps, and expandable payloads. Enable via features.showEventStreamToggle. Customize rows, toolbar, and payload rendering through plugin hooks.
Light and dark themes included. Full design token system (palette, semantic, component-level) with CSS variable support. Extend with built-in plugins for accessibility, reduced motion, high contrast, and branding — or create your own.
Start from a built-in preset (shop, minimal, fullscreen) or configure from scratch. Header layouts, message layouts, avatars, timestamps, and slot-based rendering are all customizable. Dock as a floating widget or embed inline.
14 render hooks covering the launcher, header, composer, messages, reasoning, tool calls, approvals, loading/idle indicators, and the event stream. Priority-based ordering with automatic fallback to defaults. Replace any piece of the UI without forking.
Optional message-level upvote/downvote/copy with automatic backend submission. CSAT and NPS survey components. Wire up custom callbacks for your own analytics.
Renders multi-turn agent loops as they stream from the backend — displaying iteration progress, reflections, and stop reasons. Agent metadata is attached to every message. Customize how execution events appear through plugin hooks.
Register custom components and render them inline via directives. Stream-aware parser and middleware for dynamic UI insertion during streaming.
Programmatically insert messages (injectMessage, injectAssistantMessage, injectUserMessage, injectSystemMessage) with dual-content support — display one thing to the user while sending different content to the LLM.
Both proxy examples handle secure API key management, CORS, and multiple flow configurations.
- vercel-edge — best for quick deployment to Vercel or any Node.js host
- cloudflare-workers — best for global edge deployment with low latency
This monorepo uses Changesets for version management.
pnpm changeset # create a changeset after making changes
pnpm changeset version # bump versions and generate changelogs
pnpm release # build and publish to npmSee packages/widget/README.md for the full configuration reference.
MIT
