Parlant 3.2 is out! 🚀 ! The headline: streaming responses.
The story is like this... I ran a poll in our Discord server (addressing more than 1k Parlant developers) asking what's their most wanted feature for our next release.
The highest score, by far, went to the speed of responses, beating more control (Parlant's pretty damn good at that already), deeper observability, and other options.
So in the newly released v3.2, we listened to our awesome community and complied!!!
Here's what's new in this release:
🔹 Streaming responses: A new output mode where responses can now arrive token by token instead of as a single block. For voice agents, this means the agent can start speaking before the full response is even ready, which dramatically improves the UX of voice agents built on Parlant.
🔹 Labels: Tag guidelines, journeys, and states with labels that propagate into session metadata automatically. Easily pull up all sessions where an upsell guideline was activated, or where a customer requested escalation to a human. This is our new foundation for real-world analytics on what your agent is actually doing out there.
🔹 Scoped retrievers: Attach data retrievers to specific guidelines or journeys instead of running them globally. For instance, pricing docs only load whenever pricing is discussed. Cleaner context window, fewer wasted calls, and much less of that irritating scope creep agents tend to get.
🔹 Agent personality: You can now easily customize Parlant's unique preamble messages per agent with dynamic, context-aware instructions.
🔹 Track control: Strict control over whether a guideline fires once per conversation or context, or keeps reapplying every time its condition matches. For example, empathy cues and tone adjustments shouldn't be one-and-done.
And other useful improvements: explicit field dependencies for canned responses, bulk relationship definitions, and a batch of engine and SDK fixes.
Full breakdown on the blog (first comment) 👇