Retell AI lets companies build ‘voice agents’ to answer phone calls
Call centers are embracing automation. There’s debate as to whether that’s a good thing, but it’s happening, and quite possibly accelerating.
According to research firm TechSci Research, the global market for contact center AI could grow to nearly $3 billion in 2028, from $2.4 billion in 2022. Meanwhile, a recent survey found that around half of contact centers plan to adopt some form of AI in the next year.
The motivation is rather obvious: Call centers are looking to reduce costs while scaling up their operations.
“Companies with heavy call center operations, looking to scale quickly without the constraints of human contact center agents, are highly receptive to adopting effective AI voice agent solutions,” entrepreneur Evie Wang told TechCrunch. “This approach not only reduces their overall costs but also decreases wait times.”
Wang is one of the co-founders of Retell AI, which provides a platform companies can use to create AI-powered “voice agents” that answer customer phone calls and perform basic tasks such as scheduling appointments. Retell’s agents are powered by a combination of large language models (LLMs) fine-tuned for customer service use cases and a speech model that gives voice to text generated by the LLMs.
Retell’s customers include some contact center operators but also small- and medium-sized businesses that regularly deal with high call volumes, like telehealth company Ro. They can build voice agents using the platform’s low-code tooling, or they can upload a custom LLM (e.g. an open model like Meta’s Llama 3) to further tailor the experience.
“We invest a lot in the voice conversation experience, as we see that as the most critical aspect of the AI voice agent experience,” Wang said. “We don’t view AI voice agents as mere toys that one can create with a few lines of prompts, but rather as tools that can offer substantial value to businesses and replace complex workflows.”
Retell worked well enough in my brief testing, at least on the call-facing side.
I arranged a call with a Retell bot using the demo form on Retell’s website. The bot walked me through the process of scheduling a hypothetical dentist’s appointment, asking questions like my preferred date and time, phone number and so on.
I can’t say the bot’s synthetic voice was the best I’ve heard in terms of realism; it was certainly not on par with Eleven Labs or OpenAI’s text-to-speech API. Wang, in Retell’s defense, said that the team’s been mostly focused on reducing latency and handling edge cases, like interruptions that might occur in a conversation.
The latency is low: In my test, the bot responded pretty much without hesitation to my answers and follow-up questions. And it stuck to its script. Try as I might, I couldn’t confuse it or prompt it to behave in a way it shouldn’t. (When I asked the bot about my dental records, it insisted that I speak with the office manager.)
So are platforms like Retell the future of call centers?
Maybe. For basic tasks like appointment scheduling, automation makes a lot of sense, which is probably why both startups and big tech firms alike offer solutions that compete head-on with Retell’s. (See Parloa, PolyAI, Google Cloud’s Contact Center AI, etc.)
It’s low-hanging, and seemingly revenue-generating, fruit. Retell claims to have hundreds of customers, all of which are paying per minute of voice agent conversation. Retell has raised a total of $4.53 million in capital to date, courtesy of backers including Y Combinator (where the company was incubated).
But the jury’s out on more-complicated queries, particularly given LLMs’ tendency to make up facts and go off the rails even with safeguards in place.
As Retell’s ambitions grow, I’m curious to see how the company navigates the many well-established technical challenges in the space. Wang, at least, seems confident in Retell’s approach.
“With the advent of LLMs and recent breakthroughs in speech synthesis, conversational AI is getting good enough to create really exciting use cases,” Wang said. “For example, with sub-one-second latency and the ability to interrupt the AI, we’ve observed users speaking in fuller sentences and conversing as they would with another person. We’re trying to make it easy for developers to build, test, deploy and monitor AI voice agents, ultimately to help them achieve production readiness.”