Speechify Chrome Extension Adds Voice Typing & Assistant

Speechify Expands Capabilities with Voice Detection Features
Speechify, traditionally known as a platform for listening to articles, PDFs, and documents, is now integrating voice detection functionalities into its Chrome extension. These additions include voice typing capabilities and an AI-powered voice assistant designed to respond to user inquiries.
The Rise of Voice Detection Tools
Over the past year, the availability of voice detection tools has increased significantly. This growth is largely attributed to improvements in the quality of speech recognition models. Speechify is capitalizing on this trend by introducing its own dictation tool, initially supporting the English language.
Like comparable dictation software, Speechify’s voice typing feature is designed to automatically correct errors and eliminate unnecessary filler words during transcription.
Initial Testing and Performance
Preliminary testing over a little more than a day showed where Speechify’s new tool still needs refinement. Dictation worked in applications such as Gmail and Google Docs, but it was difficult to activate and use on sites like WordPress.
The company has said it will progressively optimize the tool for widely used websites.
In terms of accuracy, the word error rate was higher than that of alternative tools like Wispr Flow, Willow, and Monologue. Speechify says its model is designed to learn and improve with use, so error rates should fall over time.
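For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, deletions, and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The Python sketch below illustrates that standard definition only. It is not the method used in the testing described here, and the function name and sample sentences are invented for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the word-level edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from transcript
                curr[j - 1] + 1,     # insertion: extra word in transcript
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substituted word and one dropped word
# against a five-word reference.
score = word_error_rate("send the report by friday", "send a report friday")
print(f"WER: {score:.2f}")
```

A lower score is better, and a perfect transcript scores 0. Because insertions count as errors, the value can exceed 1 when a transcript contains many extra words.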
Introducing the Conversational Voice Assistant
Alongside voice typing, Speechify is launching a conversational voice assistant accessible through the browser’s sidebar. Users can pose questions related to the current webpage, such as requesting a summary of key ideas or a simplified explanation of the content.
While platforms like ChatGPT and Gemini also offer conversational modes, Speechify differentiates itself by prioritizing voice interaction. The company asserts that voice functionality is often secondary in these other applications.
“We maintain the belief that chat will consistently represent the primary user experience within ChatGPT and Gemini upon application launch, aligning with user expectations. Voice will invariably be a secondary consideration – and frequently an afterthought – for these platforms. Our extensive experience in developing Speechify has demonstrated a substantial market segment, including our existing users, who desire voice as the primary, default interaction method whenever they access an application and engage with AI,” explained Rohan Pavuluri, Speechify’s chief business officer, in a statement to TechCrunch.
Compatibility and Future Development
Currently, Speechify’s assistant is not compatible with browsers featuring integrated sidebar assistants, such as OpenAI’s Atlas, Perplexity’s Comet, and Dia. However, the company says it is not overly concerned, because the extension is focused primarily on Chrome, which has an extensive user base.
Speechify intends to integrate both voice typing and the voice assistant into all its applications across desktop and mobile platforms in the coming months.
Looking Ahead: AI Agents for Task Automation
The company also plans to develop AI agents capable of autonomously completing tasks on a user’s behalf. While the full roadmap remains undisclosed, the examples it gave were making phone calls to schedule appointments and waiting on hold for customer support representatives. Companies like Truecaller and Cloaked are pursuing similar objectives.





