Preview edition

Features Specs FAQ Docs Buy now

Bring
choice to voice

The best way to get started with voice

Bring choice

Top view of the Home Assistant Voice Preview Edition

Our vision

Let’s make open, local, and private voice assistants a reality

January 2023

Year of the voice

Home Assistant declares 2023 the year of the voice; the community and Nabu Casa focus on rapid development. Assist is added to Home Assistant.

April 2023

Home Assistant Cloud gets voice

Home Assistant Cloud adds speech-to-text and text-to-speech capabilities, enabling low-powered hardware to run Assist. More languages and features are added.

December 2024

Voice Preview Edition launches

Dedicated high-quality hardware allows you to experience and develop this community-driven and private voice assistant.

Future

Fully-local voice for everyone

We will match and then surpass other voice assistants. Supporting languages that big tech ignores - while being private, open, and fully customizable. Running inexpensively on local hardware.

In short Advanced audio processing

Built for Home Assistant

A community image of the Voice Preview Edition n a computer desk featuring a keyboard, mouse, and two game controllers arranged neatly on its surface.

Credit: MrDarrenGriffin

The Voice Preview Edition sitting alongside a Home Assistant Green device

A close-up photo of the rotary volume dial and button on the Voice Preview Edition

A close-up photo of the physical mute switch on the Voice Preview Edition

A close-up photo of one of the two microphones on the Voice Preview Edition

A close-up photo of the speaker holes on the Voice Preview Edition

Run local or in the cloud

Preview the future of voice for just

$69 / €59 *

Front view of the Voice Preview Edition showing the speaker holes

Buy now * Recommended MSRP. Prices differ between regions due to varying local market costs and conditions, and subject to individual retailers.

Built for Home Assistant

Easy Setup - one USB-C cable and setup wizard, no assembly required.

Turn on the light. Turn on the TV. What's the temperature? Set a timer for 5 minutes. Turn on the fan. Add milk to my shopping list.

Turned on the light. Turned on the media player. It's 24 degrees. Timer started. Turned on the fan. Added milk.

Advanced audio processing

that works in the home

Click to rotate

Close-up photo showing one of the two microphones on the Voice Preview Edition

Cuts through the noise

Dual mics and XMOS audio chip hears your voice in most environments

Close-up photo of the physical mute (in a muted state) switch on the Voice Preview Edition

Mute switch

Physically cuts power to the microphones for guaranteed privacy

Close-up photo of the rotary volume dial, button and illuminated (blue) LED ring on the Voice Preview Edition

Physical controls

Button and rotary volume dial that feels great and tactile

Speaker

Audio feedback you can hear from across the room

Close-up photo of the illuminated (blue) LED ring on the Voice Preview Edition

Visual feedback

Multicolored LED ring displays when it’s listening, volume levels, and more

Close-up photo of the 3.5mm audio jack on the Voice Preview Edition

Audio out

Stereo 3.5mm jack lets you stream lossless quality music via its dedicated DAC

Injection-moulded case

A small, unobtrusive design that blends into the home

Bringing choice
to voice

Run voice fully locally, or offload speech processing to our privacy-first cloud.

Focused local processing

Limited to a set list of common home control phrases, this allows even low-powered hardware system to process speech locally and offline.

Full local processing

Full speech processing is done locally, requiring high processing power for adequate speed and accuracy.

Generic outline drawing of the Voice Preview Edition

Home Assistant Cloud

Your voice is processed privately on Home Assistant Cloud, allowing Assist to run very accurately on low-powered hardware.

Check supported languages

Assist aims to support more languages than other voice assistants, but this is still a work in progress, and we need your help.

Local processing

Home Assistant Cloud

Help improve language support

Start experimenting with AI

Connect your Voice Preview Edition to popular AI providers, or local LLMs, to control your home with natural language.

Fully open source and community-driven

Fully open software, firmware, and hardware allows you to make it work best for your needs. It includes a Grove port for connecting sensors and a 3.5mm headphone jack for connecting external speakers. With a dedicated community customizing and adding functionality.

Be part of building Assist and preview the future of voice control in the home.

Explore resources Join our community

Specs

Top-down view of the Voice Preview Edition

Side view (mute switch) of the Voice Preview Edition

Bottom view of the Voice Preview Edition featuring the product label and Grove port

Side view (speaker) of the Voice Preview Edition

Back view (audio and power) of the Voice Preview Edition

Dimensions and weight

Device

84x84x21 mm
96 g

With box

94x94x30 mm
120 g

Material

Enclosure

Injection-molded Polycarbonate plastic

Colors

White and semi-transparent

Technical drawing of the top cover for the Voice Preview Edition

Technical drawing of the PCB for the Voice Preview Edition

Technical drawing of the speaker for the Voice Preview Edition

Technical drawing of the case for the Voice Preview Edition

Technical drawing of the rubber feet for the Voice Preview Edition

Outside

Physical controls

Multipurpose button
Rotary dial for volume and other input
Mute switch that physically cuts power to the microphone

Microphone & speaker

Internal speaker
Internal dual-mic array
3.5mm audio output

Expandability

Grove port to connect sensors or other accessories
Easy to open - no clips, only screws to access internals
Exposed pads on PCB for modding

Inside

SoC

ESP32-S3 SoC with 16 MB of FLASH storage
8 MB octal PSRAM

Audio Processing

XMOS XU316
Featuring:
Echo cancellation
Stationary noise removal
Auto gain control
Dedicated I2S lines for audio in and out

Power / data

USB-C, 5 V DC, 2 A

Radios

2.4 GHz Wi-Fi
Bluetooth 5.0

Audio output

3.5 mm (⅛”) stereo headphone jack
Digital to analog converter (DAC):
TI AIC3204
48 kHz sampling rate

Software

ESPHome preloaded
Fully open-source firmware for both the ESP32 and XMOS chip

Environmental conditions for operation

Indoor use only
O °C to 30 °C | 32 °F to 86 °F
Humidity: non-condensing
Keep in dry, not excessively dusty environment as this can cause damage to the unit

Got some
questions?

Why is this called the Preview Edition?

It is our vision to make open, local, and private voice assistants a reality in any language. While we have made great strides in realizing this, it is such a massive undertaking that we need our worldwide community to participate in its development. An essential ingredient for the community to drive the project forward is a standardized hardware platform for voice, built for Home Assistant from the ground up: Home Assistant Voice Preview Edition.

While for some, the current state of our voice assistant may be all they need, we think there is still more to do before it is ready for every home in every country, and until then, we’ll be selling this Preview of the future of voice assistants. Taking back our privacy isn’t for everyone - it’s a journey - and we want as many people as possible to join us and make it better.

Can I run this voice assistant fully locally?

Yes, provided your language is supported. For some languages, you can set up Assist and your Voice Preview Edition to use a Focused Local model, which uses Speech-to-Phrase add-on. It can run locally and accurately on lower-power hardware, by generating a local speech-to-text model specifically for your Home Assistant installation. It is limited to predefined sentences aimed at controlling your home, but not able to process general speech. For instance, it could turn on a device, but would not be able to add something to your shopping list.

For other languages, Speech-to-Phrase may not be supported, which will require Fully Local speech-to-text solution, such as the Whisper add-on. Fully local speech-to-text requires powerful hardware to be accurate and responsive. If you wish to use OpenAI’s Whisper, we recommend using at least an Intel N100 or equivalent processor. This will allow you to use the Whisper Base model for speech-to-text locally. This model runs reasonably fast for languages that have large public datasets to train on, such as English and Spanish. However, for languages with less data available, you will need Whisper’s Small or Large models that require significantly more powerful hardware to run.

Do I need Home Assistant Cloud?

You do not need Home Assistant Cloud. However, for some languages, we believe that Home Assistant Cloud provides the best experience. There are languages that are not supported by the local speech-to-text models we leverage (either our focused local Speech-to-Phrase model, or the fully local Whisper by OpenAI), but are well-supported by the speech processing used by Home Assistant Cloud.

How does Home Assistant Cloud maintain my privacy with voice?

Home Assistant Cloud has been designed from the ground up to uphold the core values of the Home Assistant project, with privacy being one of our highest priorities. Home Assistant Cloud leverages the enterprise services of Microsoft Azure for its industry-leading speech processing, which unlike many consumer offerings, is bound by commercial terms and conditions and does not retain or store your data. In addition, Home Assistant Cloud itself does not keep any record of your voice, data, or commands.

Why is my language listed as “not supported” locally?

Three separate parts are needed for a language to be supported in local operation. Reliable, local speech-to-text models must be available to turn what is said into text commands that can be sent to Home Assistant. Home Assistant then needs to have sentence support for that language, so it knows which actions to perform for each command.

Finally, a local text-to-speech model has to be available for your language, so it can reply to you. If any of these three parts are not available locally, your language is not yet supported. Currently, there is one part that holds back our language support more than the others, and that’s local speech-to-text.

Why is my language listed as “not supported” by Home Assistant Cloud?

There can be a number of reasons why a language is not supported by Home Assistant Cloud, but most often it is because the sentences have not been translated by a member of our community. If you would like to help translate these sentences and have your language added, please visit here.

What can I do to get my language supported locally, or to improve support?

We need your help to improve or add support for your language. Through the help of our global community, ultimately, we aim to support every language possible. In our documentation, we have a list of various ways you can help us advance our open, local, and private voice assistant. Thank you for helping us make voice better in your language.

Can I use a different speech-to-text model than OpenAI’s Whisper?

Yes, Home Assistant can be configured to use any speech-to-text integration that supports the Wyoming protocol. The most commonly used fully local speech-to-text add-on available for Home Assistant users is OpenAI’s Whisper. This has mixed results; specifically, it lacks support for some languages and is hardware-intensive.

Our alternative speech-to-text model, Speech-To-Phrase, can run locally and accurately on lower-power hardware, though this does not provide full speech-to-text capabilities. Based on the Rhasspy project, it is able to create focused local speech-to-text models, limited to predefined sentences aimed at controlling your home, but not able to process general speech. For instance, it could turn on a device, but would not be able to add something to your shopping list.

Which wake words can I select?

Out of the box, the device can listen for “Okay Nabu,” “Hey Jarvis,” or “Hey Mycroft” as wake words. This is provided by the on-device wake word engine called microWakeWord. Creating these wake words requires very powerful hardware and large datasets to train, which is not realistic for most users.

In time we will work with the community to create more wake words, but currently are focused on improving our current wake words to work for a large variety of accents and voice registers.

Why did you pick these default wake words and not something like “computer” or “okay assist”?

A wake word should be uncommon in everyday conversations at home or in media, such as music or TV, to minimize the risk of the device activating unintentionally. “Nabu”, “Jarvis”, and “Mycroft” are fairly unique words, as opposed to generic terms such as “computer” or “assist”. That makes these microWakeWord models perform well for most users.

Can this replace my Google Mini, Apple HomePod, Amazon Echo, or other Big Tech devices?

In the future, we intend to match and then surpass the Big Tech voice assistants, but for now, this Preview Edition can not yet do everything those devices can. For some, the current capabilities of our voice assistant will be all they need; especially those who just want to set timers, manage their shopping list, and control their most used devices. For others, we understand they want to ask their voice assistant to make whale sounds or to tell them how tall Taylor Swift is - our voice assistant doesn’t do those things… yet.

Can I play music on this device?

Yes, if you plug an external speaker into the 3.5mm audio port. The built-in speaker is meant for voice feedback and is not optimized for listening to music, but the included DAC is capable of playing lossless audio on a suitable external speaker. We recommend using Music Assistant to control music playback.

Can I use AI models or LLMs as my voice assistant?

Yes, if you have paid access to a supported cloud LLM or have a local LLM running on suitable hardware, it is possible to either fully replace our voice assistant’s conversation agent with an LLM or use it as a fallback for commands that Home Assistant does not understand natively.

The benefit of this is being able to ask nearly any query that comes to mind, and speak commands in natural language. Just note, we consider the use of AI in the smart home to be experimental, and would recommend caution when letting it control important aspects of your home. Get started with AI and Assist.

Does the device come with a USB-C charger and cable?

No, the device does not come with a USB-C charger and cable. Sustainability is a core value of the Home Assistant project, and we do not wish to send more chargers or cables into the world when most users already own enough of these.

Which devices can I control, and what can I say to Voice Preview Edition?

You can find an overview of everything you can say to the device in our documentation. You may need to expose some devices manually to Assist, in order to let this device control them.