Show HN: Voxtral Mini 4B Realtime running in the browser

by adefaon 2/8/2026, 9:33 PMwith 0 comments

Hello! Earlier this week Mistral released:

https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-26...

Last time I ported a TTS model to Rust using candle, this time I ported an ASR model to Rust with burn.

I was able to lean on the wgpu backend to get the model running in the browser after sharding it.

Here is the HF Space:

https://huggingface.co/spaces/TrevorJS/voxtral-mini-realtime

and here are the model weights (q4 + tokenizer):

https://huggingface.co/TrevorJS/voxtral-mini-realtime-gguf

and the code:

https://github.com/TrevorS/voxtral-mini-realtime-rs

Didn't have a chance to use agent teams with this project, maybe next one! :)

0