jamescham 10 hours ago

Pete Warden and team just published a paper on Moonshine, their speech to text model.

Key features include:

- 1.7x overall speed boost compared to Whisper - Flexible-sized input window, allowing for more efficient processing of shorter audio clips - Up to 5x faster performance on 10-second audio clips - Matches or exceeds Whisper's accuracy