Simon Willison (@simonw)

2026.04.27 23:48

Microsoft's MIT licensed VibeVoice speech-to-text model (think Whisper with speaker diarization) is really good - my notes on running the 5.71GB 4bit MLX conversion on an M5 MacBook, using about 60GB of RAM at peak and transcribing 1hr of audio in ~9 mins

显示更多

转发到社区

Simon Willison@simonw

2025.12.31 23:53

Here's my enormous round-up of everything we learned about LLMs in 2025 - the third in my annual series of reviews of the past twelve months This year it's divided into 26 sections! This is the table of contents:

显示更多

103

4.9K

873

转发到社区

Simon Willison@simonw

2025.12.16 00:37

I ported a Python library implementing a full HTML5 parser to JavaScript using GPT-5.2 and Codex CLI in 4.5 hours, and decorated for Christmas and watched Knives Out while I was doing it

显示更多