Kevin Lin (@KevinQHLin) “🌟Introducing🎻Violin — an Open-source Video Translation Skill. 📹Video is the d” — TopicDigg

Kevin Lin
@KevinQHLin
building multimodal x agent postdoc @UniofOxford phd @NUSingapore | ex @Meta @Microsoft
Joined August 2022
834 Following    2K Followers
🌟Introducing🎻Violin: an Open-source Video Translation Skill.

📹Video is the dominant medium on the internet, yet most high-quality content (lectures, talks, podcasts) is locked behind a single language, leaving global audiences behind. So we built Violin: a video skill that combines speech recognition, LLM translation, and speech synthesis into one seamless pipeline.

🌐 Demo:
📝 Blog:
🔗 GitHub:

✨Key Features:
🎙️High-quality multilingual ASR & Translation & TTS.
🗣️Personalize translation & voice (turn an academic talk into something children can follow).
💬Chat with the video: ask any question grounded in the video.
🧩Supports Web app, CLI, and Agent skill.
🍃Fully open-source under MIT.

❤️Built with the wonderful @ShangZhu18 and advised by @james_y_zou! All features powered by @togethercompute. Try it and let us know what you think! 🎻
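
The post describes a three-stage pipeline: speech recognition, then LLM translation, then speech synthesis. As a rough illustration only, that flow could be sketched as below. This is not Violin's code or API: `Segment`, `DubbedSegment`, `translate_speech`, and the three stage callables are hypothetical names invented for this sketch, and real ASR, translation, and TTS backends would need to be plugged in.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Segment:
    """One transcribed span of speech (hypothetical structure)."""
    start: float  # seconds from the start of the video
    end: float
    text: str


@dataclass
class DubbedSegment:
    """A segment after translation and synthesis (hypothetical structure)."""
    start: float
    end: float
    source_text: str
    translated_text: str
    audio: bytes


# Assumed stage signatures; any real backend would be wrapped to match them.
Transcriber = Callable[[bytes], List[Segment]]   # audio -> timed transcript
Translator = Callable[[str, str], str]           # (text, target_lang) -> text
Synthesizer = Callable[[str], bytes]             # text -> synthesized audio


def translate_speech(
    audio: bytes,
    target_lang: str,
    transcribe: Transcriber,
    translate: Translator,
    synthesize: Synthesizer,
) -> List[DubbedSegment]:
    """Chain ASR -> translation -> TTS, keeping the original timestamps."""
    dubbed = []
    for seg in transcribe(audio):
        translated = translate(seg.text, target_lang)
        dubbed.append(
            DubbedSegment(
                start=seg.start,
                end=seg.end,
                source_text=seg.text,
                translated_text=translated,
                audio=synthesize(translated),
            )
        )
    return dubbed
```

Keeping each stage behind a plain callable means a backend can be swapped without touching the loop, and carrying the timestamps through is what would let translated audio be aligned back to the video.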