VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 241
Running on A10G Featured 219 faster-qwen3-tts 🎙 219 Generate natural speech from text using custom or cloned voices
Running on T4 Agents Featured 469 Parakeet-TDT-0.6b-V2 469 Transcribe audio files with timestamps and download transcripts
Running on CPU Upgrade Featured 3.14k The Smol Training Playbook 📚 3.14k The secrets to building world-class LLMs
Running Featured 88 Parakeet STT Progressive Transcription 🎤 88 Transcribe speech to text instantly with WebGPU acceleration
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.57M • • 2.98k
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations Paper • 2108.01073 • Published Aug 2, 2021 • 9
Paused Agents Featured 136 Qwen3-ASR Demo 🎙 136 Transcribe audio to text with multi-language timestamps
Running on CPU Upgrade Agents 1.58k Omni Image Editor 🖼 1.58k Image edit, text to image, image upscale, remove watermark
Configuration error Agents Featured 1.91k Qwen3-TTS Demo 🎙 1.91k Generate speech audio from text with custom or cloned voices