Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 11 items • Updated 2 days ago • 44
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Paper • 2306.08753 • Published Jun 14, 2023 • 2
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations Paper • 2407.03495 • Published Jul 3, 2024 • 1