
Kunal Dhawan

kunaldhawan
nvidia
https://kunal-dhawan.weebly.com/
  • KunalDhawan

AI & ML interests

Conversational AI, NLP, Multimodal Machine Learning

Recent Activity

updated a model 13 days ago
nvidia/nemotron-speech-streaming-en-0.6b
new activity about 2 months ago
nvidia/nemotron-speech-streaming-en-0.6b: Deploy Streaming nemotron speech model
commented on an article about 2 months ago
Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Organizations

NVIDIA, Dynamic-SUPERB, ASR-LLM Group: Generative Error Correction

upvoted a collection 2 months ago

Nemotron Speech

Collection
Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 11 items • Updated 2 days ago • 44
upvoted an article 3 months ago
Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5 • 85
upvoted 2 papers 10 months ago

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer

Paper • 2306.08753 • Published Jun 14, 2023 • 2

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations

Paper • 2407.03495 • Published Jul 3, 2024 • 1