Hugging Face

ZhenYE

ZhenYe234
https://github.com/zhenye234
  • zhenye234

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago
Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling
liked a dataset 4 months ago
jjuik2014/DH-FaceVid-1K
liked a model 7 months ago
Atotti/Qwen3-Omni-AudioTransformer

Organizations

HKUST Audio

authored 5 papers over 1 year ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6, 2025 • 27

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Paper • 2305.06908 • Published May 11, 2023 • 7

CoMoSVC: Consistency Model-based Singing Voice Conversion

Paper • 2401.01792 • Published Jan 3, 2024 • 11

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 32

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Paper • 2408.17175 • Published Aug 30, 2024 • 6