In a Training Loop 🔄

3 55

Alain Galvan

alaingalvan

https://alain.xyz

AI & ML interests

Ray tracing denoisers, autoencoders.

Recent Activity

liked a Space 4 days ago

microsoft/VibeVoice-ASR

upvoted a collection 4 days ago

VibeVoice

liked a Space 5 days ago

microsoft/TRELLIS.2

View all activity

Organizations

None yet

liked a Space 4 days ago

VibeVoice ASR

🌍

Official Playground of Microsoft VibeVoice-ASR

upvoted a collection 4 days ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 241

liked a Space 5 days ago

TRELLIS.2

🏢

1.53k

High-fidelity 3D Generation from images

liked 4 Spaces 2 months ago

faster-qwen3-tts

🎙

219

Generate natural speech from text using custom or cloned voices

Parakeet-TDT-0.6b-V2

469

Transcribe audio files with timestamps and download transcripts

The Smol Training Playbook

📚

3.14k

The secrets to building world-class LLMs

Parakeet STT Progressive Transcription

🎤

Transcribe speech to text instantly with WebGPU acceleration

liked a model 2 months ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.57M • • 2.98k

upvoted an article 3 months ago

Article

Creating custom kernels for the AMD MI300

Jul 9, 2025

•

upvoted a paper 3 months ago

SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations

Paper • 2108.01073 • Published Aug 2, 2021 • 9

liked a model 3 months ago

zai-org/GLM-OCR

Image-to-Text • Updated 17 days ago • 8.43M • • 1.68k

liked 2 Spaces 3 months ago

PaddleOCR-VL-1.5 Online Demo

😻

PaddleOCR-VL-1.5_Online_Demo

Qwen3-ASR Demo

🎙

136

Transcribe audio to text with multi-language timestamps

liked 2 models 3 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated Mar 2 • 530k • 2.48k

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • 3B • Updated Feb 3 • 1.5M • 935

liked 2 Spaces 3 months ago

Whisper

📉

2.76k

Transcribe audio files into text

Omni Image Editor

🖼

1.58k

Image edit, text to image, image upscale, remove watermark

liked 2 models 3 months ago

fal/flux-2-klein-4B-background-remove-lora

Image-to-Image • Updated Jan 21 • • 24

Supertone/supertonic-2

Text-to-Speech • Updated Jan 6 • 3.99k • 382

liked a Space 3 months ago

Qwen3-TTS Demo

🎙

1.91k

Generate speech audio from text with custom or cloned voices