Post
102
After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking π₯
stepfun-ai/Step-Audio-R1.1
β¨ Apache 2.0
β¨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning
stepfun-ai/Step-Audio-R1.1
β¨ Apache 2.0
β¨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning