✅ 295B total / 21B active / 256K context ✅ Fused fast-and-slow thinking in a single model ✅ First model trained on Hunyuan's rebuilt pretraining + RL infra (Feb → Apr)
Benchmarks: 👉 SWE-Bench Verified, Terminal-Bench 2.0, BrowseComp, WideSearch — competitive results, particularly strong on agentic tool use 👉 Top score on Tsinghua's 2026 Spring math PhD qualifying exam 👉 Strong context-learning and instruction-following on Tencent's CL-bench / CL-bench-Life
HY-World-2.0 — A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds is now available on Spaces, and it works both as native Gradio components and in Gradio server mode.
Darwin-TTS: 3% of an LLM's Brain Makes TTS Speak with Emotion — Zero Training
We blended 3% of Qwen3-1.7B (LLM) FFN weights into Qwen3-TTS-1.7B's talker module. The result: emotionally enhanced speech synthesis — with zero training, zero data, and zero GPU hours.
Qwen3-1.7B (LLM) and Qwen3-TTS-1.7B's talker share an identical architecture — same hidden_size (2048), same layer count (28), same heads (16). This enabled pure 1:1 weight blending across 84 FFN tensors with a single lerp operation. At a 3% blend, emotion appears. At 5%, emotion intensifies. At 10%, the model breaks — producing 655-second outputs for a 3-second sentence, because the LLM's "keep generating" pattern overwhelms the TTS stop signal.
To our knowledge, this is the first training-free cross-modal weight transfer between an LLM and a TTS model. Prior work requires adapter training (SmolTolk, 2025), fine-tuning (CSLM, 2025), or massive end-to-end compute (GPT-4o). Darwin-TTS achieves cross-modal capability transfer in under 2 minutes on CPU.
The key insight: TTS models with LLM backbones already "think" in language. We're just restoring 3% of the original LLM's language understanding patterns — particularly those related to emotional semantics and prosody planning. The code is three lines: load the model, load the LLM FFN, call p.lerp_(llm_weight, 0.03).
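The three-line recipe above can be sketched as follows: a minimal, dependency-free illustration of the blend math. The tensor key names are placeholders (not the real checkpoint keys), plain lists stand in for the FFN weight matrices, and in PyTorch the same in-place operation is `p.lerp_(llm_weight, 0.03)`.

```python
# Sketch of the Darwin-TTS FFN blend, assuming matching tensor shapes.
# Plain lists stand in for the 84 matched FFN tensors; in PyTorch the
# equivalent in-place call is p.lerp_(llm_weight, alpha).

def lerp(tts_w, llm_w, alpha):
    # Linear interpolation: p <- p + alpha * (llm - p)
    return [t + alpha * (l - t) for t, l in zip(tts_w, llm_w)]

# Toy stand-ins for one matching FFN tensor (illustrative key name).
tts_ffn = {"layers.0.mlp.gate_proj": [0.10, -0.20, 0.30]}
llm_ffn = {"layers.0.mlp.gate_proj": [0.50, 0.00, -0.10]}

ALPHA = 0.03  # 3%: emotion appears; 5%: intensifies; 10%: generation breaks

blended = {k: lerp(v, llm_ffn[k], ALPHA) for k, v in tts_ffn.items()}
```

At ALPHA = 0, the talker is untouched; at ALPHA = 1, its FFNs are fully replaced by the LLM's, so the 3% setting restores only a thin slice of the original language patterns.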
We are the creators of the Darwin Evolutionary Merge Framework. Darwin LLM V7 achieved 86.9% on GPQA Diamond (HF Benchmark #3) through CMA-ES-optimized FFN crossbreeding. Darwin-TTS extends this principle from LLM-to-LLM merging to cross-modal LLM-to-TTS transfer. Released under Apache 2.0.
We added multi-simulator support for our parallel gripper on the SO-ARM 100/101 arm. One URDF, one launch file, five physics engines: Gazebo, MuJoCo, Webots, CoppeliaSim, Isaac Sim.
The ROS2 package includes xacro descriptions, joint trajectory controllers, robot state publisher, and parameterized hardware interfaces for each simulator.
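A "one URDF, one launch file" setup like the one described could look like this minimal sketch. The package and file names here are hypothetical (not the actual repo layout), and the assumption is that the xacro switches its hardware interface based on a `simulator` argument; the launch file processes the shared xacro and starts robot_state_publisher.

```python
# Hypothetical launch-file sketch: one xacro, simulator chosen by argument.
# Package and file names are illustrative, not the actual repo layout.
import os
from ament_index_python.packages import get_package_share_directory
from launch import LaunchDescription
from launch.actions import DeclareLaunchArgument
from launch.substitutions import Command, LaunchConfiguration
from launch_ros.actions import Node


def generate_launch_description():
    pkg = get_package_share_directory('so_arm_gripper_description')  # hypothetical
    xacro_file = os.path.join(pkg, 'urdf', 'so_arm_gripper.urdf.xacro')

    # The backend argument selects which hardware interface the xacro emits.
    sim_arg = DeclareLaunchArgument(
        'simulator', default_value='gazebo',
        description='gazebo | mujoco | webots | coppeliasim | isaac')

    robot_description = Command(
        ['xacro ', xacro_file, ' simulator:=', LaunchConfiguration('simulator')])

    rsp = Node(
        package='robot_state_publisher',
        executable='robot_state_publisher',
        parameters=[{'robot_description': robot_description}])

    return LaunchDescription([sim_arg, rsp])
```

Keeping the simulator choice inside the xacro, rather than in five separate launch files, is what lets a single description serve all five physics engines.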
The demo for image detection based on SAM3 and Gemma-4 (*Filter) is now available on Spaces, using full-fledged Transformers inference with multimodal reasoning over processed images. It also supports video segmentation (mask), video segmentation (annotation), and image click segmentation.