Artur Daveyan's picture

Artur Daveyan

ArturD

·

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

upvoted a collection 1 day ago

liked a model 1 day ago

mistralai/Mistral-Medium-3.5-128B

View all activity

Organizations

None yet

upvoted 2 collections 1 day ago

Qwen-Scope

15 items • Updated 1 day ago • 39

Mistral Medium 3.5

Our first flaship models handling instruction-following, reasoning, and coding in a single set of opened-weights. • 2 items • Updated 2 days ago • 10

upvoted a collection 3 days ago

talkie-13b

talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated 11 days ago • 38

upvoted a collection 7 days ago

DeepSeek-V4

4 items • Updated 8 days ago • 595

upvoted a paper 24 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 26 days ago • 112

upvoted a collection 29 days ago

Bonsai

1-bit Bonsai models • 7 items • Updated 14 days ago • 189

upvoted a collection about 1 month ago

GigaChat 3.1

6 items • Updated Mar 24 • 61

upvoted 3 collections 2 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.59k

pplx-embed

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated Feb 17 • 69

upvoted a paper 3 months ago

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published Feb 3 • 27

upvoted 4 collections 3 months ago

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models • 3 items • Updated 11 days ago • 13

PaddleOCR-VL-1.5

Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing • 7 items • Updated Mar 6 • 19

Step3-VL-10B

3 items • Updated Feb 2 • 36

Qwen3-TTS

7 items • Updated Jan 22 • 354

upvoted a paper 4 months ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 154

upvoted 3 collections 4 months ago

VulnLLM-R

Data and model for VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection • 9 items • Updated Dec 17, 2025 • 9

Kandinsky 5.0 Video Pro Diffusers

Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 4 items • Updated Dec 14, 2025 • 13

perception-encoder-audio-visual

9 items • Updated Dec 16, 2025 • 28

upvoted a collection 5 months ago

DR Tulu

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 6 items • Updated Feb 24 • 37