-
deepseek-ai/DeepSeek-R1
Text Generation • Updated • 897k • • 13.1k -
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 848k • • 2k -
google/gemma-2-27b-it
Text Generation • 27B • Updated • 401k • 559 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 15
Collections
Discover the best community collections!
Collections including paper arxiv:2308.09687
-
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks
Paper • 2412.15605 • Published • 2 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 79 -
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
Paper • 2403.14403 • Published • 7 -
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Paper • 2412.13171 • Published • 35
-
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
Paper • 2402.02805 • Published • 1 -
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language Modeling
Paper • 1906.07241 • Published • 2 -
A Latent Space Theory for Emergent Abilities in Large Language Models
Paper • 2304.09960 • Published • 3 -
Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning
Paper • 2310.01061 • Published • 2
-
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 54 -
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 79 -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Paper • 2211.12588 • Published • 3
-
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
Paper • 2308.10379 • Published -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
Paper • 2307.15337 • Published • 39 -
Tab-CoT: Zero-shot Tabular Chain of Thought
Paper • 2305.17812 • Published • 2
-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43 -
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Paper • 2412.14161 • Published • 51 -
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
Paper • 2408.10945 • Published • 10 -
PDFTriage: Question Answering over Long, Structured Documents
Paper • 2309.08872 • Published • 55
-
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 54 -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 34 -
Language models in molecular discovery
Paper • 2309.16235 • Published • 10
-
deepseek-ai/DeepSeek-R1
Text Generation • Updated • 897k • • 13.1k -
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 848k • • 2k -
google/gemma-2-27b-it
Text Generation • 27B • Updated • 401k • 559 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 15
-
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
Paper • 2308.10379 • Published -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
Paper • 2307.15337 • Published • 39 -
Tab-CoT: Zero-shot Tabular Chain of Thought
Paper • 2305.17812 • Published • 2
-
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks
Paper • 2412.15605 • Published • 2 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 79 -
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
Paper • 2403.14403 • Published • 7 -
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Paper • 2412.13171 • Published • 35
-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43 -
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Paper • 2412.14161 • Published • 51 -
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
Paper • 2408.10945 • Published • 10 -
PDFTriage: Question Answering over Long, Structured Documents
Paper • 2309.08872 • Published • 55
-
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
Paper • 2402.02805 • Published • 1 -
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language Modeling
Paper • 1906.07241 • Published • 2 -
A Latent Space Theory for Emergent Abilities in Large Language Models
Paper • 2304.09960 • Published • 3 -
Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning
Paper • 2310.01061 • Published • 2
-
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 54 -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 34 -
Language models in molecular discovery
Paper • 2309.16235 • Published • 10
-
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 54 -
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 79 -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Paper • 2211.12588 • Published • 3