Inference Providers
Active filters: ONNX
microsoft/mistral-7b-instruct-v0.2-ONNX
Text Generation
• Updated • 25
• 6
luweigen/Llama-3-8B-Instruct-int4-onnx-directml
Text Generation
• Updated • 4
EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml
Text Generation
• Updated • 12
• 1
EmbeddedLLM/llama-2-13b-chat-int4-onnx-directml
Text Generation
• Updated • 2
EmbeddedLLM/mistral-7b-instruct-v0.3-int4-onnx-directml
Text Generation
• Updated • 41
• 3
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx
Text Generation
• Updated • 2
EmbeddedLLM/gemma-2b-it-onnx
Text Generation
• Updated • 1
EmbeddedLLM/Starling-LM-7b-beta-onnx
Text Generation
• Updated EmbeddedLLM/openchat-3.6-8b-20240522-onnx
Text Generation
• Updated • 1
EmbeddedLLM/Phi-3-small-128k-instruct-onnx
Text Generation
• Updated EmbeddedLLM/Phi-3-vision-128k-instruct-onnx
Text Generation
• Updated EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-onnx
Text Generation
• Updated EmbeddedLLM/gemma-7b-it-onnx
Text Generation
• Updated • 1
EmbeddedLLM/Phi-3-mini-128k-instruct-062024-onnx
Text Generation
• Updated EmbeddedLLM/Phi-3-mini-4k-instruct-062024-onnx
Text Generation
• Updated EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml
Text Generation
• Updated • 3
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml
Text Generation
• Updated • 2
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-int4-onnx-directml
Text Generation
• Updated • 3
EmbeddedLLM/Phi-3-medium-4k-instruct-onnx-directml
Text Generation
• Updated • 3
EmbeddedLLM/Phi-3-medium-128k-instruct-onnx-directml
Text Generation
• Updated • 6
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32
Text Generation
• Updated • 2
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
Text Generation
• Updated • 4
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32
Text Generation
• Updated • 3
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
Text Generation
• Updated • 3
Maximum2000/Phi-3.5-mini-instruct-cuda-fp16-onnx
Text Generation
• Updated Maximum2000/Phi-3.5-mini-instruct-cuda-fp32-onnx
Text Generation
• Updated microsoft/Phi-3.5-mini-instruct-onnx
Text Generation
• Updated • 119
• 37
philipp-zettl/bart-large-cnn
Summarization
• Updated • 4
onnx-community/Llama-3.2-1B-Instruct-GENAI-ONNX
Text Generation
• Updated • 31
• 1
onnx-community/Llama-3.2-3B-Instruct-GENAI-ONNX
Text Generation
• Updated • 15
• 20