GODsStrongestSoldier commited on
Commit
86b4458
·
verified ·
1 Parent(s): c605afb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +205 -1
README.md CHANGED
@@ -14,4 +14,208 @@ datasets:
14
  - m-a-p/CodeFeedback-Filtered-Instruction
15
  - m-a-p/Code-Feedback
16
 
17
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  - m-a-p/CodeFeedback-Filtered-Instruction
15
  - m-a-p/Code-Feedback
16
 
17
+ ---
18
+ Llama3.2-Agent.Hermes.Coder-3B (GGUF)
19
+
20
+ 📌 Model Overview
21
+
22
+ Model Name: WithinUsAI/Llama3.2-Agent.Hermes.Coder-3B-gguf
23
+ Organization: Within Us AI
24
+ Base Model: NousResearch/Hermes-3-Llama-3.2-3B
25
+ Architecture: LLaMA 3.2 (3B) + Hermes 3 fine-tuning
26
+ Format: GGUF (quantized for local inference)
27
+ Primary Focus: Agentic coding + structured reasoning
28
+
29
+ This model is a Hermes-enhanced LLaMA 3.2 coder, optimized for agent workflows, structured outputs, and high-control instruction following in a compact 3B footprint.
30
+
31
+ It blends:
32
+
33
+ * LLaMA 3.2’s strong foundation
34
+ * Hermes 3’s alignment + tool-use intelligence
35
+ * WithinUsAI’s agentic coding focus
36
+
37
+
38
+
39
+ 🧬 Architecture & Lineage
40
+
41
+ Base Stack
42
+
43
+ * Foundation: LLaMA 3.2 (3B parameter class)
44
+ * Fine-Tune: Hermes 3 (Nous Research)
45
+ * Conversion: GGUF via llama.cpp toolchain
46
+
47
+ Hermes 3 is known for:
48
+
49
+ * Strong instruction-following
50
+ * Multi-turn conversation stability
51
+ * Tool-use and function-calling capabilities
52
+ * Improved reasoning and controllability 
53
+
54
+ What WithinUsAI Adds
55
+
56
+ This variant emphasizes:
57
+
58
+ * Coding-first behavior
59
+ * Agentic task execution
60
+ * Structured outputs (JSON, functions, steps)
61
+
62
+
63
+
64
+ 🧠 Core Design Philosophy
65
+
66
+ This model operates like a disciplined junior engineer with a systems mindset 🧩💻
67
+
68
+ Not just generating code…
69
+ but thinking in steps, outputs, and actions.
70
+
71
+ Design Goals:
72
+
73
+ * High controllability (Hermes-style alignment)
74
+ * Strong coding bias
75
+ * Agent compatibility
76
+ * Efficient local deployment
77
+
78
+
79
+
80
+ ⚙️ Key Capabilities
81
+
82
+ 💻 Coding
83
+
84
+ * Python, JavaScript, C++, and more
85
+ * Function generation and refactoring
86
+ * Debugging and structured fixes
87
+
88
+ 🤖 Agentic Behavior
89
+
90
+ * Task decomposition
91
+ * Step-by-step execution planning
92
+ * Function calling / tool-use readiness
93
+
94
+ 🧠 Reasoning
95
+
96
+ * Chain-of-thought style outputs
97
+ * Logical breakdown of problems
98
+ * Instruction precision
99
+
100
+ 📦 Structured Output
101
+
102
+ * JSON generation
103
+ * Schema-following responses
104
+ * Deterministic formatting (strong Hermes trait)
105
+
106
+
107
+
108
+ 📦 GGUF Format & Deployment
109
+
110
+ Optimized for local inference and edge environments.
111
+
112
+ Supported Runtimes:
113
+
114
+ * llama.cpp
115
+ * LM Studio
116
+ * Ollama (GGUF-compatible builds)
117
+
118
+ Typical Quantizations (3B):
119
+
120
+ Quant Size Notes
121
+ Q4_K_M ~2.0 GB Best balance
122
+ Q5_K_M ~2.3 GB Higher quality
123
+ Q8_0 ~3.4 GB Maximum fidelity
124
+
125
+ Quantization enables large size reduction while maintaining usable performance, making local deployment practical. 
126
+
127
+
128
+
129
+ 🚀 Intended Use
130
+
131
+ ✅ Ideal Use Cases
132
+
133
+ * Local coding assistants
134
+ * Agent frameworks (tool-calling pipelines)
135
+ * Structured output systems (JSON APIs)
136
+ * Autonomous coding workflows
137
+ * Offline developer copilots
138
+
139
+ ⚠️ Limitations
140
+
141
+ * 3B size limits deep reasoning vs larger models
142
+ * Requires good prompt structure for best results
143
+ * Tool execution must be handled externally
144
+
145
+
146
+
147
+ 🛠️ Usage Example (llama.cpp)
148
+
149
+ ./main -m Llama3.2-Agent.Hermes.Coder-3B.Q4_K_M.gguf \
150
+ -p "Create a JSON schema and Python validator for user authentication." \
151
+ -n 512
152
+
153
+
154
+
155
+ 🧪 Training & Methodology
156
+
157
+ Within Us AI pipeline emphasizes:
158
+
159
+ * Instruction-tuned coding datasets
160
+ * Agentic workflow examples
161
+ * Structured output training
162
+ * Evaluation-driven refinement
163
+
164
+ Data Sources
165
+
166
+ * Proprietary Within Us AI datasets
167
+ * Third-party datasets (no ownership claimed)
168
+ * Focus areas:
169
+ * Code reasoning
170
+ * Tool usage patterns
171
+ * Step-by-step problem solving
172
+
173
+
174
+
175
+ 📊 Expected Performance Profile
176
+
177
+ Capability Strength
178
+ Coding High
179
+ Instruction following Very High
180
+ Structured output Very High
181
+ Reasoning depth Moderate
182
+ Efficiency Very High
183
+
184
+
185
+
186
+ 📜 License
187
+
188
+ License Type: LLaMA 3 / Hermes 3 compatible licensing (inherits base restrictions)**
189
+
190
+ Attribution Notes:
191
+
192
+ * Base model: Meta (LLaMA 3.2)
193
+ * Fine-tune: Nous Research (Hermes 3)
194
+ * GGUF + optimization + methodology: Within Us AI
195
+ * Third-party datasets used without ownership claims
196
+ * Credit belongs to original creators
197
+
198
+
199
+
200
+ 🙏 Acknowledgements
201
+
202
+ * Meta (LLaMA 3 architecture)
203
+ * Nous Research (Hermes 3 fine-tuning)
204
+ * GGUF / llama.cpp ecosystem
205
+ * Open-source AI community
206
+
207
+
208
+
209
+ 🔗 Links
210
+
211
+ * Model: https://huggingface.co/WithinUsAI/Llama3.2-Agent.Hermes.Coder-3B-gguf
212
+ * Organization: https://huggingface.co/WithinUsAI
213
+
214
+
215
+
216
+ 🧩 Closing Note
217
+
218
+ This model feels like a precision tool in a small chassis ⚙️
219
+
220
+ It doesn’t just answer…
221
+ it organizes, structures, and executes.