Update README.md
Browse files
README.md
CHANGED
|
@@ -37,6 +37,9 @@ SteerLM Paper: [SteerLM: Attribute Conditioned SFT as an (User-Steerable) Altern
|
|
| 37 |
|
| 38 |
Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
|
| 39 |
|
|
|
|
|
|
|
|
|
|
| 40 |
## Model Architecture:
|
| 41 |
|
| 42 |
**Architecture Type:** Transformer
|
|
@@ -51,8 +54,6 @@ The SteerLM method involves the following key steps:
|
|
| 51 |
|
| 52 |
Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
|
| 53 |
|
| 54 |
-
You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
|
| 55 |
-
|
| 56 |
## Software Integration:
|
| 57 |
|
| 58 |
**Runtime Engine(s):**
|
|
|
|
| 37 |
|
| 38 |
Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
|
| 39 |
|
| 40 |
+
You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
|
| 41 |
+
|
| 42 |
+
|
| 43 |
## Model Architecture:
|
| 44 |
|
| 45 |
**Architecture Type:** Transformer
|
|
|
|
| 54 |
|
| 55 |
Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
|
| 56 |
|
|
|
|
|
|
|
| 57 |
## Software Integration:
|
| 58 |
|
| 59 |
**Runtime Engine(s):**
|