wasiuddina committed
Commit 8222feb · verified · 1 Parent(s): 6166322

Update README.md

Files changed (1): README.md (+10 -10)
README.md CHANGED

@@ -13,10 +13,10 @@ tags:
 pipeline_tag: text-generation
 ---
 
-# OpenCodeReasoning-Nemotron-7B-v1.1 Overview
+# OpenCodeReasoning-Nemotron-1.1-7B Overview
 
 ## Description: <br>
-OpenCodeReasoning-Nemotron-7B-v1.1 is a large language model (LLM) which is a derivative of Qwen2.5-7B-Instruct (AKA the reference model). It is a reasoning model that is post-trained for reasoning for code generation. The model supports a context length of 64k tokens. <br>
+OpenCodeReasoning-Nemotron-1.1-7B is a large language model (LLM) which is a derivative of Qwen2.5-7B-Instruct (AKA the reference model). It is a reasoning model that is post-trained for reasoning for code generation. The model supports a context length of 64k tokens. <br>
 
 This model is ready for commercial/non-commercial use. <br>
 
@@ -131,7 +131,7 @@ Architecture Type: Dense decoder-only Transformer model
 Network Architecture: Qwen-7B-Instruct
 <br>
 **This model was developed based on Qwen2.5-7B-Instruct and has 7B model parameters. <br>**
-**OpenCodeReasoning-Nemotron-7B-v1.1 was developed based on Qwen2.5-7B-Instruct and has 7B model parameters. <br>**
+**OpenCodeReasoning-Nemotron-1.1-7B was developed based on Qwen2.5-7B-Instruct and has 7B model parameters. <br>**
 
 ## Input: <br>
 **Input Type(s):** Text <br>
@@ -156,30 +156,30 @@ NVIDIA Hopper <br>
 
 ## Model Version(s):
 1.1 (6/20/2025) <br>
-OpenCodeReasoning-Nemotron-7B-v1.1<br>
-OpenCodeReasoning-Nemotron-14B-v1.1<br>
-OpenCodeReasoning-Nemotron-32B-v1.1<br>
+OpenCodeReasoning-Nemotron-1.1-7B<br>
+OpenCodeReasoning-Nemotron-1.1-14B<br>
+OpenCodeReasoning-Nemotron-1.1-32B<br>
 
 
 # Training and Evaluation Datasets: <br>
 
 ## Training Dataset:
 
-The training corpus for OpenCodeReasoning-Nemotron-7B-v1.1 is [OpenCodeReasoning](https://huggingface.co/datasets/nvidia/OpenCodeReasoning) dataset, which is composed of competitive programming questions and DeepSeek-R1 generated responses.
+The training corpus for OpenCodeReasoning-Nemotron-1.1-7B is [OpenCodeReasoning](https://huggingface.co/datasets/nvidia/OpenCodeReasoning) dataset, which is composed of competitive programming questions and DeepSeek-R1 generated responses.
 
 Data Collection Method: Hybrid: Automated, Human, Synthetic <br>
 Labeling Method: Hybrid: Automated, Human, Synthetic <br>
 Properties: 1.165M samples from OpenCodeReasoning (https://huggingface.co/datasets/nvidia/OpenCodeReasoning)
 
 ## Evaluation Dataset:
-We used the datasets listed in the next section to evaluate OpenCodeReasoning-Nemotron-7B-v1.1. <br>
+We used the datasets listed in the next section to evaluate OpenCodeReasoning-Nemotron-1.1-7B. <br>
 **Data Collection Method: Hybrid: Automated, Human, Synthetic <br>**
 **Labeling Method: Hybrid: Automated, Human, Synthetic <br>**
 
 
 
 ### License/Terms of Use: <br>
-GOVERNING TERMS: Use of this model is governed by [Apache 2.0](https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-7B-v1.1/blob/main/LICENSE).
+GOVERNING TERMS: Use of this model is governed by [Apache 2.0](https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-1.1-7B/blob/main/LICENSE).
 
 ### Deployment Geography:
 Global<br>
@@ -188,7 +188,7 @@ Global<br>
 This model is intended for developers and researchers building LLMs. <br>
 
 ### Release Date: <br>
-Huggingface [06/20/2025] via https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-7B-v1.1/ <br>
+Huggingface [06/20/2025] via https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-1.1-7B/ <br>
 
 ## Reference(s):
 [2504.01943] OpenCodeReasoning: Advancing Data Distillation for Competitive Coding
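
Since this commit renames the repository references from `OpenCodeReasoning-Nemotron-7B-v1.1` to `OpenCodeReasoning-Nemotron-1.1-7B`, a minimal generation sketch against the new model ID is below. It assumes the standard Hugging Face `transformers` chat-template API and that the `nvidia/OpenCodeReasoning-Nemotron-1.1-7B` repository referenced in the updated card is live; the prompt and generation settings are illustrative, not recommended values.

```python
# Minimal sketch: load the checkpoint under its renamed ID and generate a
# completion. The model ID matches the repository name introduced by this
# commit; dtype, device placement, and the token budget are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nvidia/OpenCodeReasoning-Nemotron-1.1-7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision to fit the 7B model on one GPU
    device_map="auto",
)

# The card describes a reasoning model with a 64k-token context, so the
# reasoning trace can be long; leave headroom in max_new_tokens.
messages = [
    {
        "role": "user",
        "content": "Write a Python function that returns the longest "
                   "palindromic substring of a string.",
    }
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=4096)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```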