CodeAtCMU/Qwen3-1.7B-Base-DataMix_full_sft_code_natural_language_mix_nl_5_data_120K Text Generation • 2B • Updated Jun 13 • 20
CodeAtCMU/Qwen3-1.7B-Base-DataMix_full_sft_code_natural_language_mix_nl_80_data_120K Text Generation • 2B • Updated Jun 13 • 22
CodeAtCMU/Qwen3-1.7B-Base-DataMix_full_sft_code_natural_language_mix_nl_20_data_120K Text Generation • 2B • Updated Jun 13 • 24
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_40_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_80_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_60_data_120K Text Generation • 0.6B • Updated Jun 12 • 20
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_10_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_97p5_data_120K Text Generation • 0.6B • Updated Jun 12 • 20
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_2p5_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_5_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_90_data_120K Text Generation • 0.6B • Updated Jun 12 • 20
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_20_data_120K Text Generation • 0.6B • Updated Jun 12 • 18
CodeAtCMU/Qwen3-0.6B-Base-DataMix_full_sft_code_natural_language_mix_nl_95_data_120K Text Generation • 0.6B • Updated Jun 12 • 20
CodeAtCMU/Qwen3-0.6B-Base_full_sft_code_data_120K_replace_variables Text Generation • 0.6B • Updated Jun 12 • 8
CodeAtCMU/Qwen3-0.6B-Base_full_sft_code_data_120K_remove_comments Text Generation • 0.6B • Updated Jun 11 • 9
CodeAtCMU/Qwen3-0.6B-Base_full_sft_code_data_120K_remove_whitespace Text Generation • 0.6B • Updated Jun 11 • 11
CodeAtCMU/SmolLM2-360M_full_sft_natural_language_data_120K Text Generation • 0.4B • Updated Jun 2 • 9
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_9 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_3 Text Generation • 1.0B • Updated Jun 2 • 17
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_4 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_0 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_7 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_5 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_2 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_8 Text Generation • 1.0B • Updated Jun 2 • 16
CodeAtCMU/gemma-3-1b-pt_full_sft_natural_language_data_shard_6 Text Generation • 1.0B • Updated Jun 2 • 16