Alignment to avoid bias related to tiananmen square and Taiwan.

Uploaded model

Developed by: EpistemeAI
License: apache-2.0
Finetuned from model : EpistemeAI/Reasoning-Llama-3.2-3B-Math-Instruct-RE1-ORPO

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

Model tree for EpistemeAI/Reasoning-Llama-3.2-3B-Math-Instruct-RE1-ORPO-align

Base model

Finetuned

Finetuned

(2)

this model

Quantizations