Transformers
PyTorch
English
llama
reward model
RLHF
RLAIF
text-generation-inference
File size: 15 Bytes
6f8f5dc
1
global_step1400