Active filters: trl
Evan-Lin/Bart-abs-yelp-entailment-1 • Reinforcement Learning • 47
Evan-Lin/Bart-abs-yelp-allure-v1 • Reinforcement Learning • 47
Evan-Lin/Bart-abs-yelp-allure3 • Reinforcement Learning • 47
Evan-Lin/Bart-abs-yelp-allure2 • Reinforcement Learning • 47
Evan-Lin/Bart-abs-yelp-allure5 • Reinforcement Learning • 49
Evan-Lin/Bart-large-abs-yelp-allure5 • Reinforcement Learning • 47
Evan-Lin/Bart-large-abs-yelp-allure2 • Reinforcement Learning • 45
Evan-Lin/Bart-large-abs-yelp-allure • Reinforcement Learning • 47
Evan-Lin/Bart-large-abs-yelp-entailment • Reinforcement Learning • 49
Evan-Lin/Bart-large-abs-yelp-allure-entailment • Reinforcement Learning • 48
Evan-Lin/Bart-large-abs-amazon-allure • Reinforcement Learning • 47
Evan-Lin/Bart-large-abs-amazon-entailment2-rouge • Reinforcement Learning • 45
Evan-Lin/Bart-large-abs-amazon-allure2 • Reinforcement Learning • 47
Evan-Lin/Bart-large-abs-amazon-entailment • Reinforcement Learning • 48
renyulin/gptneo125m-detoxify-ppo • Reinforcement Learning • 15
vwxyzjn/starcoderbase-triviaqa • Text Generation • 17
Evan-Lin/Bart-large-abs-yelp-inferable • Reinforcement Learning • 1
Evan-Lin/Bart-large-abs-yelp-inferable-2 • Reinforcement Learning • 47
lvwerra/starcoderbase-gsm8k • Text Generation • 16
approach0/mathy-vicuna-13B-FFT-queryLM-adapter • Reinforcement Learning
Evan-Lin/yelp-attractive-1 • Reinforcement Learning • 47
Evan-Lin/yelp-attractive-3 • Reinforcement Learning • 49
Evan-Lin/yelp-attractive-2 • Reinforcement Learning • 47
Evan-Lin/yelp-attractive-4 • Reinforcement Learning • 47
Evan-Lin/yelp-attractive-keyword-1 • Reinforcement Learning • 47
Evan-Lin/yelp-attractive-large-1 • Reinforcement Learning • 47
amirabdullah19852020/pythia-160m_sentiment_reward • Reinforcement Learning • 30
amirabdullah19852020/pythia-70m_sentiment_reward • Reinforcement Learning • 15
amirabdullah19852020/pythia-410m_sentiment_reward • Reinforcement Learning • 65
amirabdullah19852020/pythia-70m_utility_reward • Reinforcement Learning • 18