Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Luckeciano Carvalho Melo
luckeciano
Follow
0 followers
·
1 following
https://luckeciano.github.io
LuckecianoMelo
luckeciano
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a Space
3 days ago
luckeciano/maxent-rl-eval-leaderboard
updated
a model
3 days ago
luckeciano/Qwen-2.5-7B-Simple-RL
updated
a model
4 days ago
luckeciano/Qwen-2.5-1.5B-Simple-RL
View all activity
Organizations
Papers
1
arxiv:
2206.06614
spaces
1
Sleeping
Maxent Rl Eval Leaderboard
🏃
Display evaluation metrics from JSON results
models
7
Sort: Recently updated
luckeciano/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
3 days ago
•
8
luckeciano/Qwen-2.5-1.5B-Simple-RL
Text Generation
•
Updated
4 days ago
•
9
luckeciano/Qwen-2.5-0.5B-Instruct-Simple-RL
Updated
10 days ago
luckeciano/merged-hermes-reward-model-reddit
Text Classification
•
Updated
Feb 2, 2024
•
4
luckeciano/merged-llama7b-reward-model-reddit
Text Classification
•
Updated
Jan 19, 2024
•
9
luckeciano/merged-gpt2-xl-sft-reddit
Text Generation
•
Updated
Dec 12, 2023
•
57
luckeciano/merged-llama-sft-reddit
Text Generation
•
Updated
Dec 11, 2023
•
10
datasets
10
Sort: Recently updated
luckeciano/mistral8x22b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
92.9k
•
50
luckeciano/llama370b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
82.5k
•
49
luckeciano/llama370b-features-reddit
Viewer
•
Updated
May 7, 2024
•
150k
•
78
luckeciano/mistral8x22b-features-reddit
Viewer
•
Updated
Apr 22, 2024
•
166k
•
115
luckeciano/hermes-reddit-post-features
Viewer
•
Updated
Apr 18, 2024
•
92.7k
•
78
luckeciano/llama27b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
189k
•
86
luckeciano/falcon7b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
159k
•
70
luckeciano/hermes-features-ultrafeedback
Viewer
•
Updated
Mar 7, 2024
•
63.8k
•
74
luckeciano/reddit-features-hermes
Viewer
•
Updated
Feb 13, 2024
•
169k
•
393
luckeciano/learning-to-summarize
Viewer
•
Updated
Jan 17, 2024
•
426k
•
153