1 23 42

罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

Recent Activity

liked a model 12 days ago

meta-llama/Llama-3.3-70B-Instruct

upvoted a paper about 2 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

liked a model about 2 months ago

tencent/Tencent-Hunyuan-Large

View all activity

Organizations

rojasdiego's activity

liked a model 12 days ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated 3 days ago • 277k • • 1.28k

upvoted a paper about 2 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4 • 35

liked 3 models about 2 months ago

upvoted a paper 2 months ago

Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 17

commented a paper 2 months ago

Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 17 •

liked a model 2 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25 • 169k • 1.93k

updated a collection 3 months ago

Reading List

Collection

2 items • Updated Oct 4

upvoted a paper 3 months ago

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Paper • 2410.02367 • Published Oct 3 • 47

updated 2 models 3 months ago

rojasdiego/Meta-Llama-3.1-8B-Instruct-Apple-MLX

Text Generation • Updated Oct 2 • 10

rojasdiego/Meta-Llama-3.1-8B-Instruct-Apple-MLX-Adapter

Updated Oct 2

updated a dataset 3 months ago

rojasdiego/Apple-MLX-QA

Viewer • Updated Oct 2 • 9 • 39

liked a model 3 months ago

meta-llama/Llama-3.2-3B

Text Generation • Updated Oct 24 • 1.17M • 419

upvoted a collection 3 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548

updated 2 models 3 months ago

koyeb/Meta-Llama-3.1-8B-Instruct-Apple-MLX-Adapter

Question Answering • Updated Sep 26

koyeb/Meta-Llama-3.1-8B-Instruct-Apple-MLX

Question Answering • Updated Sep 26 • 23

upvoted a paper 4 months ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 88