Models from "Predicting Task Performance with Context-aware Scaling Laws"

WangLab
university
AI & ML interests
NLP
Models (16)

WangResearchLab/Yarn-Llama-2-13b-128k
Text Generation
10

WangResearchLab/Yarn-Llama-2-13b-64k
Text Generation
11

WangResearchLab/Yarn-Llama-2-13b-32k
Text Generation
10

WangResearchLab/Yarn-Llama-2-13b-16k
Text Generation
9

WangResearchLab/Yarn-Llama-2-13b-8k
Text Generation
6

WangResearchLab/Llama-2-13b-hf
Text Generation
16

WangResearchLab/Yarn-Llama-2-7b-128k
Text Generation
21

WangResearchLab/Yarn-Llama-2-7b-64k
Text Generation
7

WangResearchLab/Yarn-Llama-2-7b-32k
Text Generation
11

WangResearchLab/Yarn-Llama-2-7b-16k
Text Generation
9