🚩 Report

#137
by hayleecs - opened

No information about the training data is provided. How are we supposed to compare it with the LLaMA 2 models, and how can we justify the change in perplexity on common datasets such as WikiText?