metadata
datasets:
- allenai/c4
language:
- en
library_name: transformers
license: apache-2.0
Model Description
This is a model using the llama2 architecture and only 30 million parameters. It is trained on approximately 2 billion tokens of diverse web data from the first 1000000 rows of the uncleaned c4 english dataset.