Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jrahn
/
gpt3_125M_edu_hermes
like
0
Text Generation
Transformers
Safetensors
HuggingFaceFW/fineweb-edu
teknium/OpenHermes-2.5
English
gpt2
llm.c
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
26bf3cc
gpt3_125M_edu_hermes
1 contributor
History:
7 commits
jrahn
Upload loss_curve.png
26bf3cc
verified
4 months ago
.gitattributes
1.52 kB
initial commit
4 months ago
README.md
3.85 kB
Update README.md
4 months ago
config.json
769 Bytes
Upload model
4 months ago
generation_config.json
119 Bytes
Upload model
4 months ago
loss_curve.png
149 kB
Upload loss_curve.png
4 months ago
main.log
796 kB
Upload main.log with huggingface_hub
4 months ago
merges.txt
456 kB
Upload tokenizer
4 months ago
model.safetensors
250 MB
LFS
Upload model
4 months ago
run_gpt3_125M_edu_hermes.sh
1.37 kB
Upload run_gpt3_125M_edu_hermes.sh with huggingface_hub
4 months ago
special_tokens_map.json
438 Bytes
Upload tokenizer
4 months ago
tokenizer_config.json
514 Bytes
Upload tokenizer
4 months ago
vocab.json
999 kB
Upload tokenizer
4 months ago