-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 97 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 256 -
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Paper • 2404.16710 • Published • 77
Mayor
Eric111
AI & ML interests
None yet
Recent Activity
liked
a model
2 days ago
Wan-AI/Wan2.1-I2V-14B-720P
liked
a model
4 days ago
facebook/llm-compiler-13b
liked
a model
4 days ago
moonshotai/Moonlight-16B-A3B-Instruct
Organizations
None yet
Collections
1
models
46
Eric111/CatunaMayo3B-DPO
Updated
Eric111/CatunaMayo3B
Text Generation
•
Updated
•
5
Eric111/UltraCatunaMayo-DPO-GGUF
Updated
•
16
Eric111/UltraCatunaMayo-DPO
Text Generation
•
Updated
•
37
Eric111/UltraCatunaMayo-GGUF
Updated
•
12
Eric111/UltraCatunaMayo
Text Generation
•
Updated
•
10
Eric111/stablelm-zephyr-6b
Updated
Eric111/CatunaLaserPi-DPO
Text Generation
•
Updated
•
30
•
1
Eric111/Mistral-7B-Instruct_v0.2_UNA-TheBeagle-7b-v1
Text Generation
•
Updated
•
18
•
1
Eric111/MistInst-v0.2_ochat-3.5-0106_dpo-binarized-NeuralTrix-7B
Text Generation
•
Updated
•
20
datasets
None public yet