Cross

dillfrescott

AI & ML interests

AI, anime, computers

Recent Activity

liked a model 1 day ago
unsloth/phi-4-GGUF
liked a model 1 day ago
bartowski/Sky-T1-32B-Preview-GGUF
liked a model 1 day ago
NovaSky-AI/Sky-T1-32B-Preview

Organizations

The Waifu Research Department

dillfrescott's activity

reacted to Jaward's post with 👍❤️ 15 days ago
nanoBLT: A simplified, lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 layers deep (n_layers_encoder, n_layers_latent, n_layers_decoder) and was trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
reacted to hexgrad's post with 🤗👍 15 days ago
Tonight, Adam & Michael join the 82M Apache TTS model in hexgrad/Kokoro-82M
reacted to AdinaY's post with 🤗👍❤️🚀 17 days ago
The Chinese community is shipping 🚢

DeepSeek V3 (685B MoE) has been quietly released on the Hub!
Base: deepseek-ai/DeepSeek-V3-Base
Instruct: deepseek-ai/DeepSeek-V3

Can't wait to see what's next!