Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
14
10
14
Gabriel Mongaras
gmongaras
Follow
timlee0131's profile picture
little-lake-studios's profile picture
mshojaei77's profile picture
6 followers
·
6 following
gmongaras
gmongaras
gmongaras
AI & ML interests
None yet
Recent Activity
authored
a paper
21 days ago
On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective
commented
on
a paper
27 days ago
On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective
liked
a model
about 1 month ago
ACE-Step/ACE-Step-v1-3.5B
View all activity
Organizations
gmongaras
's models
26
Sort: Recently updated
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3
Updated
May 14
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_40batchsize_stage2
Updated
Apr 28
gmongaras/t
Updated
Apr 24
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_140batchsize_stage1
Updated
Apr 19
gmongaras/Llama3.1_8B_Instruct_GRPO_gsm8k
Updated
Apr 15
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps_stage2
Updated
Apr 11
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps
Updated
Apr 11
gmongaras/Latent_Diffusion_Model_Imagenet2012_Softmax_250000
Updated
Feb 27
gmongaras/Softmax_Attention_BERT
Feature Extraction
•
Updated
Oct 7, 2024
•
3
gmongaras/Cosine_Attention_BERT
Feature Extraction
•
Updated
Oct 7, 2024
•
2
gmongaras/Cosine_Attention_GPT_1.2B
Feature Extraction
•
Updated
Oct 7, 2024
•
4
gmongaras/Cosine_Attention_GPT_300M
Feature Extraction
•
Updated
Oct 7, 2024
•
4
gmongaras/Softmax_Attention_GPT_1.2B
Feature Extraction
•
Updated
Oct 7, 2024
•
3
gmongaras/Softmax_Attention_GPT_300M
Feature Extraction
•
Updated
Oct 7, 2024
•
4
gmongaras/Yann_UWU
Text Generation
•
7B
•
Updated
Oct 5, 2024
•
3
gmongaras/Meta-Llama-3.1-8B
Text Generation
•
8B
•
Updated
Sep 22, 2024
•
2.2k
gmongaras/reddit_negative_v1_13B
Text Generation
•
Updated
Sep 15, 2023
•
5
•
1
gmongaras/Wizard_7B_Squad_v2
Text Generation
•
Updated
Sep 15, 2023
•
5
gmongaras/reddit_negative_v1_8B
Text Generation
•
Updated
Sep 15, 2023
•
5
gmongaras/Wizard_7B_Reddit_Political_2019_13B
Text Generation
•
Updated
Sep 15, 2023
•
5
gmongaras/Wizard_7B_Squad_8bit
Text Generation
•
Updated
Sep 11, 2023
•
5
gmongaras/Wizard_7B_Squad
Text Generation
•
Updated
Sep 11, 2023
•
5
gmongaras/Wizard_7B_Reddit_Political_2019
Text Generation
•
Updated
Sep 11, 2023
•
5
gmongaras/Wizard_7B_Reddit_Political_2019_8bit
Text Generation
•
7B
•
Updated
Sep 11, 2023
•
6
gmongaras/wizardLM-7B-HF-8bit
Text Generation
•
Updated
Sep 7, 2023
•
1
gmongaras/gpt-anime-sub-1.3B
Text Generation
•
1B
•
Updated
Apr 26, 2023
•
31
•
6