dame rajee's picture

dame rajee

damerajee

·

AI & ML interests

None yet

Organizations

Posts 2

Post

540

On the 2nd of October a really cool paper was released called "Were RNNs all we need" https://arxiv.org/abs/2410.01201

This paper introduces the MinGRU model, a simplified version of the traditional Gated Recurrent Unit (GRU) designed to enhance efficiency by removing hidden state dependencies from its gates. This allows for parallel training, making it significantly faster than conventional GRUs. Additionally, MinGRU eliminates non-linear activations like tanh, streamlining computations.

So I read the paper and I tried training this model and it seems to be doing quite well , you could check out the pre-trained model on the huggingface spaces

- damerajee/mingru-stories

Post

1969

Just released ViLaH - a compact 3B parameter vision language model! which generates responses in Hindi only hindi for now 😔

BhashaAI/ViLaH

Collections 5

View 5 collections

spaces 1

Mingru Stories

models 92

damerajee/super-transformers-model

damerajee/qwen-spectrum

Updated Dec 8, 2024

damerajee/llama-tinystories

Updated Nov 26, 2024

damerajee/mingru

Updated Oct 18, 2024

damerajee/Barlowtwins-50

0.0B • Updated Oct 5, 2024 • 3

damerajee/barlow-twins-pt

Updated Sep 30, 2024

damerajee/gpt-small

Updated Sep 23, 2024

damerajee/MAE

Updated Sep 12, 2024

damerajee/paligemma-hindi-part-2

Updated Sep 7, 2024 • 2

damerajee/paligemma-hindi-part-3

Updated Sep 6, 2024

datasets 73

damerajee/OpenO1-SFT-MATH

Viewer • Updated Jan 6 • 491k • 7 • 1

damerajee/clean_vqa_prt2

Viewer • Updated Jul 16, 2024 • 273k • 39 • 1

damerajee/clean_data_vqa

Viewer • Updated Jul 13, 2024 • 300k • 21 • 1

damerajee/Llava-pretrain-small

Viewer • Updated Jun 28, 2024 • 250k • 72 • 1

damerajee/audio_pre-training-v1.3

Viewer • Updated Jun 23, 2024 • 500 • 1

damerajee/pre-train_audio-hin

Viewer • Updated Jun 22, 2024 • 53 • 3

damerajee/short_text_audio-3

Viewer • Updated Jun 22, 2024 • 2.19k • 8

damerajee/short_text_audio-2

Viewer • Updated Jun 21, 2024 • 1.14k • 6

damerajee/short_text_audio

Viewer • Updated Jun 21, 2024 • 129 • 5

damerajee/long_text_audio

Viewer • Updated Jun 21, 2024 • 129 • 6

View 73 datasets