Gyanateet Dutta

Ryukijano

https://ryukijano.github.io

AI & ML interests

Computer Graphics, General Artificial Intelligence,model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.

Recent Activity

liked a model about 11 hours ago

google/timesfm-2.0-500m-pytorch

updated a collection 2 days ago

Text-3D

liked a model 2 days ago

stabilityai/stable-point-aware-3d

View all activity

Organizations

Ryukijano's activity

upvoted a collection about 1 month ago

VILA: On Pre-training for Visual Language Models

Collection

10 items • Updated Oct 31, 2024 • 48

upvoted an article about 2 months ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

•

Nov 19, 2024

• 11

upvoted 2 papers about 2 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

Grounding Image Matching in 3D with MASt3R

Paper • 2406.09756 • Published Jun 14, 2024 • 1

upvoted an article about 2 months ago

Article

How to run Gemini Nano locally in your browser

•

Jul 11, 2024

• 43

upvoted 2 collections 2 months ago

Sparsh

Collection

Models and datasets for Sparsh: Self-supervised touch representations for vision-based tactile sensing • 15 items • Updated Oct 24, 2024 • 12

MobileLLM

Collection

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted a collection 3 months ago

Stable Diffusion 3.5

Collection

6 items • Updated 3 days ago • 121

upvoted an article 3 months ago

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

•

Oct 21, 2024

• 32

upvoted a paper 3 months ago

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

Paper • 2410.10774 • Published Oct 14, 2024 • 25

upvoted a paper 4 months ago

MonoFormer: One Transformer for Both Diffusion and Autoregression

Paper • 2409.16280 • Published Sep 24, 2024 • 18

upvoted 4 collections 4 months ago

upvoted an article 5 months ago

Article

Scaling robotics datasets with video encoding

Aug 27, 2024

• 35

upvoted a paper 5 months ago

Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Paper • 2407.21705 • Published Jul 31, 2024 • 27

upvoted a collection 6 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 638

upvoted an article 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 298

upvoted a collection 6 months ago

DCLM

Collection

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 42