Jeetendra Kumar

jeetendra760

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Progressive Multimodal Reasoning via Active Retrieval

upvoted a paper 4 days ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

upvoted a paper 5 days ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

View all activity

Organizations

None yet

jeetendra760's activity

upvoted a paper 2 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 7 days ago • 66

upvoted a paper 4 days ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published 7 days ago • 22

upvoted 3 papers 5 days ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21 • 58

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Paper • 2412.10704 • Published 12 days ago • 14

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16

upvoted a collection 7 days ago

Bamba

Collection

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 8 days ago • 16

upvoted 2 collections 8 days ago

Navarasa

Collection

Collection of Gemma finetuned 7B/ 2B Indic Navarasa models. • 4 items • Updated Mar 18 • 2

Navarasa 2.0 Models

Collection

Collection of models Navarasa 2.0 Models finetuned with Gemma on 15 Indian languages • 5 items • Updated Mar 18 • 17

upvoted a paper 15 days ago

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Paper • 2401.01827 • Published Jan 3 • 16

upvoted a paper 16 days ago

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 20 days ago • 48

upvoted a paper 18 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published 20 days ago • 49

upvoted a paper 19 days ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published 20 days ago • 55

upvoted a paper 23 days ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published 28 days ago • 13

upvoted a collection 23 days ago

INTELLECT-1

Collection

13 items • Updated 20 days ago • 7

upvoted 2 papers 27 days ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 30 days ago • 47

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14 • 57

upvoted a paper about 1 month ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18 • 75