r PRO

oceansweep

AI & ML interests

None yet

Organizations

None yet

oceansweep's activity

upvoted a paper 2 days ago

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 4 days ago • 23

liked a model 4 days ago

LatitudeGames/Wayfarer-Large-70B-Llama-3.3-GGUF

Updated 5 days ago • 5.36k • 16

upvoted 7 papers 5 days ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published 6 days ago • 34

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 8 days ago • 134

Learning Getting-Up Policies for Real-World Humanoid Robots

Paper • 2502.12152 • Published 7 days ago • 36

PAFT: Prompt-Agnostic Fine-Tuning

Paper • 2502.12859 • Published 6 days ago • 13

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published 6 days ago • 34

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Paper • 2502.12464 • Published 6 days ago • 27

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 6 days ago • 73

liked a model 6 days ago

ibm-granite/granite-vision-3.1-2b-preview

Image-Text-to-Text • Updated 3 days ago • 10.9k • 80

liked a model 8 days ago

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • Updated 6 days ago • 13.4k • 43

upvoted 2 papers 9 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 11 days ago • 181

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 11 days ago • 141

upvoted a paper 11 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 14 days ago • 122

upvoted 2 papers 13 days ago

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published 18 days ago • 21

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published 17 days ago • 41

upvoted 2 papers 17 days ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 19 days ago • 56

Jailbreaking with Universal Multi-Prompts

Paper • 2502.01154 • Published 21 days ago • 8

upvoted 2 papers 24 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 25 days ago • 81

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 28 days ago • 33