Kazuaki Hiraga

kazuakey

AI & ML interests

NLP, Sentiment Analysis, Named Entity Recognition

Recent Activity

liked a dataset 8 days ago

Aratako/Synthetic-JP-Preference-Dataset-Qwen2.5_72B-191k

liked a dataset 8 days ago

AI-MO/NuminaMath-CoT

kazuakey's activity

upvoted a paper 1 day ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 4 days ago • 35

upvoted a paper 23 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

upvoted a collection 23 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 4 days ago • 19

upvoted a collection 26 days ago

TinySwallow

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 26 days ago • 16

upvoted 2 papers 27 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 30 days ago • 61

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 30 days ago • 62

upvoted 2 papers 28 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 51

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21 • 7

upvoted a paper about 1 month ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 51

upvoted an article about 1 month ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 148

upvoted a paper about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

upvoted an article about 1 month ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 157

upvoted 2 collections about 1 month ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 4 days ago • 240

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 4 days ago • 34

upvoted an article 3 months ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • Nov 21, 2024 • 35

upvoted an article 4 months ago

Transformers.js v3: WebGPU support, new models & tasks, and more…

Article • Oct 22, 2024 • 67

upvoted a collection 4 months ago

Llama-3.1-Swallow

9 items • Updated 25 days ago • 5

upvoted an article 4 months ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • Oct 14, 2024 • 72

upvoted an article 5 months ago

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

Article • Oct 2, 2024 • 53

upvoted a collection 5 months ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated Jan 17 • 51