OFA-Sys

non-profit

AI & ML interests

Pretraining, Multimodality, NLP, CV, etc.

OFA-Sys's activity

JustinLin610

authored a paper 5 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 6 days ago • 59

Losin94

authored a paper 5 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 6 days ago • 59

JustinLin610

authored a paper 13 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88

Losin94

authored a paper 13 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88

JustinLin610

authored a paper 14 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 17 days ago • 69

Losin94

authored a paper 14 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 17 days ago • 69

JustinLin610

authored a paper 17 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 54

Losin94

authored a paper 21 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

JustinLin610

authored a paper 21 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

mingfengxue

authored a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

keminglu

authored a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

JustinLin610

authored a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

Losin94

authored 8 papers about 1 month ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

Paper • 2106.06125 • Published Jun 11, 2021

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 26

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Paper • 2406.14024 • Published Jun 20, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Paper • 2409.12122 • Published Sep 18, 2024 • 3

Team members 14

OFA-Sys's activity