MogaNet

university

https://github.com/Westlake-AI/MogaNet

Activity Feed Request to join this org

AI & ML interests

MogaNet: Efficient Multi-order Gated Aggregation Network

MogaNet's activity

Lupin1998

authored 15 papers 5 months ago

Cascade-DETR: Delving into High-Quality Universal Object Detection

Paper • 2307.11035 • Published Jul 20, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery

Paper • 2305.04477 • Published May 8, 2023

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Paper • 2310.06003 • Published Oct 9, 2023 • 2

SemiReward: A General Reward Model for Semi-supervised Learning

Paper • 2310.03013 • Published Oct 4, 2023 • 1

LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory

Paper • 2404.11163 • Published Apr 17, 2024

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30, 2024 • 31

Efficient Multi-order Gated Aggregation Network

Paper • 2211.03295 • Published Nov 7, 2022 • 2

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences

Paper • 2406.08128 • Published Jun 12, 2024 • 1

Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

Paper • 2406.05688 • Published Jun 9, 2024

RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design

Paper • 2301.10774 • Published Jan 25, 2023

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training

Paper • 2311.14109 • Published Nov 23, 2023

Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

Paper • 2410.06373 • Published Oct 8, 2024 • 36

Switch EMA: A Free Lunch for Better Flatness and Sharpness

Paper • 2402.09240 • Published Feb 14, 2024 • 3

A Survey on Mixup Augmentations and Beyond

Paper • 2409.05202 • Published Sep 8, 2024

OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning

Paper • 2209.04851 • Published Sep 11, 2022 • 1

ZedongWangAI

authored 5 papers 5 months ago

Switch EMA: A Free Lunch for Better Flatness and Sharpness

Paper • 2402.09240 • Published Feb 14, 2024 • 3

A Survey on Mixup Augmentations and Beyond

Paper • 2409.05202 • Published Sep 8, 2024

OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning

Paper • 2209.04851 • Published Sep 11, 2022 • 1

SemiReward: A General Reward Model for Semi-supervised Learning

Paper • 2310.03013 • Published Oct 4, 2023 • 1

LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory

Paper • 2404.11163 • Published Apr 17, 2024