T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge Paper • 2407.00088 • Published Jun 25, 2024 • 11
InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding Paper • 2401.09149 • Published Jan 17, 2024 • 1
AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning Paper • 2311.00257 • Published Nov 1, 2023 • 10