Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs
Paper
•
2411.08719
•
Published
Researching and building foundation models with improved generalization and reasoning. LAION & friends spin-off for open-sourcing foundation models with strong generalization and reasoning , including datasets necessary for their creation, to serve as common open, reproducible grounds for further research experiments.