General Training Features and Strategies
Continual Learning in Language Models 2024-07-02 PPT: presentation-liyun.pptx
Mix of Experts Ronald(Ronald)
Model checkpointing for LLMs 2024-07-31
PPT: Improving checkpointing in LLMs-liyun.pptx
LoRA-related techniques talk
Yiming Yao(Eamon)
Analysis and application of the zero-bubble strategy 2024-09-13
Analysis of bitsandbytes quantization techniques
Wenji Cai
PPT: bitandbytes量化技术分享.pptx
Conf: 8-bit optimizer study
A study of learning-rate scheduling methods for large models
Bo Shi
Quantized training Ronald(Ronald)
2025-03-16 meeting_01.mp4
Introduction to PPO for post-training Chao Pan(Chris)
2025-05-30 ppo介绍.mp4
Quantized training Damian
2025-03-29
meeting_01.mp4
qlora模型量化分享.pptx
Introduction to FLUX Pei Yao
https://arxiv.org/pdf/2406.06858
Introduction to FlashOverlap Yu Zhuang
[2504.19519] FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation
Introduction to PRM reward models Shanying Wu(Mia)
2025-06-11
PPT: reward model.pptx
Recording: reward_model_0611_2025.mp4
Updated by jun chen 16 days ago · 1 revisions