Hongpei Li
  • Bio
  • Papers
  • Blog
  • Experience
  • Gallery
  • Recent & Upcoming Talks
    • Example Talk
  • Blog
    • Accelerating Nonlinear Programming on GPUs
    • Use Customized CUDA kernel in your PyTorch Code
    • Configure Your Server (Part II) -- Python Environment
    • Unlocking Your GPU's Potential--A Guide to CUDA Samples and Performance Testing
    • Configure Your Server (Part I) -- SSH and Git
  • Publications
    • BenLOC: A Benchmark for Learning to Configure MIP Optimizers
    • PDHCG: A Scalable First-Order Method for Large-Scale Competitive Market Equilibrium Computation
    • A Restarted Primal-Dual Hybrid Conjugate Gradient Method for Large-Scale Quadratic Programming
    • FMIP: Joint Continuous-Integer Flow For Mixed-Integer Linear Programming
    • OptPipe: Memory- and Scheduling-Optimized Pipeline Parallelism for LLM Training
    • Restarted Primal-Dual Hybrid Conjugate Gradient Method for Large-Scale Quadratic Programming
    • Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning
  • Gallery_pics
    • Angels&demons
    • Archer_elf
    • bilibili
    • character_design
    • christmas_trees
    • emblem_practice
    • Gundam_model_free
    • Gundam_model_snow
    • High_school_class_emblem
    • icon-design
    • Miku
    • New_Year_Greetings
    • New_Year_Greetings_2021
    • practice
    • practice_armour
    • practice_cha
    • practice_face
    • practice_ugly
    • self_logo
    • Silhouette
    • Zodiac_mecha-Chicken
    • Zodiac_mecha-Rabbit
  • Gallery
  • Projects
  • Projects
    • DRL4IPPS
    • PDHCG
    • ML4MOC
    • PDHCG-Net
    • LSTM-based Quasi-Newton
  • Experience
  • Teaching
    • Learn JavaScript
    • Learn Python

OptPipe: Memory- and Scheduling-Optimized Pipeline Parallelism for LLM Training

Jan 1, 2025·
Hongpei Li
,
Han Zhang
,
Huikang Liu
,
Dongdong Ge
,
Yinyu Ye
· 0 min read
Cite arXiv URL
Type
Manuscript
Last updated on Jan 1, 2025

← FMIP: Joint Continuous-Integer Flow For Mixed-Integer Linear Programming Jan 1, 2025
Restarted Primal-Dual Hybrid Conjugate Gradient Method for Large-Scale Quadratic Programming Oct 1, 2024 →

© 2025 Me. This work is licensed under CC BY NC ND 4.0

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.