EleutherAI/DeeperSpeed
stale
significant_divergence
Selected Prefer upstream if you want current DeepSpeed capabilities and active feature development. Prefer this fork only if you specifically need its downstream patches and are prepared to own the divergence and porting burden.
HabanaAI/DeepSpeed
active
significant_divergence
Choose this fork if your priority is Habana/Gaudi support and you want a DeepSpeed variant already adapted for that platform. Choose upstream if you need the newest DeepSpeed features, fastest bugfix flow, or the least-divergent codebase.
B06901052/DeepSpeed
stale
significant_divergence
Prefer this fork only if you need its older, customized behavior and are prepared to own maintenance. If you want current DeepSpeed capabilities, active fixes, and modern distributed-training features, upstream is the better choice.
ROCm/DeepSpeed
stale
significant_divergence
Choose this fork if your priority is accelerator-specific compatibility and you can tolerate lagging upstream features. Choose upstream if you want the latest DeepSpeed capabilities, active maintenance, and lower integration risk.
martinshkreli/DeepSpeed
stale
significant_divergence
Choose this fork only if you need its specific older/custom DeepSpeed behavior and are prepared to own major divergence. For most adopters, upstream DeepSpeed is the safer choice because it is active, much newer, and far richer in maintained features.
tarxemo/DeepSpeed
slowing
significant_divergence
Choose upstream unless you specifically need this older, unchanged snapshot. This fork does not add capabilities and lags substantially behind current DeepSpeed.
bneayoub/DeepSpeed
stale
significant_divergence
Prefer upstream unless you specifically need this fork's older snapshot or legacy chat/CPU/AMD changes; for new adoption, the fork is too stale and too divergent to be a safe default.
Snowflake-Labs/DeepSpeed
stale
significant_divergence
Prefer this fork only if you need Snowflake-specific maintenance or the narrowed codebase it represents. If you want current DeepSpeed features, active upstream alignment, or broad model-system support, upstream is the better choice.
Stability-AI/DeepSpeed
stale
significant_divergence
Prefer this fork only if you need its specific 2023-era customizations and can accept major divergence from upstream. For most adopters, upstream DeepSpeed is the safer choice because this fork is stale, heavily rewritten, and likely missing newer features and fixes.
erew123/DeepSpeed
stale
significant_divergence
Prefer this fork only if you explicitly want an older, heavily pruned DeepSpeed baseline and are prepared to own maintenance yourself. For most adopters, upstream is the safer choice because this fork is stale and materially diverged.