Pull requests: aws-samples/awsome-distributed-training

Pull requests list

Distillation example
#758 opened Jun 23, 2025 by chivatam
adding nemo2.0 eks test case
#688 opened May 21, 2025 by KeitaW Draft
Feat/ddp mlflow (label: enhancement)
#655 opened Apr 28, 2025 by KeitaW
Feature/slinkly slurm hyperpod eks (label: enhancement)
#651 opened Apr 25, 2025 by bluecrayon52
easy smhp slurm and eks
#514 opened Dec 10, 2024 by gmgtamz
add GPU accounting for SMHP
#462 opened Oct 21, 2024 by KeitaW
Update bionemo test case + propose to subdirectories per orchastrator (label: documentation)
#396 opened Aug 5, 2024 by KeitaW Draft
Esm2 on Sagemaker Hyperpod
#387 opened Jul 25, 2024 by awsankur
Neuron distributed
#359 opened Jun 13, 2024 by KeitaW
Llama training with FP8
#331 opened May 15, 2024 by pbelevich Draft
Add draft gpu troubles
#290 opened Apr 30, 2024 by mhuguesaws Draft
[WIP] torchtune usecase
#260 opened Apr 12, 2024 by pbelevich Draft