Stars
Efficient Dataset Distillation by Representative Matching
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
A Python library transfers PyTorch tensors between CPU and NVMe
Scalable PaLM implementation of PyTorch
HiQ - Observability And Optimization In Modern AI Era
Optimizing AlphaFold Training and Inference on GPU Clusters
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
Examples of training models with hybrid parallelism using ColossalAI
Performance benchmarking with ColossalAI
Making large AI models cheaper, faster and more accessible
PyTorch implementation of LAMB for ImageNet/ResNet-50 training
Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.
Distributed K-FAC preconditioner for PyTorch