fastalgo

Organizations

@hpcaitech


Efficient AI Inference & Serving

Python 472 29 Updated Jan 8, 2024

Efficient Dataset Distillation by Representative Matching

Python 111 9 Updated Feb 28, 2024

There can be more than Notion and Miro. AFFiNE (pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting, and creating all together. Privacy first, open-source, customizable an…

TypeScript 54,286 3,676 Updated Aug 15, 2025

A Python library that transfers PyTorch tensors between CPU and NVMe

C++ 120 25 Updated Nov 27, 2024

Scalable PaLM implementation in PyTorch

Python 190 27 Updated Dec 19, 2022

HiQ - Observability And Optimization In Modern AI Era

Python 73 10 Updated Jan 27, 2025

Optimizing AlphaFold Training and Inference on GPU Clusters

Python 608 91 Updated Jul 16, 2024

Sky Computing: Accelerating Geo-distributed Computing in Federated Learning

Python 91 21 Updated Nov 22, 2022
Python 8 2 Updated Feb 11, 2022

Large-scale model inference.

Python 632 88 Updated Sep 12, 2023

Examples of training models with hybrid parallelism using ColossalAI

Python 340 103 Updated Mar 23, 2023

Performance benchmarking with ColossalAI

Python 39 16 Updated Jul 6, 2022

Making large AI models cheaper, faster and more accessible

Python 41,080 4,527 Updated Aug 15, 2025

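A collection of materials and projects related to buying a home in China, organized for easy reference and continuously updated...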

1,422 153 Updated Oct 7, 2023

PyTorch implementation of LAMB for ImageNet/ResNet-50 training

Python 13 2 Updated May 13, 2021
Python 28 Updated Jul 11, 2021

Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.

Python 38 8 Updated Jun 1, 2021

Distributed K-FAC preconditioner for PyTorch

Python 89 24 Updated Aug 11, 2025

A Chainer extension for K-FAC

Python 20 3 Updated Jun 16, 2019

RetinaNet in PyTorch

Python 1,000 248 Updated Mar 17, 2019

Over9000 optimizer

Jupyter Notebook 425 57 Updated Nov 22, 2022