Stars
🚀 Efficient implementations of state-of-the-art linear attention models
12 Lessons to Get Started Building AI Agents
A live reading list for LLM data synthesis (updated through July 2025).
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
🎬 YYeTs (人人影视) bot and website, containing all YYeTs resources plus cloud-drive shares from many users
Bringing BERT into modernity via both architecture changes and scaling
Modeling, training, eval, and inference code for OLMo
The hub for EleutherAI's work on interpretability and learning dynamics
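AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the upper-layer software stack, that supports training and inference of large AI models.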
A library for mechanistic interpretability of GPT-style language models
Fast, Flexible and Portable Structured Generation
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Community maintained fork of pdfminer - we fathom PDF
Aligning pretrained language models with instruction data generated by themselves.
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more
Robust recipes to align language models with human and AI preferences
A collection of AGI learning resources (mainly covering LLM and AIGC), continuously updated.
Example models using DeepSpeed
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A curated list of reinforcement learning with human feedback resources (continually updated)
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
A Chinese-language beginner's tutorial for LangChain
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A GPU-accelerated graph learning library for PyTorch, facilitating the scaling of GNN training and inference.