Highlights
- Pro
Stars
A course on aligning smol models.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Secure open source cloud runtime for AI apps & AI agents
Speech To Speech: an effort for an open-sourced and modular GPT4-o
π€ LeRobot: Making AI for Robotics more accessible with end-to-end learning
Scripts to download and assemble printable versions of Roche's famous biochemical pathways poster
Easily embed, cluster and semantically label text datasets
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Minimalistic large language model 3D-parallelism training
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
A blazing fast inference solution for text embeddings models
Robust recipes to align language models with human and AI preferences
LLM powered development for VSCode
A framework for the evaluation of autoregressive code generation language models.
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Home of StarCoder: fine-tuning & inference!
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Fine-tune SantaCoder for Code/Text Generation.
π€ Evaluate: A library for easily evaluating machine learning models and datasets.
The official Python client for the Huggingface Hub.
π€ Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
A π₯ cookiecutter template for building Hugging Face Spaces
Jupyter notebooks for the Natural Language Processing with Transformers book