-
flash-linear-attention Public
Forked from fla-org/flash-linear-attention🚀 Efficient implementations of state-of-the-art linear attention models
Python MIT License UpdatedSep 5, 2025 -
ai-agents-for-beginners Public
Forked from microsoft/ai-agents-for-beginners11 Lessons to Get Started Building AI Agents
Jupyter Notebook MIT License UpdatedJul 24, 2025 -
LLM-Synthetic-Data Public
Forked from pengr/LLM-Synthetic-DataA live reading list for LLM-synthetic-data.
MIT License UpdatedJul 1, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMay 28, 2025 -
loong Public
Forked from camel-ai/loong🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.
Jupyter Notebook Apache License 2.0 UpdatedMay 20, 2025 -
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 UpdatedJan 30, 2025 -
ms-swift Public
Forked from modelscope/ms-swiftUse PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Python Apache License 2.0 UpdatedAug 22, 2024 -
-
ChunkLlama Public
Forked from HKUNLP/ChunkLlama[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Python Apache License 2.0 UpdatedJul 18, 2024 -
Large-Language-Models-play-StarCraftII Public
Forked from histmeisah/Large-Language-Models-play-StarCraftIITextStarCraft2,a pure language env which support llms play starcraft2
Python UpdatedJun 13, 2024 -
Python-Package-Template Public template
Forked from kyegomez/Python-Package-TemplateA easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much much more
Shell MIT License UpdatedJun 5, 2024 -
chinese-frequency-word-list Public
Forked from liangqi/chinese-frequency-word-listUpdatedJun 4, 2024 -
GradCache Public
Forked from luyug/GradCacheRun Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
Python Apache License 2.0 UpdatedMar 26, 2024 -
alignment-handbook Public
Forked from huggingface/alignment-handbookRobust recipes to align language models with human and AI preferences
Python Apache License 2.0 UpdatedMar 21, 2024 -
EffectiveModernCppChinese Public template
Forked from CnTransGroup/EffectiveModernCppChinese《Effective Modern C++》- 完成翻译
UpdatedFeb 19, 2024 -
CQMC Public
Forked from IvanaXu/CQMCLLM + 中文文本匹配 Chinese Question Matching Corpus
Python MIT License UpdatedFeb 17, 2024 -
Awesome-AGI Public
Forked from ArronAI007/Awesome-AGIAGI资料汇总学习(主要包括LLM和AIGC),持续更新......
Jupyter Notebook UpdatedJan 31, 2024 -
ChatLM-mini-Chinese Public
Forked from charent/ChatLM-mini-Chinese中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调。
Python Apache License 2.0 UpdatedJan 12, 2024 -
Phi2-mini-Chinese Public
Forked from charent/Phi2-mini-ChinesePhi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
Jupyter Notebook Apache License 2.0 UpdatedJan 12, 2024 -
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamplesExample models using DeepSpeed
Python Apache License 2.0 UpdatedJan 10, 2024 -
lightllm Public
Forked from ModelTC/LightLLMLightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python Apache License 2.0 UpdatedJan 2, 2024 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryEasy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Python Apache License 2.0 UpdatedDec 29, 2023 -
awesome-RLHF Public
Forked from opendilab/awesome-RLHFA curated list of reinforcement learning with human feedback resources (continually updated)
Apache License 2.0 UpdatedDec 20, 2023 -
LLM-Tuning Public
Forked from beyondguo/LLM-TuningTuning LLMs with no tears💦, sharing LLM-tools with love❤️.
Python UpdatedOct 30, 2023 -
bertviz Public
Forked from jessevig/bertvizBertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Python Apache License 2.0 UpdatedAug 24, 2023 -
Crosstalk-Generation Public
Forked from lumberbroMaiden/crosstalk_gptPerform crosstalk with Qian Yu
Python UpdatedAug 17, 2023 -
gpt_academic Public
Forked from binary-husky/gpt_academic为GPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型
Python GNU General Public License v3.0 UpdatedMay 3, 2023 -
MOSS Public
Forked from OpenMOSS/MOSSAn open-source tool-augmented conversational language model from Fudan University
Python Apache License 2.0 UpdatedApr 24, 2023 -
MiniGPT-4 Public
Forked from Vision-CAIR/MiniGPT-4MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Python BSD 3-Clause "New" or "Revised" License UpdatedApr 18, 2023 -
JARVIS Public
Forked from microsoft/JARVISJARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Python MIT License UpdatedApr 10, 2023