- All languages
- AngelScript
- C
- C#
- C++
- CSS
- CoffeeScript
- Dockerfile
- Elixir
- Go
- HCL
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MDX
- Makefile
- Markdown
- Move
- PHP
- PLpgSQL
- Pascal
- PowerShell
- Python
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Smali
- Smarty
- Solidity
- Swift
- TeX
- Thrift
- TypeScript
- Vim Script
- Vue
- Wikitext
Starred repositories
Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval performance on both text and visual…
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
✨✨Latest Advances on Multimodal Large Language Models
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Label Studio is a multi-type data labeling and annotation tool with standardized output format
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabili…
This repository contains a comprehensive computer vision/machine learning football project that uses YOLO for object detection, Kmeans for pixel segmentation, optical flow for motion tracking, and …
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
🎥 Python and OpenCV-based scene cut/transition detection program & library.
A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key fr…
Common utilities for ONNX converters
This Repo, Builds an NLP system that analyzes a TV series with NLP and even creates a character chat bot with LLMs
A paper list of some recent Transformer-based CV works.
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
TensorRT+YOLO系列的 多路 多卡 多实例 并行视频分析处理案例
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Python APIs for web automation, testing, and bypassing bot-detection.
Deformable DETR: Deformable Transformers for End-to-End Object Detection.