# X-AnyLabeling **Repository Path**: springni/X-AnyLabeling ## Basic Information - **Project Name**: X-AnyLabeling - **Description**: No description available - **Primary Language**: Python - **License**: GPL-3.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-05-22 - **Last Updated**: 2026-06-23 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README

[简体中文](README_zh-CN.md) | [English](README.md)

![](https://user-images.githubusercontent.com/18329471/234640541-a6a65fbc-d7a5-4ec3-9b65-55305b01a7aa.png)

自动训练

自动标注

检测一切

分割一切

可提示概念定位

视觉问答

聊天机器人

图像分类器

视频分类器

OCR识别

## 🥳 新功能 - `2026-06-20`: 新增 [TrackTrack](./examples/multiple_object_tracking/README.md) 多目标跟踪算法。 - `2026-06-19`: 新增 PP-OCRv6 模型，支持多语言文本检测与识别。 - `2026-06-18`: 新增 [SCRFD 10G KPS](./examples/estimation/face_estimation/README.md)，支持人脸检测与五点人脸关键点标注。 - `2026-06-15`: 新增橡皮擦功能，支持快速移除多边形和折线标注的顶点。 - `2026-06-06`: 新增 [LocateAnything](./examples/grounding/locateanything/README.md) 支持，覆盖开放词汇视觉定位、指点定位和场景文本检测。 - `2026-05-23`: 新增支持 [视频分类器](./docs/zh_cn/video_classifier.md) 标注面板，支持时间轴片段分类、片段描述、AI 自动切分视频与描述生成。 - `2026-04-26`: 新增客户端（本地 ONNX）支持 [SAM 3](./examples/grounding/sam3/README.md) 文本驱动分割。 - `2026-04-25`: 新增支持 [GeCo2](./examples/counting/geco2/README.md) 目标计数模型。 - `2026-04-19`: 新增标注检查状态流程，便于快速复核已标注图片。 - `2026-04-19`: 新增支持 YOLO 模型 TensorRT 后端推理。 - `2026-04-18`: 新增支持将标注可视化结果导出为图片或视频。 - `2026-04-18`: 新增支持 [PaddleOCR](./docs/zh_cn/paddle_ocr.md) 文档解析与智能文字识别标注面板。 - `2026-04-01`: 新增支持日语和韩语界面语言（`ja_JP`、`ko_KR`）。 - `2026-03-22`: 新增支持在 GUI 中通过内置设置面板直接调整常用选项。 - `2026-03-10`: 新增支持从矩形生成 3D Cuboid 对象标注。 - `2026-03-01`: 完成 PyQt5 到 PyQt6 的升级重构（Beta 版本），并同步修复与优化若干功能细节。 - 更多详情，请参考[更新日志](./CHANGELOG.md) ## 简介 **X-AnyLabeling** 是一款基于AI推理引擎和丰富功能特性于一体的强大辅助标注工具，其专注于实际应用，致力于为多模态数据工程师提供工业级的一站式解决方案，可自动快速进行各种复杂任务的标定。此外，我们强烈推荐您尝试 [X-AnyLabeling-Server](https://github.com/CVHub520/X-AnyLabeling-Server)，这是一个简单、轻量级且可扩展的框架，为 X-AnyLabeling 提供远程推理能力。 ## 新特性

- 支持远程推理服务。 - 支持 `ONNX Runtime`、`TensorRT`、`OpenCV DNN` 等灵活后端加速推理。 - 支持一键预测所有图像。 - 支持`图像`和`视频`处理。 - 支持`英文`、`中文`、`日文`、`韩文` 四种界面语言。 - 支持自定义模型和二次开发。 - 支持一键导入和导出多种标签格式，如 `COCO`、`VOC`、`YOLO`、`DOTA`、`MOT`、`MASK`、`PPOCR`、`MMGF`、`VLM-R1`、`ShareGPT` 等； - 支持多种图像标注样式，包括 `多边形`、`矩形`、`长方体`、`旋转框`、`圆形`、`线条`、`点`，以及 `文本检测`、`识别` 和 `KIE` 标注； - 支持各类视觉任务，如`图像分类`、`目标检测`、`实例分割`、`姿态估计`、`旋转检测`、`多目标跟踪`、`光学字符识别`、`图像文本描述`、`车道线检测`、`分割一切`、`视觉问答`、`文档解析`等。 ## 模型库 | **任务类别** | **支持模型** | | :--- | :--- | | 🖼️ **图像分类** | YOLOv5-Cls, YOLOv8-Cls, YOLO11-Cls, InternImage, PULC | | 🎯 **目标检测** | YOLOv5/6/7/8/9/10, YOLO11/12/26, YOLOX, YOLO-NAS, D-FINE, DAMO-YOLO, Gold_YOLO, RT-DETR, RF-DETR, DEIMv2 | | 🖌️ **实例分割** | YOLOv5-Seg, YOLOv8-Seg, YOLO11-Seg, YOLO26-Seg, Hyper-YOLO-Seg, RF-DETR-Seg | | 🏃 **姿态估计** | YOLOv8-Pose, YOLO11-Pose, YOLO26-Pose, DWPose, RTMO | | 😀 **人脸估计** | SCRFD, YOLOv6Lite-Face | | 👣 **目标跟踪** | TrackTrack, Bot-SORT, ByteTrack, SAM2/3-Video | | 🔄 **旋转目标检测** | YOLOv5-Obb, YOLOv8-Obb, YOLO11-Obb, YOLO26-Obb | | 📏 **深度估计** | Depth Anything | | 🧩 **分割一切** | SAM 1/2/3, SAM-HQ, SAM-Med2D, EdgeSAM, EfficientViT-SAM, MobileSAM | | ✂️ **图像抠图** | RMBG 1.4/2.0 | | 💡 **候选框提取** | UPN | | 🏷️ **图像标记** | RAM, RAM++ | | 📄 **光学字符识别** | PP-OCRv4, PP-OCRv5, PP-OCRv6 | | 🧾 **综合版面分析** | PP-DocLayoutV3 | | 📑 **文档解析** | PaddleOCR-VL, PaddleOCR-VL-1.6 | | 🗣️ **视觉基础模型** | Rex-Omni, Florence2 | | 👁️ **视觉语言模型** | Qwen3-VL, Gemini, ChatGPT, GLM | | 🛣️ **车道线检测** | CLRNet | | 🔢 **目标计数** | CountGD, GeCO, GeCo2 | | 📍 **视觉定位** | Grounding DINO, YOLO-World, YOLOE, SAM 3, LocateAnything | | 📚 **其他** | 👉 [model_zoo](./docs/en/model_zoo.md) 👈 | ## 文档 0. [远程推理服务](https://github.com/CVHub520/X-AnyLabeling-Server) 1. [安装文档](./docs/zh_cn/get_started.md) 2. [用户手册](./docs/zh_cn/user_guide.md) 3. [命令行界面](./docs/zh_cn/cli.md) 4. [自定义模型](./docs/zh_cn/custom_model.md) 5. [常见问题答疑](./docs/zh_cn/faq.md) 6. [聊天机器人](./docs/zh_cn/chatbot.md) 7. [视觉问答](./docs/zh_cn/vqa.md) 8. [图像分类器](./docs/zh_cn/image_classifier.md) 9. [视频分类器](./docs/zh_cn/video_classifier.md) 10. [文档解析与智能文字识别](./docs/zh_cn/paddle_ocr.md) ## 示例 - [Classification](./examples/classification/) - [Image-Level](./examples/classification/image-level/README.md) - [Shape-Level](./examples/classification/shape-level/README.md) - [Detection](./examples/detection/) - [HBB Object Detection](./examples/detection/hbb/README.md) - [OBB Object Detection](./examples/detection/obb/README.md) - [Segmentation](./examples/segmentation/README.md) - [Instance Segmentation](./examples/segmentation/instance_segmentation/) - [Binary Semantic Segmentation](./examples/segmentation/binary_semantic_segmentation/) - [Multiclass Semantic Segmentation](./examples/segmentation/multiclass_semantic_segmentation/) - [Description](./examples/description/) - [Tagging](./examples/description/tagging/README.md) - [Captioning](./examples/description/captioning/README.md) - [Estimation](./examples/estimation/) - [Face Estimation](./examples/estimation/face_estimation/README.md) - [Pose Estimation](./examples/estimation/pose_estimation/README.md) - [Depth Estimation](./examples/estimation/depth_estimation/README.md) - [OCR](./examples/optical_character_recognition/) - [Text Recognition](./examples/optical_character_recognition/text_recognition/) - [Key Information Extraction](./examples/optical_character_recognition/key_information_extraction/README.md) - [MOT](./examples/multiple_object_tracking/README.md) - [Tracking by HBB Object Detection](./examples/multiple_object_tracking/README.md) - [Tracking by OBB Object Detection](./examples/multiple_object_tracking/README.md) - [Tracking by Instance Segmentation](./examples/multiple_object_tracking/README.md) - [Tracking by Pose Estimation](./examples/multiple_object_tracking/README.md) - [iVOS](./examples/interactive_video_object_segmentation) - [SAM2-Video](./examples/interactive_video_object_segmentation/sam2/README.md) - [SAM3-Video](./examples/interactive_video_object_segmentation/sam3/README.md) - [Matting](./examples/matting/) - [Image Matting](./examples/matting/image_matting/README.md) - [Vision-Language](./examples/vision_language/) - [Rex-Omni](./examples/vision_language/rexomni/README.md) - [Florence 2](./examples/vision_language/florence2/README.md) - [Counting](./examples/counting/) - [GeCo](./examples/counting/geco/README.md) - [GeCo2](./examples/counting/geco2/README.md) - [Grounding](./examples/grounding/) - [YOLOE](./examples/grounding/yoloe/README.md) - [SAM 3](./examples/grounding/sam3/README.md) - [LocateAnything](./examples/grounding/locateanything/README.md) - [Training](./examples/training/) - [Ultralytics](./examples/training/ultralytics/README.md) ## 贡献指南我们欢迎社区协作！**X‑AnyLabeling** 项目的成长离不开开发者们的共同参与，无论是修复 Bug、优化文档、还是添加新功能，您的贡献都非常宝贵。在参与前请阅读我们的 [贡献指南](./CONTRIBUTING.md)，并在提交 Pull Request 前确认您已同意 [贡献者许可协议 (CLA)](./CLA.md)。如果你觉得这个项目有帮助，请点亮右上角的⭐星标⭐。如有任何问题或疑问，欢迎[创建 issue](https://github.com/CVHub520/X-AnyLabeling/issues) 或发送邮件至 cv_hub@163.com。衷心感谢每一位为项目贡献力量的朋友 🙏 ## 许可本项目遵循 [GPL-3.0 license](./LICENSE) 协议，完全开源免费，初衷是希望让更多开发者、研究者和企业能够便捷地使用 AI 应用平台，推动整个行业的发展。我们鼓励大家自由使用（包括商业用途），也可以基于本项目添加功能并进行商业化，但必须保留品牌标识并标注源项目地址。此外，为了了解 X-AnyLabeling 的生态和使用情况，如果您将本项目用于学术、科研、教学或是企业用户，请在此[填写登记表](https://forms.gle/MZCKhU7UJ4TRSWxR7)。此登记仅用于统计使用情况，不会产生任何费用，我们会对信息严格保密。 X-AnyLabeling 由个人独立开发和维护。如果本项目对您有所帮助，欢迎通过下方捐赠链接支持项目持续发展，您的支持是对我最大的鼓励！如对项目有任何疑问或希望合作，欢迎随时微信联系：ww10874。 ## 赞助 | **微信支付** | **支付宝** | | :---: | :---: | |

| 感谢您的支持！ ## 引用如果您在研究中使用了这个软件，请按照以下方式引用它： ``` @misc{X-AnyLabeling, year = {2023}, author = {Wei Wang}, publisher = {Github}, organization = {CVHub}, journal = {Github repository}, title = {Advanced Auto Labeling Solution with Added Features}, howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}} } ``` --- ![Star History Chart](https://api.star-history.com/svg?repos=CVHub520/X-AnyLabeling&type=Date)

🔝 返回顶部