Stars
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
[PR 2025] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment
A best practice for deep learning project template architecture.
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…
A lmdb dataset format conversion tool
📷 EasyPhoto | Your Smart AI Photo Generator.
useful text recognition algorithms, CRNN and SVTR text recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
Annotation Tool for Text Simplification Corpora
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
Pytorch implementation of our paper "CLRNet: Cross Layer Refinement Network for Lane Detection" (CVPR2022 Acceptance).
Decoder architecture based on the UNet++. Combining residual bottlenecks with depthwise convolutions and attention mechanisms, it outperforms the UNet++ in a coronary artery segmentation task, whil…
[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation
Code for our CVPR'2024 paper "GauHuman: Articulated Gaussian Splatting from Monocular Human Videos"
RUCAIBox / UniCRS
Forked from wxl1999/UniCRS[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".
[ECCV2022 Oral] Registration based Few-Shot Anomaly Detection
deep learning for image processing including classification and object-detection etc.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
🎨 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reasoning (based on LaTeX AST).
EMSANet: Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Jupyter notebook tutorials for mmpose
Document Layout Analysis resources repos for development with PdfPig.
Algorithms for explaining machine learning models