frank60229

🐂

0 followers · 6 following

Starred repositories

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] A technical guide to embodied AI: Embodied-AI-Guide

8,695 580 Updated Sep 22, 2025

binary-husky / gpt_academic

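Provides a practical interaction interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…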

Python 69,577 8,391 Updated Sep 20, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,367 11,090 Updated Nov 7, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,849 896 Updated Sep 30, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,320 534 Updated Nov 5, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,471 542 Updated May 18, 2025

yangchris11 / samurai

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,987 480 Updated Mar 18, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,222 3,988 Updated Nov 6, 2025

KwaiVGI / LivePortrait

Bring portraits to life!

Python 17,256 1,784 Updated Jun 14, 2025

datawhalechina / self-llm

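"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment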

Jupyter Notebook 25,748 2,588 Updated Nov 4, 2025

datawhalechina / tiny-universe

"A White-Box Guide to Building Large Models": a Tiny-Universe built entirely by hand

Jupyter Notebook 3,975 403 Updated Aug 30, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 469 Updated Aug 7, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,191 1,665 Updated Sep 24, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,111 4,133 Updated Jul 6, 2025

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 6,900 769 Updated Nov 5, 2025

oh-my-ocr / text_renderer

Generate text line images for training deep learning OCR models

Python 886 173 Updated Nov 4, 2025

meta-llama / llama

Inference code for Llama models

Python 58,906 9,812 Updated Jan 26, 2025

iscyy / ultralyticsPro

🔥🔥🔥 Focused on improved models for YOLO11, YOLOv8, YOLOv12, YOLOv10, RT-DETR, YOLOv7, and YOLOv5. Supports improving the backbone, neck, head, loss, IoU, NMS, and other modules🚀

Python 2,851 459 Updated Apr 7, 2025

OFA-Sys / OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,538 249 Updated Apr 24, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,443 878 Updated Nov 6, 2025

WenmuZhou / PytorchOCR

A PyTorch-based OCR toolkit that supports common text detection and recognition algorithms

Python 1,498 313 Updated Sep 2, 2024

dali92002 / DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 176 36 Updated Jan 17, 2025

PKU-YuanGroup / ChatLaw

ChatLaw: A Powerful LLM Tailored for the Chinese Legal Domain

7,347 588 Updated Jan 4, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,816 2,665 Updated Jul 3, 2025

wenwenyu / PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python 570 191 Updated Jul 25, 2024

DayBreak-u / chineseocr_lite

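Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M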

C++ 12,232 2,293 Updated Aug 14, 2023

ChatGPTNextWeb / NextChat

TypeScript 86,353 60,862 Updated Oct 27, 2025

xcanwin / KeepChatGPT

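A plugin that improves the data security and efficiency of ChatGPT. It also shares many innovative features for free, such as auto refresh, keep-alive, data security, audit cancellation, conversation cloning, unrestricted replies, page cleanup, large-screen display, tracking interception, rapid updates, and fine-grained inspection. It makes the AI experience secure, smooth, efficient, and clean.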

JavaScript 14,903 743 Updated Oct 14, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,394 6,132 Updated Sep 18, 2024

UB-Mannheim / tesseract

Forked from tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 3,861 497 Updated Oct 9, 2025
