Skip to content
View frank60229's full-sized avatar
🐂
🐂

Block or report frank60229

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

55 results for source starred repositories
Clear filter

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

8,652 575 Updated Sep 22, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,555 8,389 Updated Sep 20, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,133 11,044 Updated Nov 5, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,842 896 Updated Sep 30, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,312 531 Updated Nov 5, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,986 479 Updated Mar 18, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,149 3,974 Updated Nov 4, 2025

Bring portraits to life!

Python 17,248 1,784 Updated Jun 14, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,710 2,583 Updated Nov 4, 2025

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 3,971 403 Updated Aug 30, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,344 467 Updated Aug 7, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,185 1,665 Updated Sep 24, 2025

A generative speech model for daily dialogue.

Python 38,102 4,132 Updated Jul 6, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 6,887 766 Updated Nov 5, 2025

Generate text line images for training deep learning OCR models

Python 886 173 Updated Nov 4, 2025

Inference code for Llama models

Python 58,899 9,812 Updated Jan 26, 2025

🔥🔥🔥 专注于YOLO11,YOLOv8、TYOLOv12、YOLOv10、RT-DETR、YOLOv7、YOLOv5改进模型,Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀

Python 2,852 459 Updated Apr 7, 2025

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,537 249 Updated Apr 24, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,443 878 Updated Nov 5, 2025

基于Pytorch的OCR工具库,支持常用的文字检测和识别算法

Python 1,498 313 Updated Sep 2, 2024

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 176 36 Updated Jan 17, 2025

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

7,343 588 Updated Jan 4, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,811 2,663 Updated Jul 3, 2025

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python 570 191 Updated Jul 25, 2024

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

C++ 12,227 2,294 Updated Aug 14, 2023

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 86,339 60,869 Updated Oct 27, 2025

这是一款提高ChatGPT的数据安全能力和效率的插件。并且免费共享大量创新功能,如:自动刷新、保持活跃、数据安全、取消审计、克隆对话、言无不尽、净化页面、展示大屏、拦截跟踪、日新月异、明察秋毫等。让我们的AI体验无比安全、顺畅、丝滑、高效、简洁。

JavaScript 14,901 743 Updated Oct 14, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,376 6,131 Updated Sep 18, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 70,716 10,354 Updated Oct 13, 2025
Next