Starred repositories

485 results for source starred repositories

Visual UI analysis tool

Python 400 77 Updated Jul 26, 2023

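🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.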

JavaScript 21,915 2,202 Updated Oct 17, 2025

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 7,929 617 Updated Oct 22, 2025

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,423 107 Updated May 27, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,987 1,260 Updated Oct 27, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,475 285 Updated Nov 5, 2025

✏️ Web-based image segmentation tool for object detection, localization, and keypoints

Vue 2,249 471 Updated Jan 30, 2025
Python 8,655 514 Updated Oct 9, 2024

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,664 295 Updated Jul 31, 2024

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,431 143 Updated Dec 20, 2023

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 448 Updated Aug 19, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 70,717 10,355 Updated Oct 13, 2025

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript 37,450 2,345 Updated Oct 26, 2025

The headless rich text editor framework for web artisans.

TypeScript 33,344 2,729 Updated Nov 5, 2025

The ProseMirror WYSIWYM editor

JavaScript 8,417 364 Updated Apr 22, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,377 6,132 Updated Sep 18, 2024

A state-of-the-art-level open visual language model | multimodal pretrained model

Python 6,685 443 Updated May 29, 2024

Simple Python version management

Roff 43,568 3,217 Updated Nov 5, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,719 9,241 Updated Nov 5, 2025

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 1,731 103 Updated Oct 28, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,144 323 Updated Oct 11, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,151 3,975 Updated Nov 4, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,583 562 Updated Oct 31, 2025

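OCR software, free and offline. Open-source, free offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Built-in multi-language libraries.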

Python 39,512 3,907 Updated May 31, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,536 108 Updated May 29, 2025

Load modules according to tsconfig paths in webpack.

TypeScript 622 48 Updated Nov 15, 2024

Your AI Operator for Web, Android, Automation & Testing.

TypeScript 10,610 723 Updated Nov 5, 2025
Python 8,129 571 Updated Nov 5, 2025

The smallest, simplest and fastest JavaScript pixel-level image comparison library

JavaScript 6,604 324 Updated Jul 16, 2025