Lists (30)
AI
awesome-list: 👍
big-front-end: desktop, mobile
build-tools: build tools
clean-code: clean, readable code
code-editor
compiler: compiler, parser, transformer, linter, formatter, AST
d2c: design to code
diagram
doc site
docker
figma
fonts
framework
interview
js-tools
network
node-js
package manage
python
react
rust: 🦀
serverless
template: productivity templates
test
typescript
ui
animation (动画)
micro-frontends (微前端)
renderers (渲染器): canvas, WebGL, WebGPU
Starred repositories
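🥢 Cook like Lao Xiang Ji 🐔. The main part was completed in 2024; this is not an official Lao Xiang Ji repository. The text comes from the "Lao Xiang Ji Dish Traceability Report" (《老乡鸡菜品溯源报告》) and has been summarized, edited, and organized. CookLikeHOC.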
AGENTS.md — a simple, open format for guiding coding agents
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Tesseract Open Source OCR Engine (main repository)
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
The headless rich text editor framework for web artisans.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
a state-of-the-art-level open visual language model | multimodal pretrained model
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Multilingual Document Layout Parsing in a Single Vision-Language Model
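Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in libraries for multiple languages.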
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Load modules according to tsconfig paths in webpack.
Your AI Operator for Web, Android, Automation & Testing.
The smallest, simplest and fastest JavaScript pixel-level image comparison library