shijiatongxue

✨

Focusing

Shi Jia shijiatongxue

✨

Focusing

Front-end developer @bytedance

42 followers · 85 following

Beijing, China
https://shijia.dev

Achievements

Lists (30)

Sort

Starred repositories

Meituan-Dianping / vision-ui

视觉UI分析工具

Python 400 77 Updated Jul 26, 2023

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 21,921 2,201 Updated Oct 17, 2025

openai / agents.md

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 7,943 618 Updated Oct 22, 2025

bytedance / pasa

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,423 107 Updated May 27, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,003 1,266 Updated Oct 27, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,476 286 Updated Nov 6, 2025

jsbroks / coco-annotator

✏️ Web-based image segmentation tool for object detection, localization, and keypoints

Vue 2,249 471 Updated Jan 30, 2025

apple / ml-ferret

Python 8,655 514 Updated Oct 9, 2024

google-research / pix2struct

Python 667 60 Updated Jun 3, 2025

IDEA-Research / DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,665 295 Updated Jul 31, 2024

IDEA-Research / MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,431 143 Updated Dec 20, 2023

UX-Decoder / Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 448 Updated Aug 19, 2024

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 70,726 10,357 Updated Oct 13, 2025

naptha / tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript 37,450 2,345 Updated Oct 26, 2025

ueberdosis / tiptap

The headless rich text editor framework for web artisans.

TypeScript 33,350 2,729 Updated Nov 5, 2025

ProseMirror / prosemirror

The ProseMirror WYSIWYM editor

JavaScript 8,417 364 Updated Apr 22, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,380 6,133 Updated Sep 18, 2024

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,687 445 Updated May 29, 2024

pyenv / pyenv

Simple Python version management

Roff 43,569 3,217 Updated Nov 5, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,762 9,250 Updated Nov 5, 2025