Skip to content
View jszh's full-sized avatar

Organizations

@meomoe

Block or report jszh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Screen reader driver for test automation.

TypeScript 519 12 Updated Apr 5, 2026

Dataset of Mouse and Touchscreen Input Performance

6 1 Updated Apr 13, 2021

AI agents running research on single-GPU nanochat training automatically

Python 72,364 10,562 Updated Mar 26, 2026

Virtual Screen Reader is a screen reader simulator for unit tests.

TypeScript 136 6 Updated Apr 10, 2026
JavaScript 8 Updated Apr 15, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,688 3,656 Updated Apr 15, 2026

Building a comprehensive and handy list of papers for GUI agents

Python 701 39 Updated Apr 14, 2026
Python 119 27 Updated Nov 19, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,419 42 Updated Mar 9, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,114 8,578 Updated Apr 12, 2026

VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models

4,311 609 Updated Jul 27, 2025

Web Content Accessibility Guidelines

HTML 1,403 401 Updated Apr 14, 2026

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.

Jupyter Notebook 26,770 3,199 Updated Apr 14, 2026

ScreenCoder — Turn any UI screenshot into clean, editable HTML/CSS with full control. Fast, accurate, and easy to customize.

Python 2,638 258 Updated Oct 22, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

135,202 33,982 Updated Mar 28, 2026

Vibetest MCP - automated QA testing using Browser-Use agents

Python 787 80 Updated Sep 2, 2025

C++ implementation of a ScienceDirect paper "An accelerating cpu-based correlation-based image alignment for real-time automatic optical inspection"

C++ 1,118 260 Updated Jan 20, 2026

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Jupyter Notebook 2,100 257 Updated Dec 30, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,424 399 Updated Nov 11, 2025

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 577 91 Updated Jun 3, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,697 835 Updated Mar 14, 2026
Jupyter Notebook 131 17 Updated Dec 4, 2023

Datasets on Website Aesthetics for Machine Learning

R 13 4 Updated Mar 28, 2023

Android in docker solution with noVNC supported and video recording

Python 14,482 1,659 Updated Apr 13, 2026
Python 150 25 Updated Jul 12, 2022

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

Python 476 82 Updated Feb 23, 2024

A Python Perceptual Image Hashing Module

Python 3,814 339 Updated Apr 17, 2025

Pretty good call graphs for dynamic languages

Python 4,559 327 Updated Jul 27, 2025

MagentaA11y is a tool built to simplify the process of accessibility testing.

TypeScript 77 22 Updated Apr 14, 2026
Next