Skip to content
View kiljos's full-sized avatar
🍊
🍊

Highlights

  • Pro

Block or report kiljos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

YouTube Thumbnail Generator with AI-powered face detection and image generation

Python 4 1 Updated Jan 31, 2026

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need th…

Python 25,852 2,002 Updated Jun 20, 2026

The design language that makes your AI harness better at design.

JavaScript 39,865 2,202 Updated Jun 20, 2026

Fully automatic censorship removal for language models

Python 25,261 2,719 Updated Jun 19, 2026

A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…

Shell 114,716 18,739 Updated Jun 18, 2026

A Simple and Universal Swarm Intelligence Engine, Predicting Anything. 简洁通用的群体智能引擎,预测万物

Python 66,862 10,423 Updated May 24, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 71,968 9,765 Updated Jun 20, 2026

Model Context Protocol (MCP) server for AI-assisted development ("vibe coding") of MDK applications.

JavaScript 32 6 Updated Jun 11, 2026

AI agents can now use real Android and iOS apps, just like a human.

Python 2,613 223 Updated Jun 10, 2026

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,870 138 Updated Apr 24, 2026

Every front-end GUI client for ChatGPT, Claude, and other LLMs

3,985 275 Updated Jan 22, 2026

[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Python 409 51 Updated Apr 13, 2026

GUI Grounding for Professional High-Resolution Computer Use

Python 377 55 Updated Jun 17, 2026

Agent S: an open agentic framework that uses computers like a human

Python 11,894 1,401 Updated May 13, 2026

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

485 21 Updated Aug 16, 2025

Python script to upload videos on YouTube using Selenium

Python 662 207 Updated Feb 12, 2023

Goodreads Quote API

Python 23 6 Updated Oct 22, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 850 109 Updated Feb 3, 2025

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

JavaScript 1,059 100 Updated Dec 9, 2024

🖥️ Run AI Agent in your browser.

Python 16,107 2,717 Updated May 15, 2026

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,225 810 Updated Mar 5, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

TeX 1,008 34 Updated May 22, 2026

ui-screenshot-to-prompt is an AI-powered tool that analyzes UI images to generate detailed prompts for AI coders. It uses computer vision and natural language processing to break down UI components…

Python 221 40 Updated Oct 15, 2025

A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)

303 36 Updated Jun 17, 2026

A collection of AI Agents papers (Updated biweekly)

1,502 113 Updated May 30, 2026

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

HTML 27,795 5,755 Updated Jun 20, 2026

JavaScript API for Chrome and Firefox

TypeScript 95,160 9,460 Updated Jun 20, 2026

The model, data and code for the visual GUI Agent SeeClick

HTML 486 31 Updated Jul 13, 2025

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,057 4,103 Updated Jun 18, 2026
Next