Stars
azooKey-Desktop is an open-source Japanese input method for macOS, written in Swift and powered by the Zenzai neural kana-kanji converter. It provides live conversion, optional LLM-based “Magic Con…
Optimizing inference proxy for LLMs
Kanban board to manage your AI coding agents
Behavior Injection: Preparing Language Models for Reinforcement Learning (NeurIPS 2025)
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, c…
A CUI tool for browsing and resuming Claude Code conversations
2025! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Kortix – build, manage and train AI Agents. Fully Open Source.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
Documentation for the Model Context Protocol (MCP)
Amazon Nova Act is a research preview of a new AI model for developers to build agents that take actions in web browsers
No fortress, purely open ground. OpenManus is Coming.
The AI Browser Automation Framework
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Simple package to extract text with coordinates from programmatic PDFs
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A terminal workspace with batteries included
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
DSPy: The framework for programming—not prompting—language models
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
High-speed Large Language Model Serving for Local Deployment
NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers
A collection of GPT system prompts and various prompt injection/leaking knowledge.