Stars
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
Code release for the paper, "Proactive Agents for Text-to-Image Generation under Uncertainty"
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Confidence scores for Neural Networks, made easy!
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A structured template for building robust generative AI applications
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context (AAAI 2025)
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
Implementation of paper: Flux Already Knows – Activating Subject-Driven Image Generation without Training
[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, int…
LangChain, LangGraph Open Tutorial for everyone!
Running Docling as an API service
Lets make video diffusion practical!
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.
LangGraph-powered ReAct agent with Model Context Protocol (MCP) integration. A Streamlit web interface for dynamically configuring, deploying, and interacting with AI agents capable of accessing va…
Convert PDF to markdown + JSON quickly with high accuracy
🌐 Make websites accessible for AI agents. Automate tasks online with ease.