Stars
[NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
A relation-free graph constrcution method for efficient GraphRAG.
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''
[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on large language model-based …
A neural network for emotion recognition based on multimodal physiological signal
A blueprint for building production-ready RAG systems that minimize hallucination, featuring switchable 3-step (Speed) and 4-step (Precision) pipelines.
Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling
INFTY Engine: An Optimization Toolkit to Support Continual AI
Res-SAM Framework for GPR Underground Hazard Detection
A comprehensive, production-ready framework for building intelligent AI agents with advanced capabilities including tool calling, persistent memory, intelligent concurrency, and event-driven observ…
Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
Code repository for a potential journal publication
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
小而美的Vue3异步处理解决方案,让复杂的异步逻辑变得简单优雅,让重复的样板代码成为历史
Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard".
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes
A vision language model for gigapixel whole slide images in histopathology
Simple Baseline for Visual Question Answering
[EMNLP 2024 Oral] Official implementation of paper "SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV Battery Supply Chain Disruptions"
CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with features comparable to Hugging Face. Gain…
53AI Hub is an open-source AI portal, which enables you to quickly build a operational-level AI portal to launch and operate AI agents, prompts, and AI tools. It supports seamless integration with …
This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use. The goal is to develop a model that can help identify earl…
Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.