-
Shanghai Normal University
- Fengxian District, Shanghai, China
Highlights
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
zgsm-ai / costrict
Forked from RooCodeInc/Roo-CodeCostrict - strict AI coder for enterprises, quality first, including AI Agent, AI CodeReview, AI Completion.
Translate PDF, Word, PowerPoint, etc. | zotero翻译插件,微信扫码注册,新用户可免费翻译25万汉字或100万个英文字母。超能文献官网:suppr.wilddata.cn;
The IOS app Right to Record, which is an app that records and uploads at the same time, this can protect your video footages so that even if your phone was destoried when recording, you'll still ha…
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
hiksdk 是海康威视官方 C SDK 的 Go 语言封装,通过 CGO 调用底层 SDK,提供简洁易用的 Go API。支持网络摄像机(IPC)、网络视频录像机(NVR)、数字视频录像机(DVR)等全系列海康设备。
[AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution
Open source AI terminal and SSH Client for EC2, Database and Kubernetes.
Open, reproducible benchmarks and practical recipes to reduce I/O bottlenecks and improve end-to-end performance in AI training and bulk inference.
HoloCubic-AIO-Enhanced Multi-Function Firmware - An all-in-one ESP32-Arduino based firmware featuring weather clock, photo album, video player, screen mirroring, web server, BiliBili fans tracker, …
A lightweight browser-to-NAS pipeline for capturing and downloading web videos. It integrates a Chrome Extension with a NAS-hosted Docker backend (FastAPI, workers, FFmpeg) to automatically detect,…
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
A Systematic Evaluation Framework for Large Language Models in Multi-omics Analysis
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
[NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
🤖 A comprehensive task management system specifically designed for AI assistants. Supports project management, task tracking, team collaboration, and seamless AI integration through MCP (Model Cont…
SplitFM: A Split Parameter-Efficient Fine-Tuning and Inference Framework for Foundation Models
[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
🤖 AutoAudit--智能审计决策系统 Python FastAPI License 基于大语言模型的智能审计平台 | 集成知识图谱、RAG、强化学习等前沿技术 功能特性 • 快速开始 • 技术架构 • 文档 📋 项目简介 智能审计决策系统是一个基于大语言模型(LLM)的智能审计平台,集成了知识图谱、RAG检索增强生成、强化学习等前沿技术,为审计工作提供智能化支持。 🎯 核心价值: 突破…
An Intelligent System for Spine Imaging Analysis and Automated Diagnostic Report Generation.
A head-only, lightweight, fast, thread safe, valgrind-like memory monitor, which output perf-like report.
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Agent-ready RPA suite with out-of-the-box automation tools. Built for individuals and enterprises.
Official codebase for the paper "Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations"
Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling
Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.
A decentralized agent network for building collaborative, LLM-powered agent-to-agent (A2A) systems.
This project uses wrist-worn sensor data—movement, temperature, and proximity—to distinguish body-focused repetitive behaviors (BFRBs) from everyday gestures. The goal is to build a model that impr…
ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubuntu, Android).
[NeurIPS 2025🔥]Main source code of SRPO framework.