Skip to content
View xiujiesong's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Shanghai Jiao Tong University
  • Shanghai

Highlights

  • Pro

Block or report xiujiesong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 39,200 6,847 Updated Apr 12, 2026

🏛️ 三省六部制 · OpenClaw Multi-Agent Orchestration System — 9 specialized AI agents with real-time dashboard, model config, and full audit trails

Python 14,944 1,566 Updated Apr 9, 2026

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,910 90 Updated Jan 8, 2026
JavaScript 4,105 1,841 Updated Jun 21, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,766 1,040 Updated Sep 4, 2025

Official inference repo for FLUX.1 models

Python 25,399 1,872 Updated Jul 31, 2025

Open-source multi-turn evaluation toolkit of LLMs. Under construction...

Python 10 Updated Mar 25, 2026

A benchmark evolving framework and a benchmark for LLMs' multi-turn instruction following evaluation.

Python 2 Updated Apr 7, 2026
Python 10,940 739 Updated Feb 9, 2026

Downloads videos and playlists from YouTube

C# 14,673 1,824 Updated Apr 12, 2026

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model

1,265 55 Updated Jan 8, 2026

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,416 42 Updated Mar 9, 2026

The baselines of ARC-Challenge-Interspeech2026

Python 58 5 Updated Dec 1, 2025

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

Python 1,262 122 Updated Mar 23, 2026

上海交通大学开题报告/中期报告LaTeX模板(非官方) Shanghai Jiao Tong University LaTeX templates for thesis proposals and annual reports (unofficial)

TeX 159 10 Updated Jan 25, 2026

A benchmark on visual perception in text strings for both LLMs and MLLMs.

Python 14 1 Updated Apr 7, 2026

O1 Replication Journey

1,999 61 Updated Jan 14, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,665 252 Updated Jan 8, 2026

AI for Science 论文解读合集(持续更新ing),论文/数据集/教程下载:hyper.ai

3,176 502 Updated Mar 22, 2025

A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models

315 13 Updated Jan 11, 2026

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,704 5,561 Updated Apr 12, 2026

This is a repository for listing papers on scene graph generation and application.

630 42 Updated Apr 9, 2026

Official repo for 'Large Multimodal Models Evaluation: A Survey'

101 10 Updated Mar 16, 2026

This project introduces a novel, user-centric leaderboard for Large Language Models (LLMs) that moves beyond one-size-fits-all evaluations. Our framework empowers users to create personalized ranki…

Python 4 Updated Jan 26, 2026
Python 7 Updated Jul 7, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,419 1,326 Updated Jul 9, 2025

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 812 56 Updated Jul 9, 2025

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,044 23 Updated Mar 20, 2026
Next