-
Peking University
- China
-
03:25
(UTC +08:00) - https://step-out.github.io/
Lists (1)
Sort Name ascending (A-Z)
Stars
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
[FSE'2026] PlayCoder: Making LLM-Generated GUI Code Playable
A curated collection of 100+ multimodal large language models
Official implementation of "CellFlux: Simulating Cellular Morphology Changes via Flow Matching" (ICML 2025)
Make the AI responses and the search results from the search engine appear more coherently on the same page.
[FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
MineContext is your proactive context-aware AI partner(Context-Engineering+ChatGPT Pulse)
The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution. [NeurIPS 2024]
SWE-bench: Can Language Models Resolve Real-world Github Issues?
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
[EMNLP 2025 main] C3 Benchmark: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
Writing AI Conference Papers: A Handbook for Beginners
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.