😈
Making alchemy

Starred repositories

Python 1,759 139 Updated Dec 20, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 300 17 Updated Dec 17, 2025

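An AI-native PPT generation app based on nano banana pro🍌, moving toward a true "Vibe PPT": upload any template image; upload any material for intelligent parsing; generate a PPT automatically from a single sentence, an outline, or page descriptions; edit specified regions with spoken instructions; export in one click.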

TypeScript 5,297 584 Updated Dec 20, 2025
Python 31 3 Updated Jun 23, 2025

Code for VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction

Python 3 Updated Dec 10, 2025

Crowdfunded open-source project: use OpenReview's high-quality review data to fine-tune a professional LLM for writing reviews and review responses.

Python 209 12 Updated Apr 26, 2023

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,725 1,175 Updated Sep 26, 2025

Tools for merging pretrained large language models.

Python 6,614 649 Updated Dec 17, 2025
Jupyter Notebook 35 4 Updated Nov 30, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 663 41 Updated Dec 20, 2025

Enjoy the magic of Diffusion models!

Python 11,177 1,053 Updated Dec 20, 2025

The official implementation of dLLM-Var

Python 27 Updated Nov 6, 2025

Physics of Language Models, Part 4

HTML 274 13 Updated Dec 9, 2025

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 76 5 Updated Nov 13, 2025

Scaling Preference Data Curation via Human-AI Synergy

133 1 Updated Jul 3, 2025

Code repository for Group-MATES: Group-Level Data Selection for Efficient Pretraining

Python 8 2 Updated Jun 14, 2025
Python 1,735 77 Updated Dec 16, 2025

Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning

Python 3 Updated Oct 27, 2025

This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.

HTML 150 12 Updated Oct 27, 2025

ERGO (Efficient Reasoning & Guided Observation) is a large vision–language model trained with reinforcement learning on efficiency objectives.

Python 10 Updated Oct 2, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 312 17 Updated Apr 28, 2025

2026 AI/ML internship & new graduate job list updated daily

4,240 173 Updated Dec 20, 2025

The best repository showing why transformers might not be the answer for time series forecasting, showcasing SOTA non-transformer models.

808 57 Updated Nov 14, 2025

Towards a Unified View of Large Language Model Post-Training

Python 195 11 Updated Sep 8, 2025

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

1,427 92 Updated Oct 11, 2025