Skip to content
View ckqqqq's full-sized avatar
  • Bejing University of Posts and Telecommunications
  • Beijing
  • 21:38 (UTC -12:00)
  • X @qiker

Highlights

  • Pro

Block or report ckqqqq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Psy-Insight: Mental Health Oriented Interpretable Multi-turn Bilingual Counseling Dataset for Large Language Model Finetuning

Jupyter Notebook 14 1 Updated Nov 5, 2024

Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding"

JavaScript 56 4 Updated Nov 5, 2025

《labuladong的算法小抄》顺序阅读版

922 311 Updated Aug 8, 2022

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,184 39 Updated Oct 30, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,229 376 Updated Jun 27, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 1 Updated Sep 15, 2023

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 5 1 Updated Aug 29, 2024

helper functions for processing and integrating visual language information with Qwen-VL Series Model

Python 15 5 Updated Aug 30, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Python 15 Updated Feb 17, 2025

VideoAuteur: Towards Long Narrative Video Generation

43 1 Updated Oct 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,314 807 Updated Oct 31, 2025

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 116 6 Updated Dec 10, 2024

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,616 70 Updated May 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,120 2,424 Updated Nov 5, 2025
Python 61 1 Updated Oct 2, 2024
Python 965 65 Updated Mar 24, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 625 50 Updated Apr 8, 2025
Python 4,368 414 Updated Sep 14, 2025

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 125 4 Updated Apr 4, 2025

[CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Python 89 3 Updated Apr 14, 2025

Lets make video diffusion practical!

Python 16,080 1,545 Updated Oct 16, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 728 38 Updated Sep 19, 2025

Witness the aha moment of VLM with less than $3.

Python 3,975 290 Updated May 19, 2025

Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"

Python 39 5 Updated Dec 27, 2024

Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…

Go 8,841 723 Updated Nov 5, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,765 295 Updated Jun 12, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,030 74 Updated Aug 14, 2025
Python 797 46 Updated Jul 8, 2024
Jupyter Notebook 6 Updated Oct 3, 2024
Next