Skip to content
View ylwangy's full-sized avatar

Block or report ylwangy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Jan 27, 2026

Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement

Python 3 Updated Jan 1, 2026

SemPA: Improving Sentence Embeddings of Large Language Models through Semantic Preference Alignment

Python 3 3 Updated Jan 14, 2026

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 308 37 Updated Jan 26, 2026

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

276 5 Updated Feb 9, 2026
Python 313 45 Updated Dec 12, 2025

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

297 20 Updated Feb 10, 2026

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Jupyter Notebook 10 Updated Jun 12, 2025
Python 74 11 Updated May 23, 2024

Attribute-guided reinforcement learning framework for molecular property prediction with large language models.

Python 4 1 Updated Sep 25, 2025

Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"

Python 47 8 Updated Nov 9, 2025

This repository contains the official implementation of the paper **"Improving Rationality in the Reasoning Process of Language Models through Self-playing Game."**

Python 3 1 Updated May 27, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,351 278 Updated Jul 25, 2025
Python 216 36 Updated Jan 5, 2026

[ACL'25 Findings] LDIR (Low-Dimensional Dense Interpretable Text Embeddings with Relative Representations) is a novel text embedding method that balances semantic expressiveness, interpretability, …

Python 8 2 Updated Aug 4, 2025

[ACL'25 Findings] RankedVotingSC (Ranked Voting based Self-Consistency) is a method that generates ranked answers in each reasoning attempt and aggregates them using ranked voting across multiple r…

Python 11 Updated Aug 7, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,324 129 Updated Nov 9, 2025

Bootstrapping ARC

Python 155 24 Updated Nov 20, 2024

国家自然科学基金申请书正文(面上项目)LaTeX 模板(非官方)

BibTeX Style 1,021 254 Updated Jan 24, 2026

Our solution for the arc challenge 2024

Jupyter Notebook 188 33 Updated Jun 17, 2025

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 344 32 Updated Nov 10, 2025

Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"

Python 12 Updated Jul 25, 2024

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 970 51 Updated Aug 14, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,163 1,097 Updated Nov 18, 2024
Python 150 5 Updated Aug 23, 2023

Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"

Python 26 3 Updated Mar 9, 2024

[ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning

Python 68 2 Updated Oct 19, 2025
Python 23 Updated Apr 19, 2024
Next