Skip to content
View ylwangy's full-sized avatar

Block or report ylwangy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 4 Updated Jan 27, 2026

Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement

Python 3 Updated Jan 1, 2026

SemPA: Improving Sentence Embeddings of Large Language Models through Semantic Preference Alignment

Python 4 3 Updated Feb 14, 2026

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 323 39 Updated Jan 26, 2026

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

310 7 Updated Mar 26, 2026
Python 313 45 Updated Dec 12, 2025

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

331 20 Updated Feb 22, 2026

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Jupyter Notebook 11 1 Updated Jun 12, 2025
Python 76 14 Updated May 23, 2024

Attribute-guided reinforcement learning framework for molecular property prediction with large language models.

Python 4 1 Updated Sep 25, 2025

Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"

Python 51 8 Updated Nov 9, 2025

This repository contains the official implementation of the paper **"Improving Rationality in the Reasoning Process of Language Models through Self-playing Game."**

Python 4 1 Updated May 27, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,456 282 Updated Jul 25, 2025
Python 219 37 Updated Jan 5, 2026

[ACL'25 Findings] LDIR (Low-Dimensional Dense Interpretable Text Embeddings with Relative Representations) is a novel text embedding method that balances semantic expressiveness, interpretability, …

Python 8 2 Updated Aug 4, 2025

[ACL'25 Findings] RankedVotingSC (Ranked Voting based Self-Consistency) is a method that generates ranked answers in each reasoning attempt and aggregates them using ranked voting across multiple r…

Python 11 Updated Aug 7, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,407 128 Updated Nov 9, 2025

Bootstrapping ARC

Python 156 25 Updated Nov 20, 2024

国家自然科学基金申请书正文(面上项目)LaTeX 模板(非官方)

BibTeX Style 1,033 260 Updated Jan 24, 2026

Our solution for the arc challenge 2024

Jupyter Notebook 189 33 Updated Jun 17, 2025

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 345 32 Updated Nov 10, 2025

Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"

Python 12 Updated Jul 25, 2024

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 988 55 Updated Mar 17, 2026

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,195 1,100 Updated Nov 18, 2024
Python 152 5 Updated Aug 23, 2023

Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"

Python 26 3 Updated Mar 9, 2024

[ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning

Python 70 2 Updated Oct 19, 2025
Python 23 Updated Apr 19, 2024
Next