Skip to content
View BestSonny's full-sized avatar
🤡
Focusing
🤡
Focusing

Highlights

  • Pro

Block or report BestSonny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 444 52 Updated Jan 26, 2026

[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Python 78 10 Updated Oct 9, 2025

Hierarchal Agent Loop Optimizer

Python 596 53 Updated May 15, 2026

Data recipes and robust infrastructure for training AI agents

Python 148 25 Updated May 17, 2026

[ICLR'26] "Nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space" by Peihao Wang*, Ruisi Cai*, Zhen Wang, Hongyuan Mei, Qiang Liu, Pan Li, Zhangyang Wang

Python 31 Updated Mar 10, 2026

Python library to train neural networks with a strong focus on hydrological applications.

Python 540 264 Updated Apr 7, 2026

A global community dataset for large-sample hydrology

Jupyter Notebook 256 51 Updated Jul 24, 2025

Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours

Python 326 36 Updated May 15, 2026

ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engineered designs for agentic system.

Python 209 23 Updated Apr 8, 2026

MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entirely different from those used for meta-training.

Python 71 14 Updated Jun 5, 2020

Awesome List for On-Policy Distillation

338 4 Updated May 16, 2026

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Python 4,079 791 Updated Feb 7, 2024
Python 63 12 Updated Jun 11, 2025

BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments

Python 106 23 Updated Jul 6, 2025

🖥 Neural Computers' Data Engine

Python 194 26 Updated Apr 13, 2026
Python 3 Updated Apr 18, 2026

A unified multimodal model toolkit

Python 113 8 Updated May 14, 2026

Hybrid ML + physics model of the Earth's atmosphere

Python 974 126 Updated May 16, 2026

TorchCFM: a Conditional Flow Matching library

Python 2,463 212 Updated Apr 20, 2026

A paper list for Time series modelling, including prediciton and anomaly detection

95 21 Updated Mar 13, 2020

A Super AI Lab with massive AI Doctors as Assistants. Best IDE for Research via AI Power.

JavaScript 950 97 Updated May 6, 2026

Verlog: A Multi-turn RL framework for LLM agents

Python 74 7 Updated Apr 28, 2026

[ICML 2026] Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations

Python 75 9 Updated Feb 15, 2026
Python 1,128 97 Updated Jan 25, 2026

Explain Before You Answer: A Survey on Compositional Visual Reasoning

309 34 Updated Oct 17, 2025
Next