Starred repositories
Real-time decision features without streaming infra. Turn live events into product reflexes — no Kafka, no Flink, no feature store.
Crawl accepted papers and citation data from ML/DL/NLP/CV/Robotics/Security/SE conferences
Code and data for study: "Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories"
[EMNLP'25] Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory
LILO: Library Induction with Language Observations
Frankentext: Stitching random text fragments into long-form narratives [ACL '26]
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Code accompanying the paper "Massive Activations in Large Language Models"
🌎💪 BrowserGym, a Gym environment for web task automation
Awesome GUI Agent Paper List
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Code for paper Empowering Large Language Model Agents through Action Learning
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
Statsmodels: statistical modeling and econometrics in Python
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Lean 3 material for Kevin Buzzard's 2021 TCC course on formalising mathematics. Lean 4 version available here: https://github.com/ImperialCollegeLondon/formalising-mathematics-2024
Companion webpage for the book "Bayesian Optimization" by Roman Garnett