Skip to content
Change the repository type filter

All

    Repositories list

    • [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
      Python
      14200Updated Apr 10, 2026Apr 10, 2026
    • TTC-Net

      Public
      Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
      Python
      MIT License
      11310Updated Mar 11, 2026Mar 11, 2026
    • [ICLR'26] "Nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space" by Peihao Wang*, Ruisi Cai*, Zhen Wang, Hongyuan Mei, Qiang Liu, Pan Li…
      Python
      MIT License
      02500Updated Mar 10, 2026Mar 10, 2026
    • VLM-3R

      Public
      [CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
      Python
      Other
      2536990Updated Mar 9, 2026Mar 9, 2026
    • [AAAI'26 Oral] Oscillation Inversion: Training-free Image and Video Enhancement through Oscillated Latents in Large Flow Models
      MIT License
      0110Updated Nov 16, 2025Nov 16, 2025
    • LoX

      Public
      [COLM 2025] LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
      Python
      MIT License
      0610Updated Nov 13, 2025Nov 13, 2025
    • WeLore

      Public
      [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
      Python
      15210Updated Oct 30, 2025Oct 30, 2025
    • Neon

      Public
      [ICLR 2026 Oral] Neon: Negative Extrapolation From Self-Training Improves Image Generation
      Python
      MIT License
      22200Updated Oct 7, 2025Oct 7, 2025
    • WFM-TTS

      Public
      [COLM 2025] Can Test-Time Scaling Improve World Foundation Model?
      Jupyter Notebook
      0700Updated Aug 13, 2025Aug 13, 2025
    • Jupyter Notebook
      3400Updated Jul 2, 2025Jul 2, 2025
    • ViHGNN

      Public
      [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang
      Python
      MIT License
      126140Updated Jun 13, 2025Jun 13, 2025
    • TAPE

      Public
      [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi Cai, Jason D. Lee, Pan…
      Python
      MIT License
      21510Updated Jun 6, 2025Jun 6, 2025
    • E3D-Bench

      Public
      Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"
      110320Updated Jun 4, 2025Jun 4, 2025
    • [ICLR 2024] Principled Architecture-aware Scaling of Hyperparameters
      Python
      MIT License
      0600Updated Jun 1, 2025Jun 1, 2025
    • SteepGS

      Public
      [CVPR 2025] Steepest Descent Density Control for Compact 3D Gaussian Splatting
      JavaScript
      0500Updated May 15, 2025May 15, 2025
    • R-Sparse

      Public
      [ICLR'25] R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
      Python
      51920Updated Apr 28, 2025Apr 28, 2025
    • [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, Souvik Kundu, Zhangyang W…
      Python
      21600Updated Apr 21, 2025Apr 21, 2025
    • llm-kick

      Public
      [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.
      Python
      72710Updated Apr 21, 2025Apr 21, 2025
    • SEAL

      Public
      [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free
      Python
      45630Updated Apr 6, 2025Apr 6, 2025
    • [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, P…
      Python
      MIT License
      11700Updated Mar 21, 2025Mar 21, 2025
    • [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk
      Python
      MIT License
      31520Updated Mar 15, 2025Mar 15, 2025
    • [NAACL 2025] Extracting and Understanding the Superficial Knowledge in Alignment, Runjin Chen, Gabriel Jacob Perin, Xuxi Chen, Xilun Chen, Yan Han, Nina S. T. …
      Python
      0500Updated Jan 25, 2025Jan 25, 2025
    • [3DV 2026] VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment
      Python
      Apache License 2.0
      414031Updated Jan 21, 2025Jan 21, 2025
    • [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
      Python
      834260Updated Jan 21, 2025Jan 21, 2025
    • [NeurIPS 2024 Spotlight]"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia…
      Python
      Other
      74790171Updated Dec 30, 2024Dec 30, 2024
    • READ-ME

      Public
      [NeurIPS2024] "Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design", Ruisi Cai, Yeonju Ro, Geon-Woo Kim, Peihao Wang, Babak…
      Python
      21500Updated Dec 16, 2024Dec 16, 2024
    • [Preprint] "Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization" by Xuxi Chen, Zhendong Wang, Daouda Sow, Junjie Y…
      Python
      2410Updated Dec 2, 2024Dec 2, 2024
    • [IROS 2024] MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements
      Python
      Other
      2020470Updated Oct 16, 2024Oct 16, 2024
    • VitaGPT: A tailored AI assistant in honor of Teacher's Day 2024 created by all vita members
      Python
      0100Updated Sep 10, 2024Sep 10, 2024
    • LoCoCo

      Public
      [ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen
      Python
      01730Updated Sep 7, 2024Sep 7, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.