Amazon
- Seattle
- shownx.github.io
- https://orcid.org/0000-0003-3814-7203
Starred repositories
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.
A server-side CKKS GPU library fully interoperable with OpenFHE.
The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
This repository contains LLM (large language model) interview questions asked at top companies like Google, Nvidia, Meta, Microsoft, and Fortune 500 companies.
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
verl: Volcano Engine Reinforcement Learning for LLMs
On the Theoretical Limitations of Embedding-Based Retrieval
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
Reference PyTorch implementation and models for DINOv3
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation
Renderer for the harmony response format to be used with gpt-oss
A platform for community discussion. Free, open, simple.
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
Fast and memory-efficient exact attention
Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning (CVPR25)
[NeurIPS 2025 Spotlight] Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
[CVPR'25] Do Your Best and Get Enough Rest for Continual Learning
Rapid fuzzy string matching in Python using various string metrics
[WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong Li and Radu Marculescu
[ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model
Implementation for NeurIPS 2024 paper "SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models" (https://arxiv.org/abs/2411.02175)