-
Alcon
- Los Angeles, CA
- http://oliverwu.georgetown.domains/
Highlights
Stars
[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation
An agentic skills framework & software development methodology that works.
Differentiable wave optics simulation library built on PyTorch
Differentiable optical lens simulator for end-to-end computational imaging.
A deep learning package for many-body potential energy representation and molecular dynamics
Curated collection of AI prompts for MATLAB development - enhance your workflow with MATLAB Copilot, GitHub Copilot, Claude, Cursor, and other AI coding assistants. Includes prompts for Live Script…
A collection of DESIGN.md files analysis by popular brand design systems. Drop one into your project and let coding agents generate a matching UI.
Codes of paper "Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling", ICRA 2022
[CVPR 2026 (Highlight)] 4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation
Comprehensive optical design, optimization, and analysis in Python, including GPU-accelerated and differentiable ray tracing via PyTorch.
Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning. [ICLR 2026]
This is the open source repository for our IEEE Transactions on Computational Imaging 2022 paper "dO: A differentiable engine for Deep Lens design of computational imaging systems".
AI agents running research on single-GPU nanochat training automatically
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
This course is designed to guide beginners through the exciting world of Edge AI, covering fundamental concepts, popular models, inference techniques, device-specific applications, model optimizati…
Animated sprite editor & pixel art tool (Windows, macOS, Linux)
DigitalPlat FreeDomain: Free Domain For Everyone
Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"
The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Witness the aha moment of VLM with less than $3.
Training VLM agents with multi-turn reinforcement learning