- Mila, Université de Montréal
- Montreal, QC, Canada
- https://hiroki11x.github.io/
- @_hiroki11x
- in/hiroki11x
Stars
A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…
Lean Formalization of Generalization Error Bound by Rademacher Complexity
Academic Research Skills for Claude Code: research → write → review → revise → finalize
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
The simplest, fastest repository for training/finetuning small-sized VLMs.
slime is an LLM post-training framework for RL Scaling.
Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces
Turn any AI agent into an AI Scientist. The #1 Agent Skills library for science, used by 160,000+ scientists worldwide. 140 ready-to-use skills plus 100+ scientific databases covering biology, chem…
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
ImageNet-Sketch dataset for evaluating a model's ability to learn (out-of-domain) semantics at ImageNet scale
A collection of optimization problems in mathematics
A theory of optimal learning rate schedules in SGD from optimal control theory
Scalable Computing for Advanced Library and Environment
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training (TMLR 2025)
Implementation for the paper: A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias
CellViT: Vision Transformers for Precise Cell Segmentation and Classification
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…