Skip to content
View AvivBick's full-sized avatar

Highlights

  • Pro

Block or report AvivBick

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 73 3 Updated May 29, 2026
Python 7 1 Updated Mar 9, 2026

πŸš€ Efficient implementations for emerging model architectures

Python 5,217 556 Updated Jun 11, 2026

Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"

Python 15 3 Updated Apr 30, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,898 402 Updated Mar 27, 2026

Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".

289 11 Updated Apr 21, 2022

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

TeX 410 12 Updated Mar 2, 2025

πŸš€ Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.

Python 224 107 Updated Jun 13, 2026

Chat Templates for πŸ€— HuggingFace Large Language Models

Jinja 719 67 Updated Dec 13, 2024

Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)

Python 125 15 Updated Sep 13, 2024

A playbook for systematically maximizing the performance of deep learning models.

30,190 2,423 Updated Jun 18, 2024

A quick guide (especially) for trending instruction finetuning datasets

3,393 238 Updated Nov 28, 2023

Reading list for research topics in state-space models

364 38 Updated May 18, 2026

Metric Learning (npair loss & angular loss) on mnist and Visualizing by t_SNE

Python 35 7 Updated Feb 15, 2023

A list of contrastive Learning papers

311 37 Updated Apr 13, 2022

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Jupyter Notebook 76 9 Updated Feb 12, 2020