Skip to content
View linkage001's full-sized avatar

Block or report linkage001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 268 31 Updated Apr 23, 2024

Fast parallel LLM inference for MLX

Jupyter Notebook 1 Updated Aug 18, 2024

huggingface chat-ui integration with mlx-lm server

Shell 62 3 Updated Feb 13, 2024

Fast parallel LLM inference for MLX

Jupyter Notebook 249 21 Updated Jul 7, 2024

On-the-fly Goliath-style transformer franken-merges with a tiny memory footprint

Python 2 2 Updated Dec 29, 2024

On-device AI across mobile, embedded and edge for PyTorch

Python 4,552 962 Updated Apr 28, 2026

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Python 5,521 428 Updated Apr 9, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,083 2,060 Updated Apr 27, 2026

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,508 330 Updated Mar 4, 2026

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,476 797 Updated Mar 15, 2025