Lists (1)
Sort Name ascending (A-Z)
Stars
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A simple screen parsing tool towards pure vision based GUI agent
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use thβ¦
This repository contains implementations and illustrative code to accompany DeepMind publications
AirLLM 70B inference with single 4GB GPU
Code release for NeRF (Neural Radiance Fields)
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
this repository accompanies the book "Grokking Deep Learning"
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
Taming Transformers for High-Resolution Image Synthesis
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs π π π
Notebooks using the Hugging Face libraries π€
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
Fast and Easy Infinite Neural Networks in Python
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, Bβ¦
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
Implementing the 4 agentic patterns from scratch
Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains
This repo contains the code for 1D tokenizer and generator
OLMoE: Open Mixture-of-Experts Language Models