Skip to content
@LLM-Dev-BB

LLM-Dev-BB

Popular repositories Loading

  1. GPU-Benchmarks-on-LLM-Inference GPU-Benchmarks-on-LLM-Inference Public

    Forked from XiongjieDai/GPU-Benchmarks-on-LLM-Inference

    Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

    Jupyter Notebook 1

  2. llama.cpp llama.cpp Public

    Forked from ggml-org/llama.cpp

    LLM inference in C/C++

    C++

  3. ollama ollama Public

    Forked from ollama/ollama

    Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

    Go

  4. llm-awq llm-awq Public

    Forked from mit-han-lab/llm-awq

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python

  5. netron netron Public

    Forked from lutzroeder/netron

    Visualizer for neural network, deep learning and machine learning models

    JavaScript

  6. pytorch pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python

Repositories

Showing 10 of 34 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…