Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
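As a minimal sketch of the tensor-parallel idea described above, the snippet below splits a linear layer's weight matrix column-wise across worker threads (standing in for networked devices): each shard holds roughly 1/n of the weights, computes its slice of the output, and the slices are gathered at the end. All names, shapes, and values here are illustrative assumptions, not code from any project listed on this page.

```cpp
// Tensor parallelism for one linear layer, y = x * W:
// W (in_dim x out_dim) is split column-wise into shards, one per "device".
// Threads stand in for devices; a real cluster would all-gather over the network.
#include <cstddef>
#include <cstdio>
#include <thread>
#include <vector>

std::vector<float> tensor_parallel_matmul(
    const std::vector<float>& x,                    // input, length in_dim
    const std::vector<std::vector<float>>& shards,  // each in_dim x shard_out, row-major
    std::size_t in_dim, std::size_t shard_out) {
  std::vector<float> y(shards.size() * shard_out, 0.0f);
  std::vector<std::thread> workers;
  for (std::size_t s = 0; s < shards.size(); ++s) {
    workers.emplace_back([&, s] {
      // Each "device" owns only its shard of the weights, so per-device
      // RAM usage is roughly 1/n of the full matrix.
      for (std::size_t j = 0; j < shard_out; ++j) {
        float acc = 0.0f;
        for (std::size_t i = 0; i < in_dim; ++i)
          acc += x[i] * shards[s][i * shard_out + j];
        y[s * shard_out + j] = acc;  // disjoint output slice per shard
      }
    });
  }
  for (auto& w : workers) w.join();  // stand-in for the all-gather step
  return y;
}

int main() {
  const std::size_t in_dim = 4, shard_out = 2;
  std::vector<float> x = {1, 2, 3, 4};
  // Two column shards of a 4x4 identity matrix (illustrative values).
  std::vector<std::vector<float>> shards = {
      {1, 0, 0, 1, 0, 0, 0, 0},   // columns 0..1
      {0, 0, 0, 0, 1, 0, 0, 1}};  // columns 2..3
  auto y = tensor_parallel_matmul(x, shards, in_dim, shard_out);
  for (float v : y) std::printf("%g ", v);  // prints: 1 2 3 4
  std::printf("\n");
}
```

Because the shards never need each other's weights, the same pattern divides both the memory footprint and the per-token compute across however many devices join the cluster.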
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
A great hands-on project for campus recruiting (autumn/spring hiring and internships): build, from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.
LLaVA server (llama.cpp).
Runs LLaMA with extremely high speed.
Explores LLM deployment on AXera's AI chips.
Llama causal LM fully recreated in LibTorch, designed for use in Unreal Engine 5.
Super easy-to-use library for working with LLaMA/GPT-J! - Mirror of: https://gitlab.com/niansa/libjustlm
Multi-model and multi-tasking LLaMA Discord bot - Mirror of: https://gitlab.com/niansa/discord_llama