(prev. "Candle Chat")
An LLM interface implemented in pure Rust using HuggingFace/Candle over Axum WebSockets, an SQLite database, and a Leptos (Wasm) frontend packaged with Tauri!
This project is designed for single- and multi-user chat with many Large Language Models (LLMs).
- Local or Remote Inference Backend
- Local or Remote SQLite Database
You can configure your model and default inference settings by putting files in your `Config Directory`. This is automatically configured when you choose a model in the frontend, but you can manually add models if you like.
Example:
```yaml
# config_model.yaml
repo_id: DanielClough/Candle_Puffin-Phi-v2
q_lvl: q2k
revision: main
tokenizer_file: null
weight_file: null
quantized: true
cpu: false
use_flash_attn: false
template: ShareGPT
```
```yaml
# config_inference.yaml
temperature:
top_p:
seed: 299792458
sample_len: 150
repeat_penalty: 1.3
repeat_last_n: 150
load_context: false
role:
```
If `load_context: true`, then you can add (small) files in `<Config Directory>/fireside-chat/context/`. Large files may cause Out-Of-Memory errors.
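As a minimal sketch (assuming Linux, and assuming the path above resolves to `$HOME/.config/fireside-chat/context/`; `notes.txt` is a hypothetical file name), a small context file could be added like this:

```sh
# Create the context directory and copy a small plain-text file into it.
# Path and file name are assumptions for illustration only.
mkdir -p "$HOME/.config/fireside-chat/context"
cp notes.txt "$HOME/.config/fireside-chat/context/"
```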
Your `Config Directory` is `$HOME/.config/fireside-chat`.
Your `Cache Directory` is `$HOME/.cache/hugging-face`.
You can compile with the environment variables `FIRESIDE_BACKEND_URL` and `FIRESIDE_DATABASE_URL` to call a server other than `localhost`. These can be configured in `tauri.conf.json`, or in your system environment.
```sh
# e.g. for Linux
export FIRESIDE_BACKEND_URL=192.168.1.6 && trunk serve
```
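Both variables can be set the same way; the sketch below is illustrative only (the addresses are placeholders, and the expected value format for `FIRESIDE_DATABASE_URL` may differ in your setup):

```sh
# Placeholder addresses for a remote backend and database host (not defaults).
export FIRESIDE_BACKEND_URL=192.168.1.6
export FIRESIDE_DATABASE_URL=192.168.1.7
trunk serve
```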
- I am not testing in Windows environments.