Tsukaima is a tool to call local large language models (LLMs) using the existing OpenAI ChatGPT clients. Currently, the following models are supported.
- rinna LLM
- line-corporation/japanese-large-lm-3.6b-instruction-sft
- elyza/ELYZA-japanese-Llama-2-7b-instruct
$ python3 -m venv myvenv
$ source myvenv/bin/activate
(myvenv) $ pip install -U git+https://github.com/shirayu/tsukaima.git
(myvenv) $ wget https://raw.githubusercontent.com/shirayu/tsukaima/main/examples_config/rinna.json -O rinna.json
(myvenv) $ tsukaima --host 0.0.0.0 --port 6006 --config ./rinna.jsonSet API endpoint to the address (Eg: http://0.0.0.0:6006/v1/chat/completions) to use ChatGPT clients such as BetterChatGPT.
Check other config examples.
- Messages whose
roleissystemwill be ignored
Please read tsukaima.schema.schema
- You may need install NCCL
sudo apt install libnccl-dev libnccl2