Self-hosted large language model servers and chat interfaces.
Run large language models locally
User-friendly WebUI for LLMs (Ollama, OpenAI API)
LLM inference in C/C++