Self-hosted LLM servers and chat interfaces.
Run large language models locally
User-friendly WebUI for LLMs (Ollama, OpenAI API)
LLM inference in C/C++
AI observability & evaluation: LLM tracing, evaluation, and RAG troubleshooting