# Web API Endpoints: ```text GET /health GET /v1/status GET /v1/models/roles GET /v1/models/ping POST /v1/chat POST /v1/tasks GET /v1/tasks GET /v1/tasks/{task_id} GET /v1/tasks/{task_id}/events GET /v1/tasks/{task_id}/stream GET /v1/approvals/pending POST /v1/approvals/{approval_id}/allow_once POST /v1/approvals/{approval_id}/allow_forever POST /v1/approvals/{approval_id}/deny GET /v1/skills GET /v1/skills/{skill_id} GET /v1/experience GET /v1/experience/{id} GET /v1/memory/search?q=... ``` Chat requests accept optional `reasoning`: ```json { "message": "hello", "reasoning": "auto" } ``` Allowed values: - `auto`: use the `llama-server` default. - `on`: pass `enable_thinking=true` for the thinker response. - `off`: pass `enable_thinking=false` and `thinking_budget_tokens=0`.