ducklm/docs/web_api.md

810 B

Web API

Endpoints:

GET  /health
GET  /v1/status
GET  /v1/models/roles
GET  /v1/models/ping
POST /v1/chat
POST /v1/tasks
GET  /v1/tasks
GET  /v1/tasks/{task_id}
GET  /v1/tasks/{task_id}/events
GET  /v1/tasks/{task_id}/stream
GET  /v1/approvals/pending
POST /v1/approvals/{approval_id}/allow_once
POST /v1/approvals/{approval_id}/allow_forever
POST /v1/approvals/{approval_id}/deny
GET  /v1/skills
GET  /v1/skills/{skill_id}
GET  /v1/experience
GET  /v1/experience/{id}
GET  /v1/memory/search?q=...

Chat requests accept optional reasoning:

{
  "message": "hello",
  "reasoning": "auto"
}

Allowed values:

  • auto: use the llama-server default.
  • on: pass enable_thinking=true for the thinker response.
  • off: pass enable_thinking=false and thinking_budget_tokens=0.