Commit 8e8b163
committed
feat: send full conversation history with each request so the model has context
Previously every send was a single-turn POST: the agent only saw the
new user message, so it couldn't remember the rest of the chat.
Now the frontend bundles the active conversation's full message history
(filtered to user+assistant turns, dropping the thinking placeholder)
into a 'messages' array on the request payload. The handler receives
this and forwards it to vLLM as a multi-turn chat — system prompt,
prior turns, then current user message (with any attached image parts
bound to that last user message).
Backend already supports this in v1.1.0 via the new schema (either
'prompt' or 'messages' is accepted).1 parent 555bfc9 commit 8e8b163
1 file changed
Lines changed: 4 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
300 | 300 | | |
301 | 301 | | |
302 | 302 | | |
303 | | - | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
304 | 307 | | |
305 | 308 | | |
306 | 309 | | |
| |||
0 commit comments