Someone please help me work /slot/action?=save and /slot/action?=restore #9781

Answered by ggerganov
dhandhalyabhavik asked this question in Q&A
The slot restore logic restores only the KV cache state, not the text representation of the prompt. As a result, after a restore the /slots status reports a stale value for "prompt".

I've made a workaround in #9800.

But note that even though the reported state is incorrect, the actual KV cache should have been restored correctly. So if you try to send a new query, it should reuse the cached tokens from the initial run.
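To check this yourself, here is a minimal sketch of the restore-then-query flow. It rests on a few assumptions that are not stated in this thread: the server listens on `http://localhost:8080`, it was started with `--slot-save-path`, the slot endpoint has the form `/slots/{id_slot}?action=restore` with a JSON body containing `filename`, and `/completion` accepts `id_slot` and `cache_prompt`. The helper names and the file name `slot0.bin` are hypothetical.

```python
import json
import urllib.request

# Assumption: llama-server running locally on its default port.
BASE_URL = "http://localhost:8080"


def slot_action_request(slot_id, action, filename, base_url=BASE_URL):
    """Build a POST request for /slots/{slot_id}?action=save|restore."""
    if action not in ("save", "restore"):
        raise ValueError(f"unsupported action: {action}")
    url = f"{base_url}/slots/{slot_id}?action={action}"
    body = json.dumps({"filename": filename}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def completion_request(prompt, slot_id, base_url=BASE_URL):
    """Build a /completion request pinned to one slot with prompt caching on."""
    payload = {
        "prompt": prompt,
        "id_slot": slot_id,      # assumption: pins the request to this slot
        "cache_prompt": True,    # ask the server to reuse cached tokens
        "n_predict": 16,
    }
    return urllib.request.Request(
        f"{base_url}/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(req):
    """Send a request and decode the JSON response."""
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def restore_and_query(slot_id, filename, prompt):
    """Restore a slot from disk, then send a prompt that shares its prefix."""
    restored = send(slot_action_request(slot_id, "restore", filename))
    result = send(completion_request(prompt, slot_id))
    # Return the timing info (if any) so the caller can inspect prompt reuse.
    return restored, result.get("timings", {})
```

Calling `restore_and_query(0, "slot0.bin", original_prompt + " And then?")` against a running server should return the restore response and the completion timings. If the timings report how many prompt tokens were processed, a count much smaller than the full prompt length would suggest the restored KV cache was reused, regardless of what /slots shows for "prompt".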

Answer selected by dhandhalyabhavik