Add cancel() method to interrupt a stream #733
simonchatts wants to merge 1 commit into abetlen:main
Conversation

Please accept this PR, @abetlen.

Actually, I found an issue with this method: it only cancels after a token has been generated, so if the LLM is slow or gets stuck processing the prompt, this doesn't cancel it. We need a better method.

I'm coming back to this because I need to figure out a better way to interrupt generation programmatically. For a console-based scenario it's easy in Python: I just wrap the code in `try ... except KeyboardInterrupt:` and can press Ctrl+C at any point to gracefully interrupt the LLM. But with a front-end user interface, I haven't managed to make it work properly, say with a "Stop generating" button that calls a Python function, because of the issue I mentioned in the previous post. @abetlen, sorry to bother you again, but do you have any suggestions/ideas on how to accomplish this?
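
For readers landing here, below is a minimal sketch of the two approaches described above, assuming the current streaming API. The model path and the `stop_event` wiring are illustrative and not part of this PR:

```python
import threading
from llama_cpp import Llama

llm = Llama(model_path="./models/model.gguf")  # illustrative model path
stop_event = threading.Event()                 # e.g. set by a "Stop generating" button

def generate(prompt: str) -> str:
    out = []
    try:
        for chunk in llm.create_completion(prompt, stream=True):
            if stop_event.is_set():
                break  # only checked between tokens, hence the problem described above
            out.append(chunk["choices"][0]["text"])
    except KeyboardInterrupt:
        pass  # Ctrl+C covers the console case
    return "".join(out)
```

Like the proposed `cancel()`, the flag is only consulted between tokens, so it cannot interrupt a model that is still busy with prompt processing.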

Force-pushed from 8c93cf8 to cc0fe43.

Why not add it now and improve it later if a better solution comes along? For now this would work in most cases.

Has anyone found a reasonable solution for this? Or am I the only one not willing to either wait until the model finishes or kill the job and lose the context?

Any chance this gets merged for now?

It indeed blocks until the first token is produced, but cancelling it after that is trivial. The other similar issue is cancelling a model that is loading.

The gpt4all Python bindings offer a similar mechanism, which allows stopping at the next token.

+1, can we merge this?

Take a look at ggml-org/llama.cpp#10509, which should permanently solve this problem on llama.cpp's side.

Fixes #599.
Thanks for all your work on this project!
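
As a rough sketch of how the proposed `cancel()` method might be used, assuming it is exposed on the `Llama` object: the model path, the timer, and the exact call site below are illustrative, so check the diff for the actual API.

```python
import threading
from llama_cpp import Llama

llm = Llama(model_path="./models/model.gguf")  # illustrative model path

# Hypothetical: a UI handler, signal handler, or (here) a timer calls cancel()
# from another thread while the main thread consumes the stream.
threading.Timer(5.0, llm.cancel).start()

# As discussed in the conversation, the stream only stops once the next token
# is produced; a model stuck in prompt processing is not interrupted.
for chunk in llm.create_completion("Write a long story.", stream=True):
    print(chunk["choices"][0]["text"], end="", flush=True)
```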