
llama.android: add field formatChat to control whether to parse special tokens when sending a message #11270


Merged: 1 commit into ggml-org:master on Jan 17, 2025

Conversation

@codezjx (Contributor) commented Jan 17, 2025

Fix issue #11264
If the user uses a chat template to send a message, the message needs to be tokenized with parse_special = true in common_tokenize(). This PR adds a formatChat field to control whether special tokens are parsed when sending a message via LLamaAndroid.send(). The app can set formatChat according to its actual situation.

Sample Code:

val msg = "<|im_start|>system\n" +
    "You are a helpful AI assistant<|im_end|>\n" +
    "<|im_start|>user\n" +
    "$text<|im_end|>\n" +
    "<|im_start|>assistant\n"
viewModelScope.launch {
    // Explicitly set formatChat = true
    llamaAndroid.send(msg, true)
        .catch {
            Log.e(tag, "send() failed", it)
            messages += it.message!!
        }
        .collect { messages = messages.dropLast(1) + (messages.last() + it) }
}
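
For reference, the library-side change looks roughly like the following. This is a condensed sketch of LLamaAndroid.send(), not the exact diff; in particular, the position of formatChat in the completion_init() JNI signature is an assumption based on the description above.

// Condensed sketch: formatChat defaults to false so existing callers keep the
// old behavior.
fun send(message: String, formatChat: Boolean = false): Flow<String> = flow {
    when (val state = threadLocalState.get()) {
        is State.Loaded -> {
            // formatChat is forwarded to the native completion_init(), which uses it
            // as parse_special when tokenizing the prompt with common_tokenize().
            val ncur = IntVar(completion_init(state.context, state.batch, message, formatChat, nlen))
            while (ncur.value <= nlen) {
                val str = completion_loop(state.context, state.batch, nlen, ncur) ?: break
                emit(str)
            }
            kv_cache_clear(state.context)
        }
        else -> {}
    }
}.flowOn(runLoop)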

Before: (screenshot)

After: (screenshot)

@ggerganov (Member) left a comment


For this basic example this is OK, but keep in mind that applying parse_special = true to the full context is susceptible to special token injection from the user. The correct way is to use llama_chat_apply_template.
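
One way to picture the safer pattern: the app passes structured messages, and a native binding around llama_chat_apply_template builds the prompt from the model's own template, instead of the app hard-coding ChatML. A hypothetical Kotlin-side sketch; the applyChatTemplate binding below does not exist in LLamaAndroid and stands in for a JNI call into llama_chat_apply_template:

// Hypothetical sketch: a structured-message API instead of a hand-built prompt.
// applyChatTemplate is an assumed JNI wrapper around llama_chat_apply_template.
data class ChatMessage(val role: String, val content: String)

val messages = listOf(
    ChatMessage("system", "You are a helpful AI assistant"),
    ChatMessage("user", text), // raw user input; the app never wraps it in special tokens
)
val prompt = llamaAndroid.applyChatTemplate(messages)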

@codezjx (Contributor, Author) commented Jan 17, 2025

@ggerganov Thanks for your reminder!

@ggerganov merged commit 3edfa7d into ggml-org:master on Jan 17, 2025
1 check passed
@ggerganov (Member) commented

Yup, I initially thought, incorrectly, that we handle this special token injection in the existing chat examples, but we don't. It's something to fix in the future.

@codezjx (Contributor, Author) commented Jan 17, 2025

@ggerganov Is the special token injection you just mentioned similar to the following behavior? I tested llama-cli in conversation mode, and the user-input special token <|endoftext|> is encoded as token 0. It seems that the user's input needs to be tokenized separately (with parse_special = false). 🤔

waiting for user input

> special token <|endoftext|> injection test
buffer: 'special token <|endoftext|> injection test
'
formatted: '
<|im_start|>user
special token <|endoftext|> injection test
<|im_end|>
<|im_start|>assistant
'
input tokens: [ '':198, '<|im_start|>':1, 'user':4093, '':198, 'special':19159, ' token':9624, ' ':216, '<|endoftext|>':0, ' injection':12852, ' test':1028, '':198, '<|im_end|>':2, '':198, '<|im_start|>':1, 'ass':520, 'istant':9531, '':198 ]
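
The separate-tokenization fix the comment sketches would look roughly like this. A hypothetical sketch only: tokenize(text, parseSpecial) is an assumed binding around common_tokenize(), not part of the current LLamaAndroid API.

// Hypothetical sketch: parse special tokens only in the segments the app controls,
// so a pasted "<|endoftext|>" stays plain text instead of becoming token 0.
val tokens =
    tokenize("<|im_start|>user\n", parseSpecial = true) +
    tokenize(userInput, parseSpecial = false) +
    tokenize("<|im_end|>\n<|im_start|>assistant\n", parseSpecial = true)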

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
Labels: android (Issues specific to Android examples)