What determines the format of the prompt template?

ffhein@lemmy.world · 1 year ago

What determines the format of the prompt template?

rufus@discuss.tchncs.de · edit-2 1 year ago

You’re right. It’s solely based on how the training data was formatted.

I’m pretty sure this is an error in TheBloke’s description.

(Oobabooga’s webui also includes those tags: https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Llama-v2.yaml )

ffhein@lemmy.world · 1 year ago

Thanks! I’m going to do some experiments and see if I get different results. I’ve been using TheBloke’s format and it worked mostly well, but perhaps switching to meta-llama’s format will eliminate the occasional bugs I’ve had.

rufus@discuss.tchncs.de · 1 year ago

That’s probably the most reasonable thing you can do.

I’m not sure how much of a difference we expect from 100% the correct prompt compared to something roughly in that direction. I’ve been tinkering around with instruction style tuned models (from the previous/first llama) and sometimes it doesn’t seem to matter. I also sometimes used a ‘wrong’ prompt for days and couldn’t tell. Maybe the models are ‘intelligent’ enough to compensate for that. I’m not sure. I usually try to get it right to get all the performance out of it.

h3ndrik@feddit.de · 1 year ago

https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML/discussions/7