Skip to content

Commit b628308

Browse files
felipemello1Felipe Mello
andauthored
easy - [fix] update prompt (#654)
Co-authored-by: Felipe Mello <felipemello@fb.com>
1 parent 22d446f commit b628308

File tree

1 file changed

+8
-4
lines changed

1 file changed

+8
-4
lines changed

apps/grpo/data.py

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -30,10 +30,14 @@ async def setup(self):
3030
self._epoch = 0
3131

3232
def gsm8k_transform(sample):
33-
system_prompt = """
34-
Put all your scratchpad work between <think> and </think> tags.
35-
Your final answer should be between <answer> and </answer> tags otherwise it will not be scored.
36-
"""
33+
system_prompt = (
34+
"A conversation between User and Assistant. The user asks a question, "
35+
"and the Assistant solves it. The assistant first thinks about the reasoning "
36+
"process and then provides the user with the answer. The reasoning "
37+
"process and answer are enclosed within <think></think> and <answer></answer> "
38+
"tags, respectively, i.e., <think>reasoning process here</think> "
39+
"<answer>answer here</answer>."
40+
)
3741
request: str = sample["question"]
3842
as_chat = [
3943
{"role": "system", "content": system_prompt},

0 commit comments

Comments
 (0)