We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
qwen3_32b.yaml
1 parent c30ddf5 commit f6e7176Copy full SHA for f6e7176
apps/grpo/qwen3_32b.yaml
@@ -1,5 +1,5 @@
1
# Grouped Relative Policy Optimization (GRPO)
2
-# >>> python -m apps.grpo.main --config apps/grpo/qwen32b.yaml
+# >>> python -m apps.grpo.main --config apps/grpo/qwen3_32b.yaml
3
# NOTE - This has not been tested for correctness yet! All testing so far has been only for infrastructure stability
4
5
# Global configuration
0 commit comments