
Commit 6beaa25

committed changes
1 parent b3d76cc commit 6beaa25

2 files changed: 18 additions & 6 deletions


Lines changed: 9 additions & 1 deletion
@@ -1,3 +1,11 @@
 Run a series of experiments sequentially as an autonomous machine learning researcher. Start with learning rates of 1, then 0.5, then 0.1, and so on. The idea is to find the largest learning rate that doesn't lead to wild oscillations in validation loss. So keep watching the Trackio Alerts. If you see instability, then just terminate the job and lower the learning rate, and keep going until you have stable training.
 
-Run the train_nanogpt.py script using Hugging Face Jobs, using my locally logged in Hugging Face token.
+Run the train_nanogpt.py script using Hugging Face Jobs, using my locally logged in Hugging Face token, like this:
+
+hf jobs uv run \
+    --flavor a100-large \
+    --timeout 10m \
+    --secrets HF_TOKEN \
+    --with torch \
+    --with numpy \
+    train_nanogpt.py
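The search loop the prompt describes (try descending learning rates, terminate any run whose validation loss oscillates, stop at the first stable one) can be sketched in Python. This is a minimal illustration, not part of the commit; `run_is_stable` is a hypothetical stand-in for launching a Jobs run and watching the Trackio alerts:

```python
def find_stable_lr(candidates, run_is_stable):
    """Return the largest learning rate in `candidates` that trains stably.

    candidates: learning rates in descending order (e.g. 1, 0.5, 0.1, ...).
    run_is_stable: callable taking a learning rate and returning True when
        the run's validation loss stayed stable (no wild oscillations).
    Returns None if every candidate is unstable.
    """
    for lr in candidates:
        if run_is_stable(lr):
            return lr
        # Instability observed: the real workflow would terminate the
        # Jobs run here, then retry with the next (lower) rate.
    return None


# Toy stand-in for a real run: pretend rates above 0.5 oscillate.
print(find_stable_lr([1.0, 0.5, 0.1], lambda lr: lr <= 0.5))  # → 0.5
```

Because the candidates are tried largest-first, the first stable run is also the largest stable rate, which is exactly the stopping condition the prompt asks for.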

autonomous-experiments/01_finding_the_best_learning_rate/train_nanogpt.py

Lines changed: 9 additions & 5 deletions
@@ -22,11 +22,15 @@
 Data: FineWeb (pre-tokenized with GPT-2 tokenizer, auto-downloaded from HF Hub).
 Downloads ~1.8GB for 9 training shards (~900M tokens) + validation.
 
-Examples:
-    python train_nanogpt.py                    # default: Muon + compile
-    python train_nanogpt.py --optimizer adamw  # compare with AdamW
-    python train_nanogpt.py --max_steps 10000  # train longer
-    python train_nanogpt.py --batch_size 32 --no_compile  # debug without compile
+Run with Hugging Face Jobs like this:
+
+hf jobs uv run \
+    --flavor a100-large \
+    --timeout 10m \
+    --secrets HF_TOKEN \
+    --with torch \
+    --with numpy \
+    train_nanogpt.py
 """
 
 import glob
