You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+10-6Lines changed: 10 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -143,6 +143,7 @@ scripts/download_data.sh
143
143
To ensure a smooth training process, please pay close attention to the following configuration requirements in your `lightningrl.yaml` file:
144
144
145
145
#### 1. Experiment Project Name
146
+
146
147
The `project` field within the `experiment` block must match the filename of your configuration file:
147
148
148
149
```yaml
@@ -151,11 +152,12 @@ experiment:
151
152
```
152
153
153
154
#### 2. Model Paths
155
+
154
156
The `model` block requires absolute paths for model checkpoints to ensure proper loading:
155
157
156
-
* **`pretrained_model`**: Must be set to the **absolute path** of your pre-trained model.
157
-
* **`value_base_model`**: This field is associated with `use_value_model` in the `training` block. If `use_value_model` is set to `True`, this field must be populated.
158
-
* *Note:* Currently, the value model does not actively participate in training; it is provided as an optional component to facilitate experimentation and reproducibility.
158
+
* **`pretrained_model`**: Must be set to the **absolute path** of your pre-trained model.
159
+
* **`value_base_model`**: This field is associated with `use_value_model` in the `training` block. If `use_value_model` is set to `True`, this field must be populated.
160
+
* *Note:* Currently, the value model does not actively participate in training; it is provided as an optional component to facilitate experimentation and reproducibility.
159
161
160
162
```yaml
161
163
model:
@@ -164,6 +166,7 @@ model:
164
166
```
165
167
166
168
#### 3. Training Controls
169
+
167
170
Ensure your training flags are configured correctly:
168
171
169
172
```yaml
@@ -191,10 +194,11 @@ For any issues or inquiries, please feel free to open an issue in this repositor
191
194
If you find our work helpful, please consider citing:
192
195
193
196
```bibtex
194
-
@inproceedings{hu2026lightningrl,
197
+
@article{hu2026lightningrl,
195
198
title={LightningRL: Breaking the Accuracy--Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning},
196
199
author={Hu, Yanzhe and Jin, Yijie and Liu, Pengfei and Yu, Kai and Deng, Zhijie},
197
-
booktitle={Proceedings of the International Conference on Machine Learning (ICML)},
0 commit comments