Reproduce olmov2 SFT

## Description

Match Olmo v2 SFT on core evals (MMLU) and instruction following (alpacaeval)

## Hypothesis or Goal

Trying to match the perf of Olmo v2 SFT given this [dataset](https://huggingface.co/datasets/allenai/tulu-3-sft-mixture)


### Links


* WandB Report:  ([link](https://wandb.ai/marin-community/marin/runs/marin_olmo_tulu_sft_v3-acca67))
* Data Browser: ([link](https://marlin-subtle-barnacle.ngrok-free.app/view?paths=%5B%22gs://marin-us-central2/experiments/606_sft-310a26.json%22%5D))
* Experiment JSON: (link)
* (etc.)



## Results

(What did you find, including relevant evaluation metrics, etc.)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproduce olmov2 SFT #606

Description

Hypothesis or Goal

Links

Results

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Reproduce olmov2 SFT #606

Description

Description

Hypothesis or Goal

Links

Results

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions