Skip to content

Experiment: Tootsie Phoenix Cooldown (sensible-starling) #977

@dlwh

Description

@dlwh

Description

Code-Name: sensible-starling

It's time to cool down Tootsie Phoenix. Based on #847 we're going to use our datasets as well as nemotron and dolmino.

I'm going to throw in zloss and possibly megamath, depending.

Hypothesis or Goal

MMLU to the moon (and everything else)

Links

(Delete any that aren't applicable)

Results

(What did you find, including relevant evaluation metrics, etc.)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions