You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CLAUDE.md
+16-6Lines changed: 16 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,9 +15,9 @@ BalatroBench is a static web application that displays performance leaderboards
15
15
-**about.html**: About page with project information and metrics documentation
16
16
-**script.js**: Shared JavaScript for all pages - fetches and renders leaderboard data, with interactive expandable rows showing detailed charts and statistics
17
17
-**data/**: Contains benchmark results organized by version, strategy, and data type
18
-
-`data/benchmarks/v0.8.0/default/leaderboard.json`: Primary model leaderboard data
19
-
-`data/community/v0.8.0/default/leaderboard.json`: Community strategy leaderboard data
20
-
- Individual model result files in vendor subdirectories (e.g., `openai/gpt-oss-120b.json`)
18
+
-`data/benchmarks/v0.8.1/default/leaderboard.json`: Primary model leaderboard data
19
+
-`data/community/v0.8.1/default/leaderboard.json`: Community strategy leaderboard data
20
+
- Individual model result files in vendor subdirectories (e.g., `openai/gpt-5.json`, `google/gemini-2.5-pro.json`, `anthropic/claude-sonnet-4.json`)
21
21
22
22
### Data Structure
23
23
@@ -39,6 +39,8 @@ Models are identified by `vendor/model` format and ranked by performance metrics
39
39
- Provider usage pie chart
40
40
- Complete per-game statistics table
41
41
- Total aggregated metrics (tokens, costs, time)
42
+
-**Bar Charts**: Performance visualization showing average rounds reached across models
43
+
-**Footer**: Clickable footer component with project information and navigation
42
44
-**Responsive Design**: Columns hide/show based on screen size
43
45
-**Dual Display Modes**: Support for both model-based and community strategy leaderboards
44
46
@@ -56,9 +58,16 @@ python3 -m http.server 8000
56
58
### Dependencies
57
59
58
60
-**Tailwind CSS**: Styling framework loaded from CDN
59
-
-**Chart.js**: Charting library for histogramsand pie charts
61
+
-**Chart.js**: Charting library for histograms, pie charts, and bar charts
60
62
-**Heroicons**: Icon library (included but minimal usage in current implementation)
61
63
64
+
### Current Models
65
+
66
+
The leaderboard includes performance data for models from multiple vendors:
67
+
-**OpenAI**: gpt-5, gpt-oss-120b, gpt-oss-20b
68
+
-**Google**: gemini-2.5-pro
69
+
-**Anthropic**: claude-sonnet-4
70
+
62
71
### File Structure Conventions
63
72
64
73
- All files use UTF-8 encoding with LF line endings
0 commit comments