Skip to content

update visualized.ipynb#516

Merged
kennymckormick merged 2 commits intoopen-compass:mainfrom
Guojiacheng2017:main
Oct 23, 2024
Merged

update visualized.ipynb#516
kennymckormick merged 2 commits intoopen-compass:mainfrom
Guojiacheng2017:main

Conversation

@Guojiacheng2017
Copy link
Copy Markdown
Contributor

  1. update the model name in model2vis list;
  2. solve the problem that some benchmark score is too high and out of range;
  3. solve the problem that some model lack the MMBench_TEST_EN score;

1. solve the problem that some benchmark score is too high and out of range;
2. solve the problem that some model lack the evaluation of MMBench_TEST_EN;
@kennymckormick kennymckormick merged commit db31bbb into open-compass:main Oct 23, 2024
kennymckormick pushed a commit to white2018/VLMEvalKit that referenced this pull request Nov 1, 2024
* Update visualize.ipynb

1. solve the problem that some benchmark score is too high and out of range;
2. solve the problem that some model lack the evaluation of MMBench_TEST_EN;

* * visualized.ipynb
kushal-tri pushed a commit to kushal-tri/VLMEvalKit that referenced this pull request Nov 22, 2024
* Update visualize.ipynb

1. solve the problem that some benchmark score is too high and out of range;
2. solve the problem that some model lack the evaluation of MMBench_TEST_EN;

* * visualized.ipynb
Mercury7353 pushed a commit to Mercury7353/VLMEvalKit that referenced this pull request Apr 28, 2025
* Update visualize.ipynb

1. solve the problem that some benchmark score is too high and out of range;
2. solve the problem that some model lack the evaluation of MMBench_TEST_EN;

* * visualized.ipynb
Koii2k3 pushed a commit to wjnwjn59/VLMEvalKit that referenced this pull request Nov 13, 2025
* Update visualize.ipynb

1. solve the problem that some benchmark score is too high and out of range;
2. solve the problem that some model lack the evaluation of MMBench_TEST_EN;

* * visualized.ipynb
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants