Add Janus-1.3B by hills-code · Pull Request #541 · open-compass/VLMEvalKit

hills-code · 2024-10-23T05:50:14Z

This PR adds the Janus-1.3B model so that the results reported in the paper can be reproduced.

Source	MME	MMB (w/o circular)	SEED	MMMU_DEV_VAL	MM-Vet	POPE
VLMEvalKit (reproduced)	1342.4	69.8	63.8	31.2	36.8	85.5 (overall), 87.1 (random)
Paper	1338.0	69.4	63.7	30.5	34.3	87 (random)
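To make the gap between the two rows easier to read, here is a minimal sketch that subtracts the paper's scores from the reproduced ones. The numbers are copied from the table above; the dictionary keys and the `score_deltas` helper are illustrative names and not part of VLMEvalKit. POPE is compared on the "random" split only, since that is the only POPE figure listed for the paper.

```python
# Scores copied from the table above. POPE uses the "random" split,
# the only POPE number given for the paper.
reproduced = {
    "MME": 1342.4,
    "MMB (w/o circular)": 69.8,
    "SEED": 63.8,
    "MMMU_DEV_VAL": 31.2,
    "MM-Vet": 36.8,
    "POPE (random)": 87.1,
}
paper = {
    "MME": 1338.0,
    "MMB (w/o circular)": 69.4,
    "SEED": 63.7,
    "MMMU_DEV_VAL": 30.5,
    "MM-Vet": 34.3,
    "POPE (random)": 87.0,
}


def score_deltas(reproduced, paper):
    """Return reproduced minus paper for each benchmark, rounded to 0.1."""
    return {name: round(reproduced[name] - paper[name], 1) for name in paper}


deltas = score_deltas(reproduced, paper)
for name, delta in deltas.items():
    print(f"{name:>20}: {delta:+.1f}")
```

This only reports raw differences; it says nothing about whether a gap is within run-to-run variance.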

You can run the evaluation with the following command:

torchrun --nproc-per-node=8 run.py --data POPE MMMU_DEV_VAL MMBench_DEV_EN MME SEEDBench_IMG MMVet --model janus_1.3b --verbose

Note:

We evaluate MMBench without circular mode. To reproduce this, set circular=False in vlmeval/dataset/image_mcq.py.
For MM-Vet, we use the official evaluation with the GPT-4 evaluator.
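For readers unfamiliar with the distinction, the sketch below illustrates why circular and non-circular MMBench scores are not comparable. It assumes the usual description of circular evaluation: each multiple-choice question is asked several times with its options rotated, and it counts as correct only if every rotated copy is answered correctly. The `records` layout and both function names are hypothetical, and this is not VLMEvalKit's implementation.

```python
def vanilla_accuracy(records):
    """Score each question once, using only its original option order."""
    return sum(r["hits"][0] for r in records) / len(records)


def circular_accuracy(records):
    """Count a question only if all of its rotated copies are correct."""
    return sum(all(r["hits"]) for r in records) / len(records)


# Hypothetical per-question results: hits[i] says whether the model was
# correct on the i-th rotation of the options (index 0 = original order).
records = [
    {"hits": [True, True, True, True]},
    {"hits": [True, False, True, True]},
    {"hits": [False, False, False, False]},
    {"hits": [True, True, True, False]},
]

vanilla = vanilla_accuracy(records)
circular = circular_accuracy(records)
print(f"vanilla:  {vanilla:.2f}")
print(f"circular: {circular:.2f}")
```

Because the circular score can never exceed the vanilla one on the same predictions, a number obtained with one setting should only be compared with numbers obtained under the same setting.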

* add janus eval
* update
* [Fix] Fix Lint

Co-authored-by: wuchengyue <hillwu@deepseek.com>
Co-authored-by: kennymckormick <dhd.efz@gmail.com>

wusize · 2025-02-11T03:36:59Z

Hi! There are two splits in MMMU_DEV_VAL, i.e., dev and validation. May I know which one the numbers in the table correspond to?

charlesCXK · 2025-03-24T04:30:26Z

Hi, the numbers are from the validation (VAL) split.


wuchengyue added 2 commits October 23, 2024 11:25

add janus eval

eb1eca9

update

0dfa9ab

charlesCXK mentioned this pull request Oct 23, 2024

Test accuracy does not match deepseek-ai/Janus#15

Closed

kennymckormick approved these changes Oct 23, 2024


[Fix] Fix Lint

10444ad

kennymckormick merged commit f4646f7 into open-compass:main Oct 23, 2024

kennymckormick merged 3 commits into open-compass:main from hills-code:janus
