Skip to content

Bili-Sakura/Visual-Generative-Foundation-Model-Collection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

62 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Visual-Generative-Foundation-Model-Collection

๐Ÿค— Collection

Important

We only hold the core publicly available ones pre-trained on ImageNet.

A selected collection of CORE visual generative foundation model including code, paper, checkpoint etc.

TODO: @src/diffusers Models

  • PixelDiT
  • JLT
  • JiT
  • NiT
  • PixNerd
  • PixelFlow
  • SiT
  • ADM
  • DDT
  • DeCo
  • DiM
  • Diffusion-RWKV
  • DiT
  • DiT-MoE
  • EPG
  • EDM2
  • FD-Loss
  • FiT
  • FiTv2
  • iMF
  • LightningDiT
  • LiT
  • RiT
  • MDT
  • MDTv2
  • PAE
  • ProMoE
  • pMF
  • RAE
  • RAEv2
  • PixelREPA
  • REPA
  • REPA-E
  • Self-Flow
  • USP

Update the checklist as new models are added or completed.

Benchmarks

Note

โ€  DiT-MoE uses additional synthetic training data generated by FLUX and SD3.

FID and IS are evaluated on 50k samples, reported with CFG if applicable. ร—2 in NFEs indicates that CFG doubles NFEs at inference time.

ImageNet-256

Model NFE #Param GFLOPs FID IS Precision Recall Code Paper Model
Pixel modeling
ADM-G 250ร—2 4.59 0.82 0.52 Official Code Paper ๐Ÿค— Model
PixelFlow 677M 1.98 282.1 0.81 0.60 Official Code Paper ๐Ÿค— Model
JiT-H/16 100ร—2 953M 182 1.86 303.4 Official Code Paper ๐Ÿค— Model
PixelREPA-H/16 953M 182 1.81 317.2 Official Code Paper ๐Ÿค— Model
EPG 1 1.58 Official Code Paper ๐Ÿค— Model
PixNerd-XL/16 700M 134 1.93 297 Official Code Paper ๐Ÿค— Model
DeCo-XL/16 682M 1.62 301 0.80 0.62 Official Code Paper ๐Ÿค— Model
pMF-H/16 1 956M 271 2.22 268.8 Official Code Paper ๐Ÿค— Model
Latent modeling
DiT-XL/2 250ร—2 675M 119 2.27 278.24 0.83 0.57 Official Code Paper ๐Ÿค— Model
DiT-MoE-XL/2-8E2Aโ€  4.1B 323.74 1.72 315.73 0.83 0.64 Official Code Paper ๐Ÿค— Model
DiffuSSM-XL-G 673M 2.28 259.13 0.86 0.56 Official Code Paper ๐Ÿค— Model
MDT-XL/2 676M 119 1.79 283.01 0.81 0.61 Official Code Paper ๐Ÿค— Model
MDTv2-XL/2 676M 119 1.58 314.73 0.79 0.65 Official Code Paper ๐Ÿค— Model
FiT-XL/2 824M 153 4.21 254.87 0.84 0.51 Official Code Paper ๐Ÿค— Model
SiT-XL/2 250ร—2 675M 119 2.06 277.50 0.83 0.59 Official Code Paper ๐Ÿค— Model
SiT-XL/2 + REPA 250ร—2 675M 119 1.42 305.7 0.80 0.65 Official Code Paper ๐Ÿค— Model
SiT-XL/2 + USP 675M 119 7.35 128.50 Official Code Paper ๐Ÿค— Model
Self-Flow-XL/2 675M 119 5.70 151.40 0.72 0.67 Official Code Paper ๐Ÿค— Model
FiTv2-XL/2 671M 147 2.26 260.95 0.81 0.59 Official Code Paper ๐Ÿค— Model
LightningDiT-XL/2 724M 119 1.35 295.3 Official Code Paper ๐Ÿค— Model
iMF-XL/2 1 610M 175 1.72 282.0 Official Code Paper ๐Ÿค— Model
LiT-XL/2-G 675M 2.32 265.20 0.82 0.57 Official Code Paper ๐Ÿค— Model
RiT Paper ๐Ÿค— Model
SiT-XL/2 + REG 677M 119 1.36 299.4 0.77 0.66 Official Code Paper ๐Ÿค— Model
DDT-XL/2 724M 119 1.26 310.6 Official Code Paper ๐Ÿค— Model
DRWKV-H/2 779M 34.95 2.16 275.36 0.83 0.58 Official Code Paper ๐Ÿค— Model
NiT-XL 675M 119 2.03 265.26 Official Code Paper ๐Ÿค— Model
ProMoE-XL-Flow 1.568B 2.59 265.62 Official Code Paper ๐Ÿค— Model
RAE, DiT-DH-XL/2 50ร—2 1254M 146 1.13 262.6 Official Code Paper ๐Ÿค— Model

ImageNet-512

Model NFE #Param GFLOPs FID IS Precision Recall Code Paper Model
Pixel modeling
ADM-G 250ร—2 7.72 0.87 0.42 Official Code Paper ๐Ÿค— Model
JiT-H/32 100ร—2 956M 183 1.94 309.1 Official Code Paper ๐Ÿค— Model
EPG 2.35 Official Code Paper ๐Ÿค— Model
PixNerd-XL/16 700M 583 2.84 245.6 Official Code Paper ๐Ÿค— Model
DeCo-XL/16 682M 2.22 290.0 0.80 0.60 Official Code Paper ๐Ÿค— Model
pMF-H/32 1 959M 272 2.48 284.9 Official Code Paper ๐Ÿค— Model
Latent modeling
DiT-XL/2 250ร—2 675M 525 3.04 240.82 0.84 0.54 Official Code Paper ๐Ÿค— Model
DiT-MoE-XL/2-8E2Aโ€  4.1B 2.30 298.35 0.85 0.57 Official Code Paper ๐Ÿค— Model
DiffuSSM-XL-G 673M 3.41 255.06 0.85 0.49 Official Code Paper ๐Ÿค— Model
EDM2-XXL 1523M 552 1.81 Official Code Paper ๐Ÿค— Model
SiT-XL/2 250ร—2 675M 525 2.62 252.21 0.84 0.57 Official Code Paper ๐Ÿค— Model
FiTv2-XL/2 671M 525 2.90 263.11 0.83 0.53 Official Code Paper ๐Ÿค— Model
LiT-XL/2-G 675M 3.69 207.97 0.85 0.53 Official Code Paper ๐Ÿค— Model
DDT-XL/2 724M 525 1.28 305.1 Official Code Paper ๐Ÿค— Model
DRWKV-H/2 779M 2.95 265.20 0.84 0.54 Official Code Paper ๐Ÿค— Model
NiT-XL 675M 525 1.45 272.77 Official Code Paper ๐Ÿค— Model
RAE, DiT-DH-XL/2 50ร—2 1254M 642 1.13 259.6 Official Code Paper ๐Ÿค— Model

About

A collection of visual generative foundation model including code, paper, checkpoint etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

โšก