Skip to content

Latest commit

 

History

History
324 lines (193 loc) · 22.8 KB

File metadata and controls

324 lines (193 loc) · 22.8 KB

💡🎞️ Awesome_Image_Generation_with_Thinking

Logo

Image Generation with Thinking.

Awesome License: MIT

Welcome to the Awesome-Image-Generation-with-Thinking repository! This repository represents a comprehensive collection of research focused on empowering models to think during image generation. We explore current works and summarize them into three approaches: explicit reflection, reinforcement learning, and unified multimodal models.


🔔 News

  • [2025-06] We created this repository to maintain a paper list on Awesome-Image-Generation-With-Thinking. Contributions are welcome!

📜 Table of Contents


📖 Survey

🧠 Reinforcement Learning

Reinforcement learning has been proven to be a crucial step in enhancing reasoning capabilities. Here, we summarize methods that utilize reinforcement learning, such as GRPO, into image generation process.

🗒️ Explicit Reflection

Reflection is an essantial step in thinking processes. Explicit reflection, which leverages modalities such as text, object coordinates, and image with editing instructions, is a typical approach.


🚀 Unified LMMs

Unified LMMs inherently excel at text-to-image controllability, hence we collect a list of relevant works.


📚 Benchmarks

Essential resources for understanding the broader landscape and evaluating progress in visual reasoning.

Star History

Star History Chart