Official inference release for:
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
ICCV 2025
- Multi-view consistent generation from depth-only trajectories.
- Zero-shot inference pipeline built on SDXL + ControlNet.
- Includes ready-to-run sample data in the expected input format.
- Lightweight release: sample data is included in this repository (already filtered by final_idx.npy).
- Full dataset download: Google Drive
code_release/
├── README.md
├── requirements.txt
├── assets/
│ └── media/
│ ├── teaser.gif
│ └── teasercrop.mov
├── scripts/
│ └── run_inference.sh
└── mosaic/
├── data/
│ └── ep*/sp*/
│ ├── depth_raw/
│ ├── position/
│ ├── rotation/
│ ├── gpt_prompt/
│ └── final_idx.npy
└── src/
├── iccv_ours_weight8.py
├── iccv_ours_weight8_pixel.py
├── euler_scheduler.py
├── run_scene_inference.sh # main single-scene inference entrypoint
├── utils.py
└── loss/
conda create -n mosaic python=3.10 -y
conda activate mosaic
pip install --upgrade pip
pip install -r requirements.txt
huggingface-cli login
From the repository root:
bash scripts/run_inference.sh \
--prompt "in van gogh style" \
--data-dir ../data/ep7/sp4 \
--script iccv_ours_weight8.py \
--output-root ../outputs \
    --gpu-id 0
Generated images are saved to:
mosaic/outputs/<script_name>/epX/spY/<prompt>/output_*.png
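Given that layout, gathering the images from one run could look like the following sketch (the helper name and argument names are illustrative, not part of the release):

```python
# Hypothetical helper that collects generated images for one run, following
# the mosaic/outputs/<script_name>/epX/spY/<prompt>/output_*.png layout.
from pathlib import Path


def collect_outputs(output_root, script_name, ep, sp, prompt):
    """Return the generated PNGs for one (script, episode, subpath, prompt) run, sorted by name."""
    run_dir = Path(output_root) / script_name / ep / sp / prompt
    return sorted(run_dir.glob("output_*.png"))
```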
- scripts/run_inference.sh: root-level launcher. It enters mosaic/src/ and calls the main entrypoint.
- mosaic/src/run_scene_inference.sh: main inference runner for one scene folder (epX/spY).
Each scene folder should follow:
<scene_root>/
├── depth_raw/depth_raw_*.npy
├── position/position_*.npy
├── rotation/rotation_*.npy
├── gpt_prompt/gpt_prompt_*.txt
└── final_idx.npy
run_scene_inference.sh validates required subfolders/files before launching inference.
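A Python equivalent of those checks might look like the sketch below (the function name is hypothetical; the required entries are taken from the scene layout above):

```python
# Sketch of the pre-flight validation run_scene_inference.sh performs on a
# scene folder before launching inference (hypothetical Python equivalent).
from pathlib import Path

REQUIRED_DIRS = ["depth_raw", "position", "rotation", "gpt_prompt"]
REQUIRED_FILES = ["final_idx.npy"]


def missing_scene_entries(scene_root):
    """Return the required subfolders/files absent from a scene folder."""
    root = Path(scene_root)
    missing = [d for d in REQUIRED_DIRS if not (root / d).is_dir()]
    missing += [f for f in REQUIRED_FILES if not (root / f).is_file()]
    return missing
```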
Note: the sample data released in this repository has been pre-filtered by final_idx.npy (non-keyframe entries are removed); the full, unfiltered dataset is available from the Google Drive link above.
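If you download the full dataset, you may need to apply the keyframe filter yourself. A minimal sketch, assuming final_idx.npy holds integer positions into each sorted per-modality file list (an assumption of mine, not confirmed by the release):

```python
# Hypothetical keyframe filtering for the full (unfiltered) dataset.
# Assumption: final_idx.npy stores integer positions into each sorted
# per-modality file list, e.g. loaded with numpy:
#   final_idx = np.load(scene_root / "final_idx.npy")


def select_keyframes(sorted_files, final_idx):
    """Keep only the entries of a sorted file list selected by final_idx."""
    return [sorted_files[int(i)] for i in final_idx]
```

The same indices would be applied to depth_raw/, position/, rotation/, and gpt_prompt/ so the four modalities stay aligned.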
@inproceedings{liu2025mosaic,
title={Mosaic: Generating consistent, privacy-preserving scenes from multiple depth views in multi-room environments},
author={Liu, Zhixuan and Zhu, Haokun and Chen, Rui and Francis, Jonathan and Hwang, Soonmin and Zhang, Ji and Oh, Jean},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={27456--27465},
year={2025}
}