PositionIC: Unified Position and Identity Consistency for Image Customization

Hu, Junjie; Han, Tianyang; Ma, Kai; Gao, Jialin; Song, Yang; He, Xianhua; Luo, Junfeng; Wei, Xiaoming; Zhang, Wenqiang

Junjie Hu, Tianyang Han, Kai Ma, Jialin Gao, Yang Song, Xianhua He, Junfeng Luo, Xiaoming Wei, Wenqiang Zhang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026, pp. 9139-9148

Abstract

Recent subject-driven image customization excels in fidelity, yet fine-grained instance-level spatial control remains an elusive challenge, hindering real-world applications. This limitation stems from two factors: a scarcity of scalable, position-annotated datasets, and the entanglement of identity and layout by global attention mechanisms. To this end, we introduce PositionIC, a unified framework for high-fidelity, spatially controllable multi-subject customization. First, we present BMPDS, the first automatic data-synthesis pipeline for position-annotated multi-subject datasets, effectively providing crucial spatial supervision. Second, we design a lightweight, layout-aware diffusion framework that integrates a novel visibility-aware attention mechanism. This mechanism explicitly models spatial relationships via an NeRF-inspired volumetric weight regulation to effectively decouple instance-level spatial embeddings from semantic identity features, enabling precise, occlusion-aware placement of multiple subjects. Extensive experiments demonstrate PositionIC achieves state-of-the-art performance on public benchmarks, setting new records for spatial precision and identity consistency. Our work represents a significant step towards truly controllable, high-fidelity image customization in multi-entity scenarios.Code and data: https://github.com/MeiGen-AI/PositionIC.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Hu_2026_CVPR, author = {Hu, Junjie and Han, Tianyang and Ma, Kai and Gao, Jialin and Song, Yang and He, Xianhua and Luo, Junfeng and Wei, Xiaoming and Zhang, Wenqiang}, title = {PositionIC: Unified Position and Identity Consistency for Image Customization}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2026}, pages = {9139-9148} }