TSA2: Temporal Segment Adaptation and Aggregation for Video Harmonization

Zeyu Xiao, Yurui Zhu, Xueyang Fu, Zhiwei Xiong; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 4136-4145

Abstract


Video composition merges the foreground and background of different videos, presenting challenges due to variations in capture conditions (e.g., saturation, brightness, and contrast). Video harmonization is a vital process in achieving a realistic composite by seamlessly adjusting the foreground's appearance to match the background. In this paper, we propose TSA2, a novel method for video harmonization that incorporates temporal segment adaptation and aggregation. TSA2 divides the inharmonious input sequence into temporal segments, each corresponding to a different frame rate, allowing effective utilization of complementary information within each segment. The method includes the Temporal Segment Adaptation module, which learns and remaps the distribution difference between background and foreground regions, and the Temporal Segment Aggregation module, which emphasizes and aggregates cross-segment information through element-wise correlations. Experimental results demonstrate that TSA2 outperforms advanced image and video harmonization methods quantitatively and qualitatively.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Xiao_2024_WACV, author = {Xiao, Zeyu and Zhu, Yurui and Fu, Xueyang and Xiong, Zhiwei}, title = {TSA2: Temporal Segment Adaptation and Aggregation for Video Harmonization}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2024}, pages = {4136-4145} }