Volumetric performance capture from minimal camera viewpoints

Andrew Gilbert, Marco Volino, John Collomosse, Adrian Hilton; Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 566-581

Abstract


We present a convolutional autoencoder that enables high fidelity volumetric reconstructions of human performance to be captured from multi-view video comprising only a small set of camera views. Our method yields similar end-to-end reconstruction error to that of a probabilistic visual hull computed using significantly more (double or more) viewpoints. We use a deep prior implicitly learned by the autoencoder trained over a dataset of view-ablated multi-view video footage of a wide range of subjects and actions. This opens up the possibility of high-end volumetric performance capture in on-set and prosumer scenarios where time or cost prohibit a high witness camera count.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Gilbert_2018_ECCV,
author = {Gilbert, Andrew and Volino, Marco and Collomosse, John and Hilton, Adrian},
title = {Volumetric performance capture from minimal camera viewpoints},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
month = {September},
year = {2018}
}