3DLocator
3DLocator accurately aligns actors on the stage, and even aims to solve unseen poses due to occlusions.
Reconstructing dynamic radiance fields from video clips is challenging, especially when entertainment videos like TV shows are given. Many challenges make the reconstruction difficult due to (1) actors occluding with each other and having diverse facial expressions, (2) cluttered stages, and (3) small baseline views or sudden shot changes. To address these issues, we present ShowMak3r, a comprehensive reconstruction pipeline that allows the editing of scenes like how video clips are made in a production control room. In ShowMak3r, a 3DLocator module locates recovered actors on the stage using depth prior, and estimates unseen human poses via interpolation. The proposed ShotMatcher module then tracks the actors under shot changes. Furthermore, ShowMak3r introduces a face-fitting network that dynamically recovers the actors’ expressions. Experiments on various datasets show that our pipeline can reassemble TV show scenes with new cameras at different timestamps. We also demonstrate that ShowMak3r enables interesting applications such as synthetic shot-making, actor relocation, insertion, deletion, and pose manipulation.
3DLocator accurately aligns actors on the stage, and even aims to solve unseen poses due to occlusions.
ShotMatcher enables continuous tracking of actors under shot-change. It associates actors even when they are not visible in certain shots.
In TV shows, recovering detailed facial expressions are essential for delivering emotion and enhancing realism. We introduce an implicit function-based residual appearance fitting scheme by utilizing a Gaussian deformation network. Instead of moving the Gaussians, we refine colors and opacity to fit the details.
The Big Bang Theory (2007)
Everybody Loves Raymond (1996)
Friends (1994)
Two and a Half Men (2003)
Actor Deletion
Actor Relocation
Actor Insertion
Pose Manipulation
This work was supported by IITP grant (RS-2021-II211343: AI Graduate School Program (Seoul National University) and RS-2023-00227993: Detailed 3D reconstruction for urban areas from unstructured images) and NRF grant (No.2023R1A1C200781211) funded by the Korea government (MSIT). We also thank Daeun Lee for her help with the rebuttal experiments.
@article{kim2025showmak3r,
author = {Kim, Sangmin and Do, Seunguk and Park, Jaesik},
title = {ShowMak3r: Compositional TV Show Reconstruction},
journal = {CVPR},
year = {2025}
}