CS 180: Intro to Computer Vision and Computational Photography, Fall 2024
Final Project: Video Magnification
Aishik Bhattacharyya & Kaitlyn Chen
Overview
The goal of Video Magnification is to emphasize certain visual aspects of a video such as blood flow. To amplify these parts, which are not easy to see with just the human eye, we examine changes at particular temporal frequencies. These frequencies correspond to a certain beats per minute. First, we build the laplacian pyramid as we did in a previous project. We first calculate the gaussian pyramid, downsampling each level. Then, to create the laplacian pyramid, we upsample and subtract consecutive gaussian levels. Next, we create the temporal filter. On every level of the laplacian stack, we convert to the frequency domain using FFT, apply a bandpass filter to its frequencies, convert it back to the time domain, amplify it and add it back to the original value. Finally, we reconstruct the image by looping over every frame, pulling its respective values from the laplacian stack and summing up its upsampled results. We use cv2 to read and write our videos.