Skip to content
/ holoCu Public

CUDA implementation of "Energy-Efficient Adaptive 3D Sensing" (CVPR 2023)

Notifications You must be signed in to change notification settings

btilmon/holoCu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Energy-Efficient Adaptive 3D Sensing [CVPR 2023]

Brevin Tilmon1, Zhanghao Sun2, Sanjeev Koppal1, Yicheng Wu3,
Georgios Evangelidis3, Ramzi Zahreddine3, Guru Krishnan3, Sizhuo Ma3, Jian Wang3
University of Florida1, Stanford University2, Snap Inc.3

My GIF

This repository contains a CUDA implementation/simulator for the holographic projector introduced in our CVPR 2023 paper "Energy Efficient Adaptive 3D Sensing". The implementation enables real-time sparse hologram generation on embedded NVIDIA GPUs, such as the Tegra X1's Maxwell 256-core GPU, as well as on most other NVIDIA GPUs, through a fused CUDA kernel of Fresnel Holography. We use this code to achieve higher signal-to-noise ratios (SNR) and accuracy in active stereo depth sensing compared to state-of-the-art 3D sensors, such as Episcan3D and Microsoft Kinect/Intel RealSense. Please see the paper, project website, and supplementary for further technical details.

Simulating 3D Sensors with Holography

Our approach is validated in simulation and on a real Holoeye GAEA 1 Spatial Light Modulator using this repository. The 3D sensors shown below are adaptive (our method), line scanning (EpiScan3D), and full frame (Kinect/Intel RealSense), displayed from left to right. The images from top to bottom represent estimated phase, simulated hologram, and the actual hologram from a spatial light modulator. An adaptive projector is used by default, set --projector="line" to use line scanning projector or --projector="fullFrame" for full frame projector instead in <full, headless>.sh.

My GIF

How it Works

Our implementation is a variation of Fresnel Holography pointwise integration. Please see our paper and supplementary for further details.

Dependencies

There are full and headless versions. The full version renders hologram information to either a monitor or SLM using OpenGL-CUDA interop, and the headless version simply renders hologram information to an image with no hardware required. You only need to install OpenGL for the full version.

  • CUDA (Tested on CUDA Toolkit 10.2, CUDA Toolkit includes cuFFT and cuComplex)
  • OpenGL (Only needed for full version.)
    • sudo apt-get install freeglut3-dev
    • sudo apt-get install libglfw3-dev

We tested on:

  • Tegra X1 Maxwell 256-core GPU on NVIDIA Jetson Nano, Ubuntu 18.04 provided from Jetson Nano Developer Kit SD Card Image. We overclocked both CPU and GPU using sudo jetson-clocks.
  • GTX 1660, Ubuntu 18.04

Headless Version

OpenGL is not required for the headless version. The hologram phase and simulated hologram amplitude are saved to an image as data/<projector>.png. After installing CUDA the hologram phase and simulated hologram amplitude can be computed after running:

mkdir build
cd build
cmake -DBUILD_OPENGL=OFF ..
make
cd ../src/
bash headless.sh

It should look like this:

btilmon@linux:~$ bash headless.sh

Running headless version. Phase and simulated hologram will be saved to 'holoCu/data/'.

Sensor Settings: 
----------------
Sensor:              adaptive
N:                   75
Hologram Size (HxW): 1080x1920
SLM Size (HxW):      1080x1920


3D Sensor SNR:
--------------
Adaptive (Ours):               c75
Line Scanning (EpiScan3D):     c7.21688
Full Frame (Kinect/RealSense): c 


Runtimes (ms):
--------------
Phase Computation:    0.375219
Hologram Simulation:  1.02681
Total:                1.40202

Here is an example saved image with the hologram phase on top and simulated hologram amplitude on bottom:

Full Version

OpenGL is required to display hologram information for the full version. If using a phase only spatial light modulator instead of a monitor, or just for visualization/debugging on a monitor, set --display="phase" in full.sh to display the hologram phase instead of the simulated hologram amplitude. Set --display="hologram" to display the simulated hologram for visualization on a monitor. After installing CUDA and OpenGL the selected hologram information should render to screen after running:

mkdir build
cd build
cmake -DBUILD_OPENGL=ON ..
make
cd ../src/
bash full.sh

It should look like this if using a line projector and displaying the simulated hologram. This is with GTX 1660:

btilmon@linux:~$ bash full.sh

Running full version. Either phase or simulated hologram will be displayed on your monitor/SLM.

Sensor Settings: 
----------------
Sensor:              line
N:                   75
Hologram Size (HxW): 1080x1920
SLM Size (HxW):      1080x1920


3D Sensor SNR:
--------------
Adaptive (Ours):               c75
Line Scanning (EpiScan3D):     c7.21688
Full Frame (Kinect/RealSense): c 

HDMI-0
Frame buffer size: 1920 x 1080

GPU Device 0: "Turing" with compute capability 7.5

Runtimes (ms):
--------------
Phase Computation:    0.871509
Hologram Simulation:  1.01298
Render to Screen:     0.805597
Total:                2.69008

See our adaptive active stereo demo at the top of the README for an example of what the full version can do. If just visualizing on a monitor though, here is an example of visualizing the simualted line holograms on a monitor with the full version:

My GIF

Benchmarks

Our fused CUDA implementation of Fresnel Holography (pointwise integration of diffractive patterns for each hologram point) is faster than Fourier Holography (taking the FFT of a desired projector image) for sparse holograms (~500 hologram points on GTX 1660, ~75 on Tegra X1), even when using the heavily optimized cuFFT for Fourier Holography. In the figure, we used GTX 1660, and MPF stands for Mirror Phase Function which is our Fresnel Holography approach, and FFT stands for Fourier Holography.

Citation

If you found our work useful, please cite our paper:

@inproceedings{tilmon2023e3d,
  title     = {Energy-Efficient Adaptive 3D Sensing},
  author    = {Tilmon, Brevin and Sun, Zhanghao and Koppal, Sanjeev and Wu, Yicheng and Evangelidis, Georgios and Zahreddine, Ramzi and Krishnan, Guru and Ma, Sizhuo and Wang, Jian},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  month     = {June},
  year      = {2023},
}

Releases

No releases published

Packages

No packages published