Batch kernels for forward pass of Preprocessing #2

sandeepnmenon · 2024-04-28T01:05:58Z

Changes

New kernels for batched preprocessCUDA
New API for the package: preprocess_gaussians_batched
New class for rasterization settings: GaussianRasterizerBatches
Test file that compares the batched and non-batched preprocess forward kernels

Results

Tests were run on a V100.

num_gaussians = 1000000
num_batches = 64
SH_ACTIVE_DEGREE = 3

Time taken by test_batched_gaussian_rasterizer: 81.2411 ms
Time taken by test_batched_gaussian_rasterizer_batch_processing: 33.5708 ms
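As a quick sanity check on the two timings above, the ratio comes out to roughly a 2.4x speedup. The snippet below only redoes that arithmetic; the variable names are illustrative, and it assumes the first timing is the baseline that the batch-processing test is compared against.

```python
# Timings reported above (V100, 1,000,000 Gaussians, 64 batches).
# Assumption: the first test is the baseline, the second is the batched kernel path.
baseline_ms = 81.2411          # test_batched_gaussian_rasterizer
batch_processing_ms = 33.5708  # test_batched_gaussian_rasterizer_batch_processing

speedup = baseline_ms / batch_processing_ms
saved_ms = baseline_ms - batch_processing_ms

print(f"speedup: {speedup:.2f}x, saved: {saved_ms:.4f} ms")
```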

prapti19

We get a significant speedup. Looks good.

cuda_rasterizer/rasterizer_impl.cu

TarzanZhao · 2024-05-11T04:04:41Z

rasterization_tests.py

+    elapsed_time_ms = start_event.elapsed_time(end_event)
+    print(f"Time taken by test_batched_gaussian_rasterizer_batch_processing: {elapsed_time_ms:.4f} ms")
+
+    # TODO: make the below work


This should work

for means2D in batched_means2D: means2D.retain_grad()
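For context on the suggestion: PyTorch only populates `.grad` on leaf tensors by default, so a non-leaf tensor needs `retain_grad()` before `backward()` if its gradient is to be inspected. Below is a minimal CPU sketch, assuming `batched_means2D` is a list of per-camera tensors. The shapes and the way the tensors are produced are made up for illustration and are not taken from the test file.

```python
import torch

# Hypothetical stand-in for the per-camera screen-space means used in the test.
means3D = torch.randn(8, 3, requires_grad=True)
batched_means2D = [means3D[:, :2] * (i + 1) for i in range(4)]  # non-leaf tensors

# Ask autograd to keep the gradient on each non-leaf tensor.
for means2D in batched_means2D:
    means2D.retain_grad()

# Toy loss: the gradient with respect to every entry of each means2D is 1.
loss = sum(m.sum() for m in batched_means2D)
loss.backward()

grads = [m.grad for m in batched_means2D]
```

Without the `retain_grad()` loop, each `m.grad` in this sketch would be `None` after `backward()`, while `means3D.grad` would still be populated.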

rasterization_tests.py

TarzanZhao

I've reviewed the code, and I think it's good; it can be merged.

TarzanZhao · 2024-05-11T04:23:07Z

rasterization_tests.py

+        viewpoint_camera.image_width = 512
+        viewpoint_camera.world_view_transform = torch.eye(4).cuda()
+        viewpoint_camera.full_proj_transform = torch.eye(4).cuda()
+        viewpoint_camera.camera_center = torch.zeros(3).cuda()


Let's change torch.zeros to non-zero values and test it in tomorrow's meeting.

Fix the bug for non-zero camera centers and test with non-identity view transforms.

Tested both cases. I will check those changes in with the backward kernel PR #3.
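When replacing the identity transform and zero camera center in a test, the two values should stay consistent with each other. The sketch below shows one way to derive a camera center from a non-identity world-to-view matrix. It assumes a column-vector convention (x_view = R x_world + t), which may differ from the layout the rasterizer expects (for example, a transposed matrix); `make_world_view` and `camera_center` are hypothetical helpers, not functions from this repository.

```python
import math

def make_world_view(yaw_rad, translation):
    """4x4 world-to-view matrix: rotation about the Y axis plus a translation."""
    c, s = math.cos(yaw_rad), math.sin(yaw_rad)
    tx, ty, tz = translation
    return [
        [c,   0.0, s,   tx],
        [0.0, 1.0, 0.0, ty],
        [-s,  0.0, c,   tz],
        [0.0, 0.0, 0.0, 1.0],
    ]

def camera_center(world_view):
    """Camera position in world space: the point mapped to the view origin, -R^T t."""
    R = [row[:3] for row in world_view[:3]]
    t = [row[3] for row in world_view[:3]]
    return [-sum(R[k][i] * t[k] for k in range(3)) for i in range(3)]

world_view = make_world_view(math.pi / 2, (0.0, 0.0, 5.0))
center = camera_center(world_view)  # approximately (5, 0, 0)
```

Using a pair like this in both the batched and non-batched tests exercises the code paths that an identity matrix and a zero center leave untouched.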

sandeepnmenon and others added 30 commits April 20, 2024 19:17

test function for rasterszaton tests

6aed776

add mock of improved preproc

b7b08ba

batched rasterization

54302e4

Merge branch 'prapti/preproc_gauss' of github.com:TarzanZhao/diff-gaussian-rasterization into mlsys/batched_preprocess

a5e505a

add rough idea for kernel

53e12d2

Refactor rasterizer import in rasterization_tests.py

f0e0469

Refactor GaussianRasterizerBatches class to support batched preprocess_gaussians function.

a16acd0

Merge branch 'prapti/preproc_gauss' of github.com:TarzanZhao/diff-gaussian-rasterization into mlsys/batched_preprocess

aa07ce2

Refactor preprocess_gaussians function to remove flag_batched parameter in __init__.py

268f46a

batched forward pass kernel

7361323

added headers and changed kernel structure to 1d block

7f4935d

solved syntax errors

5f05af5

fixed import syntax in test

543d4b8

formatting changes

8ca5a9f

Refactor GaussianRasterizerBatches class to use torch.tensor instead of math.tan in test_batched_gaussian_rasterizer_batch_processing function

0dbe8fd

Refactor variable name in test_batched_gaussian_rasterizer_batch_processing function

193fa82

Refactor preprocess_gaussians function to handle batched and non-batched inputs in __init__.py

ac43fc4

Refactor test_batched_gaussian_rasterizer and test_batched_gaussian_rasterizer_batch_processing functions in rasterization_tests.py

4115266

Refactor test_batched_gaussian_rasterizer and test_batched_gaussian_rasterizer_batch_processing functions in rasterization_tests.py

162e7d0

add parity test

fdf3bf5

Refactor preprocess_gaussians function to handle batched and non-batched inputs in __init__.py

cace4fd

Merge branch 'prapti_mlsys/batched_preprocess' of github.com:TarzanZhao/diff-gaussian-rasterization into mlsys/batched_preprocess

7cad1b0

Refactor test_batched_gaussian_rasterizer and test_batched_gaussian_rasterizer_batch_processing functions in rasterization_tests.py

eaf0d42

Refactor test_batched_gaussian_rasterizer and test_batched_gaussian_rasterizer_batch_processing functions in rasterization_tests.py

d9eb4e8

add debug flag to extra_compile_args

c38cfa9

Refactor tan_fovy parameter to be const in CUDA rasterizer files

24905aa

Refactor tan_fovy parameter to be const in CUDA rasterizer files

d376d41

Refactor tan_fovy parameter to be const in CUDA rasterizer files

8c82fa7

Refactor CUDA rasterizer files to use CUDA tensors for batched calculations

4cca118

Refactor test_batched_gaussian_rasterizer and test_batched_gaussian_rasterizer_batch_processing functions in rasterization_tests.py

34ebced

sandeepnmenon and others added 16 commits April 26, 2024 14:05

Refactor compare_tensors function in rasterization_tests.py to handle non-matching values

b3ad196

Fix indexing bug in preprocessCUDABatched function

abfb8b4

Update rasterization_tests.py

18f9c20

Refactor compare_tensors function in rasterization_tests.py to handle non-matching values

a04b34d

Refactor compare_tensors function to fix indexing bug and handle non-matching values

e593132

Update forward.cu

e109969

Update rasterization_tests.py

edfea2e

Update forward.cu

53a14e2

fixed sh_sdegree

32a601f

merged commits

147f71d

Refactor GaussianRasterizationSettings class to handle raster_settings as a batch

22eb043

Refactor rasterization_tests.py to use raster_settings_batch instead of batched_raster_settings

7ff2fd3

fixed namedtuple setting bug

fc48eec

Refactor GaussianRasterizationSettings class to handle raster_settings as a batch

49c5179

Update setup.py to remove debug flag from extra_compile_args

a0d7127

Fix formatting issues in forward.cu and __init__.py

a21c4b9

sandeepnmenon requested review from prapti19 and TarzanZhao April 28, 2024 01:05

Refactor computeColorFromSH function in forward.cu to use point_idx and result_idx instead of only idx.

25c6812

prapti19 approved these changes Apr 29, 2024

sandeepnmenon mentioned this pull request Apr 30, 2024

Batch kernels for backward pass of Preprocessing #3

Open

sandeepnmenon added 3 commits May 7, 2024 23:33

replaced python time with torch event records

1b7fdc4

fixed cuda illegal memory bug and can run for 1M gaussians

3c4c667

chore: Update .gitignore to ignore *.pyc files

1e4cbc9

TarzanZhao reviewed May 11, 2024

cuda_rasterizer/rasterizer_impl.cu (resolved)

TarzanZhao reviewed May 11, 2024

rasterization_tests.py (resolved)

TarzanZhao approved these changes May 11, 2024

TarzanZhao reviewed May 11, 2024

sandeepnmenon merged commit 13e4cb0 into dist May 11, 2024
