fix: AOT compiler flags on non-sm90 (#522)
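Previously, non-sm90 cards were incorrectly compiled as sm90 because the
compiler flags were shallow-copied: dict.copy() shares the inner "nvcc" list,
so extending the sm90 flags also modified the original flags.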
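
The aliasing behind this bug can be reproduced in isolation. The following is a minimal sketch, not the actual setup.py: the `make_base_flags` helper and the `-O3` values are hypothetical placeholders, and only `-use_fast_math` and the `-gencode` argument are taken from the diff.

```python
import copy


def make_base_flags():
    # Hypothetical stand-in for extra_compile_args; the real dict in setup.py is longer.
    return {"cxx": ["-O3"], "nvcc": ["-O3", "-use_fast_math"]}


SM90_FLAGS = "-gencode arch=compute_90a,code=sm_90a".split()

# Shallow copy: a new outer dict, but the "nvcc" list object is shared.
base = make_base_flags()
shallow = base.copy()
shallow["nvcc"].extend(SM90_FLAGS)
shallow_leaked = "arch=compute_90a,code=sm_90a" in base["nvcc"]

# Deep copy: the inner lists are duplicated, so the base flags stay untouched.
base = make_base_flags()
deep = copy.deepcopy(base)
deep["nvcc"].extend(SM90_FLAGS)
deep_leaked = "arch=compute_90a,code=sm_90a" in base["nvcc"]

print("shallow copy leaked sm90 flags into base:", shallow_leaked)
print("deep copy leaked sm90 flags into base:", deep_leaked)
```

With the shallow copy the sm_90a flag ends up in the base flags as well. With `copy.deepcopy` it stays confined to the sm90 variant.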
abcdabcd987 authored Oct 10, 2024
1 parent c6d0f65 commit 0aa4726
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion flashinfer-aot/setup.py
@@ -16,6 +16,7 @@

from typing import List, Tuple

+import copy
import pathlib
import os
import re
@@ -372,7 +373,7 @@ def __init__(self, *args, **kwargs) -> None:
"-use_fast_math",
],
}
-extra_compile_args_sm90 = extra_compile_args.copy()
+extra_compile_args_sm90 = copy.deepcopy(extra_compile_args)
extra_compile_args_sm90["nvcc"].extend(
"-gencode arch=compute_90a,code=sm_90a".split()
)