You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For comparisons against other Python libraries, I think it'd make sense to include the assembly time when benchmarking so that the comparison is more apples-to-apples, particularly since assembly can take a significant amount of time when the output is sparse. (Note though that assembly time can be significantly reduced if it is done with the compute step using a fused kernel. With the C++ API at least, you should be able to tell TACO to generate code that simultaneously assembles and computes the output by invoking TensorBase::setAssembleWhileCompute(true) before invoking compile.)
rohany
changed the title
benchmark timing inclusion
taco: ensure assembly is included in timing for all benchmarks
Mar 9, 2021
Should the call to
assemble()
be included in benchmark timing @stephenchouca ?The text was updated successfully, but these errors were encountered: