Skip to content

Commit 689de38

Browse files
authored
two small changes
1 parent bd15972 commit 689de38

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

18.337 2026 hw4.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,7 @@ C .= A .+ B # 3. benchmark matadd for different n
8282
C = A + B # 4. compare with above — what is the difference in speed and why?
8383
```
8484

85-
To benchmark correctly, use `@belapsed CUDA.@sync mul!(C, A, B)` (and equivalently
85+
To benchmark correctly, load `BenchmarkTools` and use `@belapsed CUDA.@sync mul!(C, A, B)` (and equivalently
8686
for the other operations). Submit your code and a table of the absolute execution time
8787
for matmul and matadd, with and without allocations, as a function of matrix size.
8888

@@ -116,7 +116,7 @@ element-doubling operation. We assign one thread per element. Fill in the blanks
116116

117117
```julia
118118
using KernelAbstractions, CUDA
119-
backend = KernelAbstractions.get_backend(CUDA.zeros(1))
119+
backend = CUDABackend()
120120
elty = Float32
121121
const NUMTHREADSINBLOCK = 64 # threads per CUDA block
122122

0 commit comments

Comments
 (0)