@@ -40,7 +40,7 @@ Tensor cores are so fast that the memory is bottlenecking them. All of the share
-To see how the GPU folk in datacenters live, I booted up a vast ai instance and ran the same matmul, but with cutlass kernels for `sm_100a`.
+To see how the GPU folk with datacenters live, I booted up a vast ai instance and ran the same matmul, but with cutlass kernels for `sm_100a`.