HNNewShowAskJobs
Built with Tanstack Start
FP8 is ~100 tflops faster when the kernel name has "cutlass" in it(twitter.com)
223 points by limoce 15 hours ago | discuss