Blog posts

2025

Notes on CUTLASS DSL (CuTeDSL)

31 minute read

Published:

Basic, introductory notes on CuTeDSL, a domain-specific language for CUDA programming that allows users to write CUDA kernels in Python conveniently.

Linear Layouts in Triton

24 minute read

Published:

Notes on linear layouts in Triton and its conversion with various traditional layout types.

UROP Working Notes

less than 1 minute read

Published:

My own notes and questions on the UROP project.

2024