The world's first open source Python DSL for NVIDIA PTX.
Cool work! What would you say is the primary limitation of pyptx now?
There is headroom on Blackwell I have not pushed yet. I also need more pypytx primitives to make the GEMM kernels easier to write for users.
Cool work! What would you say is the primary limitation of pyptx now?
There is headroom on Blackwell I have not pushed yet. I also need more pypytx primitives to make the GEMM kernels easier to write for users.