Hi MiniMax team,
First, thank you for the great work on MSA and for open-sourcing it. Also thanks for contributing the original HF Kernel Hub package.
I tested and packaged a FlashRT-maintained Blackwell / GB10 extension of the MiniMax MSA decode-sparse path for the HF Kernel Hub:
https://huggingface.co/kernels/flashrt/MiniMaxAI-msa-blackwell
This package targets NVIDIA Blackwell GPUs with compute capability 12.x and has been validated on DGX Spark (GB10, SM121). The main goal is to make the MiniMax MSA decode-sparse path usable beyond the original SM100-only target.
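For anyone who wants to gate on this in their own code, here is a minimal sketch of the capability check. The helper name `is_msa_blackwell_target` is hypothetical and not part of the package; it only assumes that compute capability is given as a `(major, minor)` pair, so SM121 is `(12, 1)` and SM100 is `(10, 0)`.

```python
from typing import Tuple


def is_msa_blackwell_target(capability: Tuple[int, int]) -> bool:
    """Return True for compute capability 12.x, the range this package targets.

    Hypothetical helper for illustration; the package itself may gate differently.
    """
    major, _minor = capability
    return major == 12


# SM121 (DGX Spark / GB10) falls in the 12.x range.
print(is_msa_blackwell_target((12, 1)))
# SM100 is the original target and is outside the 12.x range.
print(is_msa_blackwell_target((10, 0)))
```

With PyTorch installed, the tuple can come from `torch.cuda.get_device_capability()`, which returns the `(major, minor)` pair of the current CUDA device.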
Source repo:
https://github.com/LiangSu8899/FlashRT-HF-kernels/tree/main/MiniMaxAI-msa-blackwell
I’d be happy to keep iterating on this and gradually port more of the SM100 MSA API surface to compute capability 12.x Blackwell where it makes sense.