Hello guys, welcome to my website. You are watching "HPCA'21 SpAtten: Efficient Sparse Attention Architecture with Cascade Token/Head Pruning" (Hanrui Wang). This video was uploaded by MIT HAN Lab on 2021-02-13T08:21:24-08:00. We promote this video for entertainment and educational purposes only. I hope you like our website.