• Chao Liu's avatar
    Gemm+Reduce Fusion (#128) · f95267f1
    Chao Liu authored
    * add gridwise gemm v4r1
    
    * rename
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * use sfc in shuffling
    
    * remove hardcode
    
    * remove hardcode
    
    * refactor
    
    * fix build
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * format
    
    * clean
    
    * adding gemm+reduce
    
    * adding profiler for gemm+reduce
    
    * adding gemm+reduce profiler
    
    * fix build
    
    * clean up
    
    * gemm+reduce
    
    * fix build
    
    * update DeviceGemm_Xdl_CShuffle; update enum to enum class
    
    * clean up
    
    * add test for gemm+reduce
    
    * clean up
    
    * refactor
    
    * fix build
    
    * fix build
    f95267f1
gemm_xdl_fp16.cpp 9.47 KB