    ckProfiler and device-level XDL GEMM operator (#48) · e823d518
    Chao Liu authored
    * add DeviceGemmXdl
    
    * update script
    
    * fix naming issue
    
    * fix comment
    
    * output HostTensorDescriptor
    
    * rename
    
    * padded GEMM for fwd v4r4r4 nhwc
    
    * refactor
    
    * refactor
    
    * refactor
    
    * adding ckProfiler
    
    * adding ckProfiler
    
    * refactor
    
    * fix tuning parameter bug
    
    * add more gemm instances
    
    * add more fp16 GEMM instances
    
    * fix profiler driver
    
    * fix bug in tuning parameter
    
    * add fp32 gemm instances
    
    * small fix
    
    * refactor
    
    * rename
    
    * refactor gemm profiler; adding DeviceConv and conv profiler
    
    * refactor
    
    * fix
    
    * add conv profiler
    
    * refactor
    
    * adding more GEMM and Conv instance
    
    * Create README.md
    
    Add build instruction for ckProfiler
    
    * Create README.md
    
    Add Readme for gemm_xdl example
    
    * Update README.md
    
    Remove build instruction from top most folder
    
    * Update README.md
    
    * clean up