• rocking5566's avatar
    external api for gemm + layernorm (#285) · 12235112
    rocking5566 authored
    * Extract base class for elementwise
    
    * Refactor interface of DeviceGemmReduce. Do not use tuple in interface
    
    * [What] Rename d into reduce in gemm + reduction related code
    [Why] Prepare to add d term for add
    
    * Unify base class of gemm + reduce and gemm + bias + add + reduce
    
    * 1. Rename gemm_bias_add_reduce for external api
     2. Refine cmake
    
    * Add normalize device operation
    
    * [What] Reorder the argument
    [Why] Because d0 is also the input of c.
    
    * Add type string
    
    * Add example of gemm_bias_add_layernorm  via external api
    
    * Refactor example code
    
    * clang-format
    
    * Fix compile error
    
    * clang-format
    
    * Add external api for gemm_add_add_layernorm and normalize
    
    * Add client example
    
    * clang-format
    12235112
CMakeLists.txt 324 Bytes