Skip to content
GitLab
Menu
Projects
Groups
Snippets
Help
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
liyinrong
composable_kernel
Graph
feature/add-gemm-add-add-fastgelu-int4-examples
Switch branch/tag
You can move around the graph by using the arrow keys.
Begin with the selected commit
Created with Raphaël 2.2.0
7
Sep
8
7
6
5
4
3
2
1
31
Aug
30
29
30
29
28
27
28
27
26
25
24
23
22
21
20
19
18
17
16
15
13
12
11
12
11
10
9
8
7
6
5
4
3
2
1
31
Jul
30
29
28
27
26
25
Add stderr to QA logfiles, process splitK and ONNX gemm kernels (#402)
Add 'BlockSize' parameter to 'DevicePermute'
make device/grid level code
gemm_splitk_bias
gemm_splitk_bias
Add missing include directive
Check tensor descriptor dimensions in 'GridwiseElementwise_1D'
change profiler and instance to mutiple d0
add dynamic d0_element_op
mutiple d0
Rename 'GridwisePermute' to 'GridwiseCopy'
Add N/H/WPerBlock template parameter to 'DevicePermute'
Add comment to indicate template argument location
add some code
Explicitly use ck::math::sqrt in batchnorm-forward kernels
Merge remote-tracking branch 'origin/develop' into aosewski/softmax_ut
Add debug code the verify result
Test non innermost dim for fp32 and int8
Fix syntax.
Test cases when reduced dim is not innermost axis.
Renaming in the kernel arguments
host softmax: handle all reduce
Transform descriptor into 3 dimensions
Tiny correction and remove un-used file under example/34_batchnorm
Change problem description for 'DevicePermute'
Remove never-entered-if-clause
Remove no-longer used method
Check if input/output shape meet the requirement
init version
fix example; make padding on by default in example; fix argument checks
Merge branch 'develop' into tensor_permutation
added elementwise permute example
Merge branch 'develop' into feature/add-permute-device-op
Remove no-longer used type argument
Add 'GridwisePermute' kernel
Fused attention instances & padding tests (#395)
GemmGemm TNNT instances (#399)
fix format
Use more reasonable return value for Invoker::Run()
Passing 'axes' to 'DevicePermute'
Softmax client example (#396)
remove useless code
Menu
Projects
Groups
Snippets
Help