Skip to content
GitLab
Menu
Projects
Groups
Snippets
Help
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
liyinrong
composable_kernel
Graph
layernorm_welford
Switch branch/tag
You can move around the graph by using the arrow keys.
Begin with the selected commit
Created with Raphaël 2.2.0
31
Aug
30
29
30
29
28
27
28
27
26
25
24
23
22
21
20
19
18
17
16
15
13
12
11
12
11
10
9
8
7
6
5
4
3
2
1
31
Jul
30
29
28
27
26
25
24
22
21
20
19
18
19
18
17
16
15
14
15
14
13
12
11
10
9
8
7
fix host bias+gelu bug
attention kernel proper granular padding for all 4 dims
shrink input value range for attention kernel validation to avoid occasional error by 1e-3
add adhoc padding test for atten
refactor attention padding to better fit in unit tests
add TNTT gemm_gemm + atten kernel instances
add gemm spec in kernel name
change input parameter for ckProfiler
ckprofiler finish code
trim unnecessary check
modify comment
add conv+conv example, 1x1 only
conv_conv_v2
conv_conv_v2
Add debug info to DeviceBatchedGemmXdl and instances to batched_gemm
Minor fix
Add debug info to DeviceGemmXdl_CShuffle and instances to gemm_add_add_fastgelu
refactor conv
update test
conv_conv
conv_conv
Add debug info to DeviceGemmXdl_CShuffle
Add debug info to DeviceGemmXdl
Gemm reduce examples int4/int8/fp32/bf16 (#368)
Padding for attention: bmm+scale+softmax+bmm kernel (#385)
start add to ckprofiler
add instance
gelu change to relu and GetElementSpaceSize bug
Merge branch 'tensor_permutation' of github.com:ROCmSoftwarePlatform/composable_kernel into tensor_permutation
changed deviceelementwise parameters for outscalar
Fix code-comment mismatch
Remove macro PP_DEFINE_LAYOUT_TYPE()
add comments for usages of padding bmm+scale+softmax+bmm
att-with-mask
att-with-mask
Add gemm instances
Use different initialization method for examples
Add 'final' specifier to utility classes
Group same-dim-layouts together in 'LayoutSetting<>'
Add check for the 'RLayout' type argument
Use same A/B data type for host Conv in int4 example
Remove debug messages
Use named variables to replace magic numbers
Merge remote-tracking branch 'origin/develop' into conv_conv
Merge branch 'develop' into feature/add-convnd-fwd-reduce-examples
Try to workaround flaky GemmSoftmaxGemm tests (#386)
Menu
Projects
Groups
Snippets
Help