VOOZH about

URL: https://huggingface.co/datasets/GindaChen/megatron-prof-data-v9

⇱ GindaChen/megatron-prof-data-v9 · Datasets at Hugging Face


Dataset Preview
Duplicate
0
string
1
float64
megatron.core.transformer.attention.forward.qkv
197.073761
megatron.core.transformer.attention.forward.adjust_key_value
0.1096
megatron.core.transformer.attention.forward.rotary_pos_emb
0.087072
megatron.core.transformer.attention.forward.core_attention
860.149475
megatron.core.transformer.attention.forward.linear_proj
1.501952
megatron.core.transformer.transformer_layer._forward_attention.self_attention
1,060.835693
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
1,103.405151
megatron.core.transformer.mlp.forward.linear_fc1
15.728672
megatron.core.transformer.mlp.forward.activation
474.251373
megatron.core.transformer.mlp.forward.linear_fc2
10.712512
megatron.core.transformer.transformer_layer._forward_mlp.mlp
502.245209
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.45184
megatron.core.transformer.attention.forward.qkv
7.838624
megatron.core.transformer.attention.forward.adjust_key_value
0.002976
megatron.core.transformer.attention.forward.rotary_pos_emb
0.07984
megatron.core.transformer.attention.forward.core_attention
16.827616
megatron.core.transformer.attention.forward.linear_proj
1.4552
megatron.core.transformer.transformer_layer._forward_attention.self_attention
26.358528
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.452288
megatron.core.transformer.mlp.forward.linear_fc1
5.807456
megatron.core.transformer.mlp.forward.activation
0.662016
megatron.core.transformer.mlp.forward.linear_fc2
5.723648
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.206848
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.451584
megatron.core.transformer.attention.forward.qkv
2.606688
megatron.core.transformer.attention.forward.adjust_key_value
0.002912
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
6.24656
megatron.core.transformer.attention.forward.linear_proj
1.493664
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.373056
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.522976
megatron.core.transformer.mlp.forward.linear_fc1
5.837856
megatron.core.transformer.mlp.forward.activation
0.751808
megatron.core.transformer.mlp.forward.linear_fc2
5.739264
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.342464
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.519968
megatron.core.transformer.attention.forward.qkv
2.660864
megatron.core.transformer.attention.forward.adjust_key_value
0.002944
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
6.968032
megatron.core.transformer.attention.forward.linear_proj
1.538496
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.192064
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.543008
megatron.core.transformer.mlp.forward.linear_fc1
6.032384
megatron.core.transformer.mlp.forward.activation
0.78288
megatron.core.transformer.mlp.forward.linear_fc2
5.911232
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.74048
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.557792
megatron.core.transformer.attention.forward.qkv
2.783616
megatron.core.transformer.attention.forward.adjust_key_value
0.002944
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
7.114752
megatron.core.transformer.attention.forward.linear_proj
1.567936
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.491584
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.557856
megatron.core.transformer.mlp.forward.linear_fc1
6.152032
megatron.core.transformer.mlp.forward.activation
0.804224
megatron.core.transformer.mlp.forward.linear_fc2
6.015296
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.984736
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.55696
megatron.core.transformer.attention.forward.qkv
2.795712
megatron.core.transformer.attention.forward.adjust_key_value
0.003008
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
7.113344
megatron.core.transformer.attention.forward.linear_proj
1.570464
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.504032
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.556896
megatron.core.transformer.mlp.forward.linear_fc1
6.144864
megatron.core.transformer.mlp.forward.activation
0.801184
megatron.core.transformer.mlp.forward.linear_fc2
6.009728
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.967648
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.557248
megatron.core.transformer.attention.forward.qkv
2.777248
megatron.core.transformer.attention.forward.adjust_key_value
0.002944
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
7.061664
megatron.core.transformer.attention.forward.linear_proj
1.539584
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.4024
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.54208
megatron.core.transformer.mlp.forward.linear_fc1
6.023904
megatron.core.transformer.mlp.forward.activation
0.781344
megatron.core.transformer.mlp.forward.linear_fc2
5.890464
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.707456
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.541952
megatron.core.transformer.attention.forward.qkv
2.729856
megatron.core.transformer.attention.forward.adjust_key_value
0.002976
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
7.215072
megatron.core.transformer.attention.forward.linear_proj
1.605056
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.573824
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.567872
megatron.core.transformer.mlp.forward.linear_fc1
6.269952
megatron.core.transformer.mlp.forward.activation
0.815392
megatron.core.transformer.mlp.forward.linear_fc2
6.065568
megatron.core.transformer.transformer_layer._forward_mlp.mlp
13.162688
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.556928
megatron.core.transformer.attention.forward.qkv
2.782912
megatron.core.transformer.attention.forward.adjust_key_value
0.00272
megatron.core.transformer.attention.forward.rotary_pos_emb
0.002976
megatron.core.transformer.attention.forward.core_attention
7.097888
End of preview.

No dataset card yet

Downloads last month
5