VOOZH about

URL: https://huggingface.co/datasets/GindaChen/megatron-prof-data-v10

⇱ GindaChen/megatron-prof-data-v10 · Datasets at Hugging Face


Dataset Preview
Duplicate
0
string
1
float64
megatron.core.transformer.attention.forward.qkv
235.090561
megatron.core.transformer.attention.forward.adjust_key_value
0.003104
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003136
megatron.core.transformer.attention.forward.core_attention
836.380249
megatron.core.transformer.attention.forward.linear_proj
1.484064
megatron.core.transformer.transformer_layer._forward_attention.self_attention
1,074.752808
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
1,103.562866
megatron.core.transformer.mlp.forward.linear_fc1
8.837568
megatron.core.transformer.mlp.forward.activation
469.043427
megatron.core.transformer.mlp.forward.linear_fc2
6.065408
megatron.core.transformer.transformer_layer._forward_mlp.mlp
485.610168
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.452608
megatron.core.transformer.attention.forward.qkv
2.578528
megatron.core.transformer.attention.forward.adjust_key_value
0.0032
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
6.068768
megatron.core.transformer.attention.forward.linear_proj
1.432608
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.107264
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.452672
megatron.core.transformer.mlp.forward.linear_fc1
5.725888
megatron.core.transformer.mlp.forward.activation
0.6752
megatron.core.transformer.mlp.forward.linear_fc2
5.672992
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.08784
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.460928
megatron.core.transformer.attention.forward.qkv
2.585184
megatron.core.transformer.attention.forward.adjust_key_value
0.003104
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
6.164992
megatron.core.transformer.attention.forward.linear_proj
1.437152
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.214112
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.460704
megatron.core.transformer.mlp.forward.linear_fc1
5.723744
megatron.core.transformer.mlp.forward.activation
0.675328
megatron.core.transformer.mlp.forward.linear_fc2
5.692224
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.105696
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.460992
megatron.core.transformer.attention.forward.qkv
2.593152
megatron.core.transformer.attention.forward.adjust_key_value
0.003136
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003136
megatron.core.transformer.attention.forward.core_attention
6.173216
megatron.core.transformer.attention.forward.linear_proj
1.43792
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.231136
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.46224
megatron.core.transformer.mlp.forward.linear_fc1
5.733536
megatron.core.transformer.mlp.forward.activation
0.673728
megatron.core.transformer.mlp.forward.linear_fc2
5.688384
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.109696
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.473184
megatron.core.transformer.attention.forward.qkv
2.618784
megatron.core.transformer.attention.forward.adjust_key_value
0.003136
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
6.668256
megatron.core.transformer.attention.forward.linear_proj
1.566112
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.8792
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.556992
megatron.core.transformer.mlp.forward.linear_fc1
6.146048
megatron.core.transformer.mlp.forward.activation
0.800864
megatron.core.transformer.mlp.forward.linear_fc2
6.009824
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.969952
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.556416
megatron.core.transformer.attention.forward.qkv
2.75232
megatron.core.transformer.attention.forward.adjust_key_value
0.003136
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
7.050624
megatron.core.transformer.attention.forward.linear_proj
1.555424
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.384224
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.552256
megatron.core.transformer.mlp.forward.linear_fc1
6.082848
megatron.core.transformer.mlp.forward.activation
0.794048
megatron.core.transformer.mlp.forward.linear_fc2
5.820096
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.709856
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.53696
megatron.core.transformer.attention.forward.qkv
2.678496
megatron.core.transformer.attention.forward.adjust_key_value
0.003136
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
6.881536
megatron.core.transformer.attention.forward.linear_proj
1.504448
megatron.core.transformer.transformer_layer._forward_attention.self_attention
11.089344
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.536864
megatron.core.transformer.mlp.forward.linear_fc1
5.90688
megatron.core.transformer.mlp.forward.activation
0.77472
megatron.core.transformer.mlp.forward.linear_fc2
5.771968
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.466016
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.536768
megatron.core.transformer.attention.forward.qkv
2.654976
megatron.core.transformer.attention.forward.adjust_key_value
0.003168
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003168
megatron.core.transformer.attention.forward.core_attention
6.728128
megatron.core.transformer.attention.forward.linear_proj
1.505856
megatron.core.transformer.transformer_layer._forward_attention.self_attention
10.91408
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda
0.55184
megatron.core.transformer.mlp.forward.linear_fc1
6.082816
megatron.core.transformer.mlp.forward.activation
0.79424
megatron.core.transformer.mlp.forward.linear_fc2
5.948224
megatron.core.transformer.transformer_layer._forward_mlp.mlp
12.837376
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda
0.551712
megatron.core.transformer.attention.forward.qkv
2.751488
megatron.core.transformer.attention.forward.adjust_key_value
0.003072
megatron.core.transformer.attention.forward.rotary_pos_emb
0.003104
megatron.core.transformer.attention.forward.core_attention
7.142624
End of preview.

No dataset card yet

Downloads last month
2