Dataset Preview
0 string | 1 float64 |
|---|---|
megatron.core.transformer.attention.forward.qkv | 197.073761 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.1096 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.087072 |
megatron.core.transformer.attention.forward.core_attention | 860.149475 |
megatron.core.transformer.attention.forward.linear_proj | 1.501952 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 1,060.835693 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 1,103.405151 |
megatron.core.transformer.mlp.forward.linear_fc1 | 15.728672 |
megatron.core.transformer.mlp.forward.activation | 474.251373 |
megatron.core.transformer.mlp.forward.linear_fc2 | 10.712512 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 502.245209 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.45184 |
megatron.core.transformer.attention.forward.qkv | 7.838624 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002976 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.07984 |
megatron.core.transformer.attention.forward.core_attention | 16.827616 |
megatron.core.transformer.attention.forward.linear_proj | 1.4552 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 26.358528 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.452288 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.807456 |
megatron.core.transformer.mlp.forward.activation | 0.662016 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.723648 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.206848 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.451584 |
megatron.core.transformer.attention.forward.qkv | 2.606688 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002912 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 6.24656 |
megatron.core.transformer.attention.forward.linear_proj | 1.493664 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.373056 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.522976 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.837856 |
megatron.core.transformer.mlp.forward.activation | 0.751808 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.739264 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.342464 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.519968 |
megatron.core.transformer.attention.forward.qkv | 2.660864 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002944 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 6.968032 |
megatron.core.transformer.attention.forward.linear_proj | 1.538496 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.192064 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.543008 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.032384 |
megatron.core.transformer.mlp.forward.activation | 0.78288 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.911232 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.74048 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.557792 |
megatron.core.transformer.attention.forward.qkv | 2.783616 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002944 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 7.114752 |
megatron.core.transformer.attention.forward.linear_proj | 1.567936 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.491584 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.557856 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.152032 |
megatron.core.transformer.mlp.forward.activation | 0.804224 |
megatron.core.transformer.mlp.forward.linear_fc2 | 6.015296 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.984736 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.55696 |
megatron.core.transformer.attention.forward.qkv | 2.795712 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003008 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 7.113344 |
megatron.core.transformer.attention.forward.linear_proj | 1.570464 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.504032 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.556896 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.144864 |
megatron.core.transformer.mlp.forward.activation | 0.801184 |
megatron.core.transformer.mlp.forward.linear_fc2 | 6.009728 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.967648 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.557248 |
megatron.core.transformer.attention.forward.qkv | 2.777248 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002944 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 7.061664 |
megatron.core.transformer.attention.forward.linear_proj | 1.539584 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.4024 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.54208 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.023904 |
megatron.core.transformer.mlp.forward.activation | 0.781344 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.890464 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.707456 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.541952 |
megatron.core.transformer.attention.forward.qkv | 2.729856 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.002976 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 7.215072 |
megatron.core.transformer.attention.forward.linear_proj | 1.605056 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.573824 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.567872 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.269952 |
megatron.core.transformer.mlp.forward.activation | 0.815392 |
megatron.core.transformer.mlp.forward.linear_fc2 | 6.065568 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 13.162688 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.556928 |
megatron.core.transformer.attention.forward.qkv | 2.782912 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.00272 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.002976 |
megatron.core.transformer.attention.forward.core_attention | 7.097888 |
End of preview.
No dataset card yet
- Downloads last month
- 5
