Dataset Preview
0 (string) | 1 (float64) |
|---|---|
megatron.core.transformer.attention.forward.qkv | 235.090561 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003104 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003136 |
megatron.core.transformer.attention.forward.core_attention | 836.380249 |
megatron.core.transformer.attention.forward.linear_proj | 1.484064 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 1074.752808 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 1103.562866 |
megatron.core.transformer.mlp.forward.linear_fc1 | 8.837568 |
megatron.core.transformer.mlp.forward.activation | 469.043427 |
megatron.core.transformer.mlp.forward.linear_fc2 | 6.065408 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 485.610168 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.452608 |
megatron.core.transformer.attention.forward.qkv | 2.578528 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.0032 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 6.068768 |
megatron.core.transformer.attention.forward.linear_proj | 1.432608 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.107264 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.452672 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.725888 |
megatron.core.transformer.mlp.forward.activation | 0.6752 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.672992 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.08784 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.460928 |
megatron.core.transformer.attention.forward.qkv | 2.585184 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003104 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 6.164992 |
megatron.core.transformer.attention.forward.linear_proj | 1.437152 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.214112 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.460704 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.723744 |
megatron.core.transformer.mlp.forward.activation | 0.675328 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.692224 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.105696 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.460992 |
megatron.core.transformer.attention.forward.qkv | 2.593152 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003136 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003136 |
megatron.core.transformer.attention.forward.core_attention | 6.173216 |
megatron.core.transformer.attention.forward.linear_proj | 1.43792 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.231136 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.46224 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.733536 |
megatron.core.transformer.mlp.forward.activation | 0.673728 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.688384 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.109696 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.473184 |
megatron.core.transformer.attention.forward.qkv | 2.618784 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003136 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 6.668256 |
megatron.core.transformer.attention.forward.linear_proj | 1.566112 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.8792 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.556992 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.146048 |
megatron.core.transformer.mlp.forward.activation | 0.800864 |
megatron.core.transformer.mlp.forward.linear_fc2 | 6.009824 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.969952 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.556416 |
megatron.core.transformer.attention.forward.qkv | 2.75232 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003136 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 7.050624 |
megatron.core.transformer.attention.forward.linear_proj | 1.555424 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.384224 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.552256 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.082848 |
megatron.core.transformer.mlp.forward.activation | 0.794048 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.820096 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.709856 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.53696 |
megatron.core.transformer.attention.forward.qkv | 2.678496 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003136 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 6.881536 |
megatron.core.transformer.attention.forward.linear_proj | 1.504448 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 11.089344 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.536864 |
megatron.core.transformer.mlp.forward.linear_fc1 | 5.90688 |
megatron.core.transformer.mlp.forward.activation | 0.77472 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.771968 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.466016 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.536768 |
megatron.core.transformer.attention.forward.qkv | 2.654976 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003168 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003168 |
megatron.core.transformer.attention.forward.core_attention | 6.728128 |
megatron.core.transformer.attention.forward.linear_proj | 1.505856 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 10.91408 |
megatron.core.transformer.transformer_layer._forward_attention.self_attn_bda | 0.55184 |
megatron.core.transformer.mlp.forward.linear_fc1 | 6.082816 |
megatron.core.transformer.mlp.forward.activation | 0.79424 |
megatron.core.transformer.mlp.forward.linear_fc2 | 5.948224 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp | 12.837376 |
megatron.core.transformer.transformer_layer._forward_mlp.mlp_bda | 0.551712 |
megatron.core.transformer.attention.forward.qkv | 2.751488 |
megatron.core.transformer.attention.forward.adjust_key_value | 0.003072 |
megatron.core.transformer.attention.forward.rotary_pos_emb | 0.003104 |
megatron.core.transformer.attention.forward.core_attention | 7.142624 |
End of preview.
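
The rows above repeat the same module names block after block, so grouping by name makes the table easier to read. The following is a minimal sketch, assuming the rows are available as plain text in the `name | value |` layout shown in the preview. The helper names `parse_rows` and `summarize` are hypothetical, the unit of the values is not stated in the source, and dropping the first sample of each name is only an assumption motivated by the much larger values in the first block.

```python
from collections import defaultdict
from statistics import mean

# A few rows copied from the preview; the unit of the values is not stated.
SAMPLE = """\
0 string | 1 float64 |
|---|---|
megatron.core.transformer.attention.forward.qkv | 235.090561 |
megatron.core.transformer.transformer_layer._forward_attention.self_attention | 1,074.752808 |
megatron.core.transformer.attention.forward.qkv | 2.578528 |
megatron.core.transformer.attention.forward.qkv | 2.585184 |
"""


def parse_rows(text):
    """Yield (name, value) pairs from 'name | value |' rows, skipping other lines."""
    for line in text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 2:
            continue  # not a two-column row
        name, raw = cells
        try:
            # Tolerate a thousands separator such as "1,074.752808".
            value = float(raw.replace(",", ""))
        except ValueError:
            continue  # header or separator row
        yield name, value


def summarize(text, skip_first=True):
    """Return the mean value per name.

    With skip_first=True the first sample of each name is dropped whenever
    more than one sample exists (assumption: the first block is an outlier).
    """
    groups = defaultdict(list)
    for name, value in parse_rows(text):
        groups[name].append(value)
    summary = {}
    for name, values in groups.items():
        kept = values[1:] if skip_first and len(values) > 1 else values
        summary[name] = mean(kept)
    return summary


summary = summarize(SAMPLE)
for name, avg in sorted(summary.items()):
    print(f"{name}: {avg:.6f}")
```

Passing `skip_first=False` keeps every sample, which is the safer choice when it is not known whether the first block is a warm-up pass.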
