
pass params_dtype to qk_norm creation #2718

Open
pstjohn wants to merge 3 commits into NVIDIA:main from pstjohn:pstjohn/qk-norm-dtype

Conversation

@pstjohn
Contributor

@pstjohn pstjohn commented Feb 28, 2026

Previously, layers would fail with:

            assert (
>               query_layer.dtype == key_layer.dtype and query_layer.dtype == value_layer.dtype
            ), "Queries, keys and values must have the same data type!"
E           AssertionError: Queries, keys and values must have the same data type!

transformer_engine/pytorch/attention/dot_product_attention/dot_product_attention.py:1063: AssertionError

if you created a layer with a dtype other than float32. This change ensures that the dtype of the layernorm layers matches that of the base attention layer.

Signed-off-by: Peter St. John <pstjohn@nvidia.com>
@greptile-apps
Contributor

greptile-apps bot commented Feb 28, 2026

Greptile Summary

This PR fixes a dtype mismatch bug in MultiheadAttention when using QK normalization with non-float32 dtypes. The fix ensures the RMSNorm and LayerNorm normalization modules receive the params_dtype parameter, preventing the assertion failure that occurred when queries, keys, and values ended up with different dtypes.

Key changes:

  • Modified _create_qk_norm_modules() to accept and pass params_dtype to RMSNorm/LayerNorm constructors
  • L2Normalization correctly excluded (parameter-free operation)
  • Added test parametrization for torch.float32 and torch.bfloat16 to verify the fix
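A dtype parametrization of this kind could look like the following hedged sketch; the test name and body are illustrative (the real test in tests/pytorch/test_qk_norm.py exercises Transformer Engine's attention layers, while this stand-in checks the same invariant on a plain PyTorch norm):

```python
import pytest
import torch

@pytest.mark.parametrize("params_dtype", [torch.float32, torch.bfloat16])
def test_qk_norm_dtype(params_dtype):
    # Stand-in norm created with an explicit dtype, mirroring the fix.
    norm = torch.nn.LayerNorm(16, dtype=params_dtype)
    q = torch.randn(2, 16, dtype=params_dtype)
    # The normalized queries must keep the layer's params_dtype, or the
    # q/k/v dtype assertion in DotProductAttention would fail.
    assert norm(q).dtype == params_dtype
```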

Confidence Score: 5/5

  • This PR is safe to merge with minimal risk - it's a targeted bug fix with comprehensive test coverage
  • The fix is straightforward and correct: it passes the existing params_dtype parameter through to normalization layers to ensure dtype consistency. The change is well-tested with parametrized tests covering both float32 and bfloat16. L2Normalization is correctly excluded as it's parameter-free. No breaking changes or edge cases identified.
  • No files require special attention

Important Files Changed

  • transformer_engine/pytorch/attention/multi_head_attention.py: passes the params_dtype parameter to the QK normalization layers (RMSNorm/LayerNorm) to ensure dtype consistency with the main attention layer
  • tests/pytorch/test_qk_norm.py: adds test coverage for different params_dtype values (float32, bfloat16) to verify the dtype fix works correctly

Last reviewed commit: 4db2067

Contributor

@greptile-apps greptile-apps bot left a comment


2 files reviewed, no comments


Member

@ksivaman ksivaman left a comment


LGTM

@yaox12
Member

yaox12 commented Mar 2, 2026

/te-ci pytorch

