[experimental][ET-VK][qconv] Use specialization constants for conv2d params in q8ta shaders #18163
SS-JIA wants to merge 1 commit into gh/SS-JIA/483/base
…params in q8ta shaders

Replace Conv2DParams UBO with specialization constants in q8ta_conv2d, q8ta_conv2d_dw, and q8ta_im2col shaders. This allows the GPU compiler to treat conv params (kernel size, stride, padding, dilation, groups) as compile-time constants, enabling loop unrolling, dead code elimination, and constant folding.

Differential Revision: [D96503584](https://our.internmc.facebook.com/intern/diff/D96503584/)
ghstack-source-id: 351872206
Pull Request resolved: #18163
Dr. CI: ❌ 8 New Failures, 8 Unrelated Failures as of commit 92d132d with merge base cc27e6b.
Stack from ghstack (oldest at bottom):
Replace Conv2DParams UBO with specialization constants in q8ta_conv2d,
q8ta_conv2d_dw, and q8ta_im2col shaders. This allows the GPU compiler to
treat conv params (kernel size, stride, padding, dilation, groups) as
compile-time constants, enabling loop unrolling, dead code elimination,
and constant folding.
Differential Revision: D96503584
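
To illustrate why moving conv params from a runtime parameter block to compile-time constants can help, here is a minimal CPU-side analogy in C++. It is not the shader code from this PR: `Conv1DParams`, `conv1d_out_len`, `conv1d_runtime`, and `conv1d_static` are hypothetical names, the convolution is 1D for brevity, and template parameters stand in for Vulkan specialization constants. The runtime variant reads kernel size, stride, padding, and dilation from a struct (the role the UBO plays), while the templated variant bakes them in so the compiler is free to unroll the inner loop and fold the index arithmetic.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a runtime parameter block (the role a UBO plays).
struct Conv1DParams {
  int kernel_size;
  int stride;
  int padding;
  int dilation;
};

// Output length for a 1D convolution over in_len elements.
inline int conv1d_out_len(int in_len, int kernel_size, int stride,
                          int padding, int dilation) {
  const int span = in_len + 2 * padding - dilation * (kernel_size - 1) - 1;
  assert(span >= 0 && stride > 0);
  return span / stride + 1;
}

// Runtime variant: loop bounds and index math depend on values that are
// only known when the function is called.
inline std::vector<int> conv1d_runtime(const std::vector<int>& in,
                                       const std::vector<int>& w,
                                       const Conv1DParams& p) {
  assert(static_cast<int>(w.size()) == p.kernel_size);
  const int in_len = static_cast<int>(in.size());
  const int out_len =
      conv1d_out_len(in_len, p.kernel_size, p.stride, p.padding, p.dilation);
  std::vector<int> out(out_len, 0);
  for (int o = 0; o < out_len; ++o) {
    for (int k = 0; k < p.kernel_size; ++k) {
      const int i = o * p.stride - p.padding + k * p.dilation;
      if (i >= 0 && i < in_len) out[o] += in[i] * w[k];  // zero padding
    }
  }
  return out;
}

// Compile-time variant: K, S, P, D are constants in the generated code,
// so the inner loop has a fixed trip count and constant index offsets.
template <int K, int S, int P, int D>
std::vector<int> conv1d_static(const std::vector<int>& in,
                               const std::vector<int>& w) {
  assert(static_cast<int>(w.size()) == K);
  const int in_len = static_cast<int>(in.size());
  const int out_len = conv1d_out_len(in_len, K, S, P, D);
  std::vector<int> out(out_len, 0);
  for (int o = 0; o < out_len; ++o) {
    for (int k = 0; k < K; ++k) {
      const int i = o * S - P + k * D;
      if (i >= 0 && i < in_len) out[o] += in[i] * w[k];  // zero padding
    }
  }
  return out;
}
```

Both variants compute the same result; for example, `conv1d_static<3, 1, 1, 1>(in, w)` matches `conv1d_runtime(in, w, Conv1DParams{3, 1, 1, 1})` for the same `in` and `w`. The trade-off in this sketch mirrors the one implied by the description: each distinct parameter combination yields its own specialized code, in exchange for giving the compiler constants to optimize against.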