[ET-VK][qlinear] Add bias support to q4gsw and dq8ca_q4gsw quantized linear ops #18061
SS-JIA wants to merge 4 commits into gh/SS-JIA/478/base
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18061
❌ 4 New Failures, 1 Unrelated Failure
As of commit eb4c833 with merge base 8a285b7:
NEW FAILURES - 4 jobs have failed.
BROKEN TRUNK - 1 job failed but was also failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Stack from ghstack (oldest at bottom):
Wire bias through the q4gsw and dq8ca_q4gsw quantized linear operators.
Add add_bias_to_out_tile() helper in the output tile computation header and call
it from all three shader variants (tiled, coop, dq8ca_tiled). Remove the bias
guard in the pattern matcher to allow biased linear layers.
Differential Revision: D95970172
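
For readers unfamiliar with the change, the following is a rough reference sketch in Python of what adding bias to an output tile means. It is not the shader code from this PR: only the name `add_bias_to_out_tile` comes from the description, while its signature, the `linear_tiled` helper, the `tile_n` parameter, and the use of unquantized weights are all assumptions made for illustration.

```python
def add_bias_to_out_tile(out_tile, bias, col_start):
    """Add bias[col_start + j] to column j of every row in the tile, in place.

    Hypothetical signature: the real helper lives in a shader header and
    may take different arguments.
    """
    for row in out_tile:
        for j in range(len(row)):
            row[j] += bias[col_start + j]
    return out_tile


def linear_tiled(x, w, bias=None, tile_n=4):
    """Compute x @ w.T (+ bias), one tile of output columns at a time.

    x is M x K, w is N x K (one row per output channel), bias has N entries.
    Weights are plain floats here; quantization is deliberately left out.
    """
    m, n = len(x), len(w)
    out = [[0.0] * n for _ in range(m)]
    for col_start in range(0, n, tile_n):
        cols = range(col_start, min(col_start + tile_n, n))
        # Accumulate the dot products for this tile of output columns.
        tile = [
            [sum(a * b for a, b in zip(x[i], w[c])) for c in cols]
            for i in range(m)
        ]
        # Bias is applied once per tile, after accumulation.
        if bias is not None:
            add_bias_to_out_tile(tile, bias, col_start)
        for i in range(m):
            out[i][col_start:col_start + len(tile[i])] = tile[i]
    return out
```

Applying the bias after accumulation, once per output tile, keeps the inner loop unchanged, which is presumably why a single helper can be shared by several shader variants.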