Skip to content

[SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention#23812

Open
arthw wants to merge 2 commits into
ggml-org:masterfrom
arthw:enhance_flash-attention
Open

[SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention#23812
arthw wants to merge 2 commits into
ggml-org:masterfrom
arthw:enhance_flash-attention

Conversation

@arthw
Copy link
Copy Markdown
Contributor

@arthw arthw commented May 28, 2026

Support Q4_1, Q5_0, Q5_1 in Flash-attention
UT cases are passed locally.

@arthw arthw requested a review from a team as a code owner May 28, 2026 11:11
@github-actions github-actions Bot added documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels May 28, 2026
@arthw arthw added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label May 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant