Skip to content

Conversation

@Anri-Lombard
Copy link

Summary

  • Document that MLX's mask="causal" uses lower-right alignment
  • Clarify the difference from PyTorch's default is_causal=True (upper-left)

When T_q != T_kv, this distinction matters:

  • MLX (lower-right): Last query aligns with last key
  • PyTorch default (upper-left): First query aligns with first key

References:

Relates to #2835

Clarify that MLX uses lower-right alignment for causal masks when
T_q != T_kv, which differs from PyTorch's default upper-left alignment.

Relates to ml-explore#2835
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant