RFC 239: Policy on LLM assistance in contributions by jugglinmike · Pull Request #239 · web-platform-tests/rfcs

jugglinmike · 2026-05-20T19:54:59Z

This initial draft takes a maximalist approach (and a permissive stance) to promote a robust and grounded discussion.

Rendered

gsnedders · 2026-05-21T21:00:09Z

Fixes #202

gsnedders

Thanks for trying to tackle this!

I think we should also write something about use of LLMs for review, if nothing else.

gsnedders · 2026-05-21T21:16:46Z

+A few examples of policies on LLM use in FOSS contributions:
+
+- permissive
+  - [ghostty/AI_POLICY.md at main · ghostty-org/ghostty](https://github.com/ghostty-org/ghostty/blob/main/AI_POLICY.md)
+  - [Policy about LLM generated code from PRs · Issue #28335 · opencv/opencv](https://github.com/opencv/opencv/issues/28335)
+  - [CONTRIBUTING.md: Guidelines relevant to AI-assisted contributions by gasche · Pull Request #14052 · ocaml/ocaml](https://github.com/ocaml/ocaml/pull/14052)
+  - [LLVM AI Tool Use Policy — LLVM 23.0.0git documentation](https://llvm.org/docs/AIToolPolicy.html)
+- prohibitive
+  - [Code of Conduct ⚡ Zig Programming Language](https://ziglang.org/code-of-conduct/#strict-no-llm-no-ai-policy)
+  - [Getting Started - The Servo Book](https://book.servo.org/contributing/getting-started.html#ai-contributions)


I feel like there are two others that are notably relevant here: Chromium's and Firefox's, given that they are two of the five repos which have approval to land changes in WPT without further review. (WebKit and Test262 do not currently have policies; TC39's explicitly does not apply to code.)

gsnedders · 2026-05-21T21:25:00Z

+> Commits generated entirely by an LLM must be attributed to the LLM in the
+> "Author" field.


This feels problematic. If we attribute a PR to Claude, Gemini, or OpenAI's GPT and I then try to contact the author… well, I don't think Anthropic, Google, or OpenAI are going to be very helpful.

Both Chromium's and Firefox's policies are crystal clear that humans are still the authors and must self-review before submitting.

Therefore, when there's still a human very much in the loop who is required to self-review, it does not seem reasonable to consider the LLM the author — and the Chromium policy is explicit that, "Authors must attest that the code they submit is their original creation, regardless of whether AI tooling was used".

tabatkins · 2026-05-21

Strong agree on this point for all the reasons you give. The Author field's purpose is to give a contact for problems/questions, not to assign blame. Listing an LLM is worthless there.

And also, yeah, assigning authorship to an LLM is abdicating your responsibility as an engineer to commit useful code that you understand.

gsnedders · 2026-05-21T21:29:02Z

+> Contributions that contain substantial amounts of tool-generated content must
+> be labeled as such.


Neither Chromium nor Firefox requires this today, and it's entirely plausible we've already had commits land in WPT via exports which don't meet this bar.

That said, Chromium's policy here is currently:

> To aid reviewers, authors should flag areas that they are not confident about that had AI assistance.

This is maybe a weaker form, and hopefully something more in line with existing contributions.

gsnedders · 2026-05-21T21:35:12Z

+> ### For Trusted External Review
+>
+> Some external projects conduct review which the WPT maintainers recognize as
+> authoritative. From rendering engines like Gecko to dedicated test suites
+> like WASM, patches merged in these projects are incorporated into WPT without
+> further review. The policy outlined by this document does not apply to these
+> contributions; the external projects are trusted to determine their own
+> mechanisms for quality assurance.


This feels like it should probably be at least in part in another RFC that tries to define our existing policies?

As far as I'm aware, there are currently five repos which have approval to incorporate based on downstream review: Chromium, Firefox, Servo, Test262, and WebKit.

My understanding of the unwritten policy is we trust downstream reviewers; I can't even find the various places where we've elucidated parts of the policy over the years.

gsnedders · 2026-05-21T22:07:39Z

+
+## Details
+
+Proposed text:


Proposed for where?

gsnedders · 2026-05-21T23:06:43Z

+> contributions; the external projects are trusted to determine their own
+> mechanisms for quality assurance.
+
+## Risks


I think it's worthwhile including at least a few more technical risks:

- Contributions of tests generated by an LLM looking closely at a specific implementation's code, so that they match that implementation rather than the spec. (This is, of course, already an issue, but it could become more of a problem if we get more, larger contributions.)
- Contributions not matching the spec at all. I've seen this mostly with attempts to generate tests that assert the ordering of things which end up using HTML's parallelism and HTML's event loops; that case is especially annoying because it can lead to flaky tests.
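To make the flakiness risk concrete, here is a minimal sketch in Python. It is an illustration only, not WPT harness code. The event names `fetch-done` and `timer-fired` are hypothetical stand-ins for two steps whose relative order is assumed not to be guaranteed. An order-sensitive assertion passes or fails depending on which step happens to finish first, while an order-insensitive one checks only what is assumed to be guaranteed.

```python
from collections import Counter


def saw_in_order(log, expected):
    """Order-sensitive check: passes only for exactly this sequence."""
    return list(log) == list(expected)


def saw_all(log, expected):
    """Order-insensitive check: same events, in any order."""
    return Counter(log) == Counter(expected)


# Hypothetical event names; assume their relative order is NOT guaranteed.
expected = ["fetch-done", "timer-fired"]

# Two runs of the same test that differ only in scheduling.
run_a = ["fetch-done", "timer-fired"]
run_b = ["timer-fired", "fetch-done"]

for name, run in (("run_a", run_a), ("run_b", run_b)):
    print(name, "ordered:", saw_in_order(run, expected),
          "unordered:", saw_all(run, expected))
```

The ordered check disagrees between the two runs, which is what a flaky test looks like in practice. The unordered check is stable, but it is only the right assertion if the order really is unspecified.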

RFC 239: Policy on LLM assistance in contributions

ba9ec3e

gsnedders requested changes May 21, 2026

jugglinmike wants to merge 1 commit into web-platform-tests:main from bocoup:llm-policy


3 participants: jugglinmike, gsnedders, tabatkins
