Using Pipeline Driver in workload by fschlimb · Pull Request #84 · llvm/lighthouse

fschlimb · 2026-03-19T15:23:18Z

This PR makes workloads (examples based on workload class) use the smae pipepline definition and Driver infrastructure as lh-opt

move type dispatch from TransformStage to Transform
Accept ready modules as payload and TransformStages
rename schedule_modules -> pipeline

rengolin · 2026-03-19T16:27:45Z

lighthouse/pipeline/opt.py

-
-    def __init__(self, filename: str):
+    def __init__(self, source: str | ir.Module):
+        self._module = None


Do we really need _ for all field names?

We do not need to, but that's python's way to mark them private.

rengolin · 2026-03-19T16:31:46Z

lighthouse/workload/workload.py

 from contextlib import contextmanager
 from typing import Optional

+from lighthouse.pipeline.opt import Driver, Stage, Transform, TransformStage


I think it should be the other way around: Driver imports the rest of the infra.

From an earlier conversation with @tkarna, the Workload object is not a driver nor it is a pipeline.

We need to reach those agreements before starting the cross dependencies between our modules.

We can split the Driver, but the workload and Driver will use something which handles the pipeline. I didn't want to create yet another abstraction for no good reason, but if you insist.
I don't see the point of running examples through the Driver. Unless we get rid of workload.

IN my mind Workload is a different insfrastructure than pipeline etc. It is a user of most of the other infra.

One possible way to separate the Workload and Driver is to move the workload.lower_payload method to runner as a helper function.

We should separate the runner anyway at some point

But there's also the discussion is the workload is part of the pipeline, or if it contains the pipeline, or if they're orthogonal to each other...

In my mind, the workload is a helper/infrastructure class, which helps writing end-to-end examples, including benchmarking and correctness checking. Hence it uses a pipeline etc. and hides away boilerplate code. It is orthogonal to things like lh-opt or lh-run. A different design is of course possible. If we want to change the current approach, this should go into separate PR.

lighthouse/pipeline/opt.py

tkarna · 2026-03-20T06:13:37Z

lighthouse/pipeline/opt.py

+      * source: either a filename (str) for a file that will be imported into
+                a schedule (mlir or python), or a ready ir.Module.


I'm not particularly fond of this filename or ir.Module dualism.

The way that I see this is Transform is a utility tool to help importing transform schedules from a python or mlir file. With your changes, Transform is needed in TransformStage only to check whether the schedule module contains the expected named sequence and if not, it raises a descriptive error message. So the __str__ method is the only added value of the Transform object.

Suggested changes:

Rename Transform to something like TransformFileImporter. Move named sequence check there. This could actually also be a helper function, we don't really need the object.

TransformStage takes a schedule module. (Optionally it can also accept a TransformFileImporter). It could still do another named sequence check if needed.

Yes, I had similar thoughts. I am not sure, it that was a motivation in the orig design, but one can create a Transform when there is no context available yet. If we want to keep that possibility (of course only for files/strings) we need a way to defer the Stage assembly. In your approach this would mean we'll need some dualism when creating the TransformStage. So it would only move it from Transform to TransformStage.

I am not sure, it that was a motivation in the orig design, but one can create a TransformPass when there is no context available yet

I guess you meant Transform here. Hmm, TransformStage does need a context though. So we can only have the how-to-read-from-file metadata without a context.

In your approach this would mean we'll need some dualism when creating the TransformStage.

Yes, if we allow TransformStage to take either TransformFileImporter object or ir.module. We could just accept the latter. The functional implementation of create-transform-stage-from-file would then be something like
TransformStage(import_schedule_module(filename))

Or we could have TransformStage with the api: TransformStage(schedule_module) and TransformStage.from_file(filename). The latter could set _source_file etc metadata which could be used in error messages if needed. In this case the Transform class would not be needed at all.

Would be fine with me.
Still, I don't think it makes the whole thing better. The dualism is basically simply polymorphism, which I don't see as problematic in any way but actually like a lot. With your latest suggestion the user/creator of TransformStage is required to do the dispatch and not the Transform "wrapper". I could argue that the Transform is the right place to do this, not the stage. But as I said, both are fine with me.
What do others think? @rengolin

fschlimb marked this pull request as draft March 19, 2026 15:23

fschlimb changed the title ~~[draft] allowing ready (Transform)Stages and not only strings~~ [draft] Using Pipeline Driver in workload Mar 19, 2026

fschlimb force-pushed the pipeline_in_workload branch from b596b09 to a53744f Compare March 19, 2026 16:20

fschlimb changed the title ~~[draft] Using Pipeline Driver in workload~~ Using Pipeline Driver in workload Mar 19, 2026

fschlimb marked this pull request as ready for review March 19, 2026 16:22

fschlimb requested review from adam-smnk, Copilot, rengolin, rolfmorel and tkarna and removed request for Copilot March 19, 2026 16:22

rengolin reviewed Mar 19, 2026

View reviewed changes

tkarna reviewed Mar 20, 2026

View reviewed changes

lighthouse/pipeline/opt.py Outdated Show resolved Hide resolved

tkarna reviewed Mar 20, 2026

View reviewed changes

fschlimb added 6 commits March 20, 2026 02:36

allowing ready (Transform)Stages and not only strings

f41bfc2

allow payload to be string/file or ready module

ca4b03f

using Driver in workload

3472032

using passes in feed-forward.mpi

9155ef9

->

681b377

simplified ff and workload examples

cbed907

fschlimb force-pushed the pipeline_in_workload branch from 1921381 to cbed907 Compare March 20, 2026 09:41

simplifying matmul pipeline

80fc4a5

		* source: either a filename (str) for a file that will be imported into
		a schedule (mlir or python), or a ready ir.Module.

Conversation

fschlimb commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fschlimb Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fschlimb commented Mar 19, 2026 •

edited

Loading

fschlimb Mar 20, 2026 •

edited

Loading