Add support for torch.export exported models by tolleybot · Pull Request #1499 · dotnet/TorchSharp

tolleybot · 2025-10-22T14:18:52Z

Add support for torch.export exported models (#1498)

Implements functionality to load and execute PyTorch models exported via torch.export (.pt2 files), enabling .NET applications to run ExportedProgram models as the PyTorch ecosystem transitions from ONNX to torch.export.

Summary

This PR adds support for loading and running AOTInductor-compiled .pt2 models in TorchSharp using torch::inductor::AOTIModelPackageLoader from LibTorch 2.9+.

Key Points:

✅ Inference-only API (no training support)
✅ Models must be compiled with torch._inductor.aoti_compile_and_package() in Python
✅ 30-40% better latency than TorchScript (according to PyTorch docs)
✅ Compatible with LibTorch 2.9+ which includes AOTIModelPackageLoader symbols

Implementation

Native Layer (C++)

Files:

src/Native/LibTorchSharp/Utils.h - Added AOTIModelPackageLoader header include
src/Native/LibTorchSharp/THSExport.h - C++ API declarations
src/Native/LibTorchSharp/THSExport.cpp - Implementation using torch::inductor::AOTIModelPackageLoader

Key Changes:

// Utils.h - Added header include for all files
#include "torch/csrc/inductor/aoti_package/model_package_loader.h"

// THSExport.cpp - Simple wrapper around AOTIModelPackageLoader
ExportedProgramModule THSExport_load(const char* filename)
{
    auto* loader = new torch::inductor::AOTIModelPackageLoader(filename);
    return loader;
}

void THSExport_Module_run(
    const ExportedProgramModule module,
    const Tensor* input_tensors,
    const int input_length,
    Tensor** result_tensors,
    int* result_length)
{
    std::vector<torch::Tensor> inputs;
    // ... convert inputs
    std::vector<torch::Tensor> outputs = module->run(inputs);
    // ... convert outputs
}

Managed Layer (C#)

Files:

src/TorchSharp/PInvoke/LibTorchSharp.THSExport.cs - PInvoke declarations
src/TorchSharp/Export/ExportedProgram.cs - High-level C# API

API Design:

// Basic usage
using var exported = torch.export.load("model.pt2");
var results = exported.run(input);

// Generic typing for single tensor output
using var exported = torch.export.load<Tensor>("model.pt2");
Tensor result = exported.run(input);

// Generic typing for tuple output
using var exported = torch.export.load<(Tensor, Tensor)>("model.pt2");
var (sum, diff) = exported.run(x, y);

Features:

Implements IDisposable for proper resource cleanup
Generic ExportedProgram<TResult> for type-safe returns
Support for single tensors, arrays, and tuples (up to 3 elements)
run(), forward(), and call() methods (all equivalent)

Testing

Files:

test/TorchSharpTest/TestExport.cs - 7 comprehensive unit tests
test/TorchSharpTest/generate_export_models.py - Python script to generate test models
test/TorchSharpTest/*.pt2 - 6 test models

Test Coverage:

[Fact] public void TestLoadExport_SimpleLinear()       // Basic model
[Fact] public void TestLoadExport_LinearReLU()         // Multi-layer
[Fact] public void TestLoadExport_TwoInputs()          // Multiple inputs
[Fact] public void TestLoadExport_TupleOutput()        // Tuple return
[Fact] public void TestLoadExport_ListOutput()         // Array return
[Fact] public void TestLoadExport_Sequential()         // Complex model
[Fact] public void TestExport_LoadNonExistentFile()    // Error handling

All 7 tests pass successfully.

Dependencies

Updated:

build/Dependencies.props - Updated LibTorch from 2.7.1 to 2.9.0

LibTorch 2.9.0 includes the torch::inductor::AOTIModelPackageLoader implementation that was previously only available in PyTorch source code.

Technical Details

Two .pt2 Formats

PyTorch has two different .pt2 export formats:

Python-only (from torch.export.save()):
- Cannot be loaded in C++
- Uses pickle-based serialization
- NOT supported by this implementation
AOTInductor-compiled (from torch._inductor.aoti_compile_and_package()):
- Can be loaded in C++ via AOTIModelPackageLoader
- Ahead-of-time compiled for specific device
- ✅ Supported by this implementation

Python Model Generation

To create compatible .pt2 files:

import torch
import torch._inductor

model = MyModule()
example_inputs = (torch.randn(1, 10),)

# Export the model
exported = torch.export.export(model, example_inputs)

# Compile with AOTInductor for C++ compatibility
torch._inductor.aoti_compile_and_package(
    exported,
    package_path="model.pt2"
)

Limitations

Inference only: No training, no parameter updates, no gradient computation
Device-specific: Models compiled for CPU cannot run on CUDA and vice versa
No device movement: Cannot move model between devices at runtime
LibTorch 2.9+ required: Older versions don't include AOTIModelPackageLoader

Performance

According to PyTorch documentation, AOTInductor provides:

30-40% better latency compared to TorchScript
Optimized for production inference workloads
Single-graph representation with only ATen-level operations

Testing

# Build
dotnet build src/TorchSharp/TorchSharp.csproj

# Run tests
dotnet test test/TorchSharpTest/TorchSharpTest.csproj --filter "FullyQualifiedName~TestExport"

Migration Guide

For users currently using TorchScript:

Before (TorchScript):

# Python
torch.jit.save(traced_model, "model.pt")

// C#
var module = torch.jit.load("model.pt");
var result = module.forward(input);

After (torch.export):

# Python
import torch._inductor
exported = torch.export.export(model, example_inputs)
torch._inductor.aoti_compile_and_package(exported, package_path="model.pt2")

// C#
using var exported = torch.export.load("model.pt2");
var result = exported.run(input);

References

Issue: Add support for torch.export models #1498
PyTorch torch.export docs: https://docs.pytorch.org/docs/stable/export.html
AOTInductor docs: https://pytorch.org/docs/stable/torch.compiler_aot_inductor.html
PyTorch source: torch/csrc/inductor/aoti_package/model_package_loader.h

Fixes #1498

tolleybot · 2025-10-23T14:11:18Z

@dotnet-policy-service agree

src/Native/LibTorchSharp/THSExport.cpp

tolleybot · 2025-10-30T15:19:52Z

Build Failures : Missing LibTorch 2.9.0 Packages

I believe the CI builds are failing because the build system requires .sha files for LibTorch package validation, and these are missing for LibTorch 2.9.0

Missing SHA files:

❌ Linux: libtorch-cxx11-abi-shared-with-deps-2.9.0+cpu.zip.sha
❌ Windows: libtorch-win-shared-with-deps-2.9.0+cpu.zip.sha
✅ macOS arm64: libtorch-macos-arm64-2.9.0.zip.sha (exists)

Package availability check:

Linux cxx11-abi: 403 error (not published yet)
Windows: Available
macOS arm64: Available

Why my local tests passed: I was building against the PyTorch Python installation at
/opt/homebrew/lib/python3.11/site-packages/torch/ which includes LibTorch 2.9.0 with AOTIModelPackageLoader support

Should we wait for PyTorch to publish all LibTorch 2.9.0 packages?

masaru-kimura-hacarus · 2025-10-31T01:34:44Z

@tolleybot

Missing SHA files:

❌ Linux: libtorch-cxx11-abi-shared-with-deps-2.9.0+cpu.zip.sha
...

Package availability check:

Linux cxx11-abi: 403 error (not published yet)
...

Should we wait for PyTorch to publish all LibTorch 2.9.0 packages?

although i'm not sure about libtorch package naming convention,
- PyTorch Get Started page shows me that libtorch-shared-with-deps-2.9.0+cpu.zip seems cxx11 ABI.
- PyTorch upstream released libtorch-cxx11-abi-shared-with-deps-2.7.1+cpu.zip, but no release for 2.8.0 or later in this package naming.
- OTOH, PyTorch upstream released libtorch-shared-with-deps-2.6.0+cpu.zip or earlier, and released libtorch-shared-with-deps-2.8.0+cpu.zip or later; only 2.7.0 and 2.7.1 are missing in this package naming.

masaru-kimura-hacarus · 2025-10-31T05:15:31Z

@tolleybot

i'll attached a report created by Deep Research enabled Google Gemini 2.5 Pro, to answer "why libtorch-cxx11-abi-shared-with-deps-2.9.0+cpu.zip doesn't exists".
Technical Analysis of the LibTorch ZIP File Naming Convention Change.pdf
- as the executive summary said;
  
  since PyTorch version 2.8.0, filenames in the format libtorch-cxx11-abi-shared-with-deps-VERSION.zip are no longer present, having been replaced by a unified format: libtorch-shared-with-deps-VERSION.zip.
- please don't care the last section titled "引用文献" (which is a Japanese word equivalent to "bibliography") uses some Japanse words, since the initial research is done by Japanese prompt and Google Gemini export feature looks malfunction if translation task involved.

tolleybot · 2025-10-31T13:23:51Z

@masaru-kimura-hacarus Thank you for the detailed investigation and the Gemini Deep Research report! You're absolutely right. I was looking for the wrong package name.

I've just pushed the correct SHA files using the new naming convention. Let's see if the CI builds pass now

tolleybot · 2025-10-31T18:03:15Z

@dotnet-policy-service agree

tolleybot · 2025-12-01T14:37:21Z

👋 Friendly ping on this PR! It's been open for a little while and I wanted to check if there's anything I can do to help move it forward. Happy to address any feedback or make adjustments as needed.

masaru-kimura-hacarus · 2025-12-02T05:15:12Z

@tolleybot

i'm not TorchSharp upstream dev. and don't have right to manage this PR.
PRs i created before were merged by @alinpahontu2912.
- most probably, he can manage if possible.
- i'm also waiting upstream response for my opening PRs, but no joy.

tolleybot · 2026-02-19T18:26:38Z

Rebased onto latest main with libtorch 2.10 backend. Regenerated all .pt2 test models with PyTorch 2.10. Ready for review.

Copilot

Pull request overview

Adds a new TorchSharp integration for running PyTorch torch.export / AOTInductor-packaged .pt2 models (via LibTorch 2.9+ torch::inductor::AOTIModelPackageLoader), enabling inference-only execution from .NET.

Changes:

Introduces native (C++) bindings to load and run .pt2 packages and wires them into TorchSharp via P/Invoke.
Adds a managed torch.export API (ExportedProgram + generic typed returns) to load/run exported programs.
Adds .pt2 test fixtures, a Python generator script, and new unit tests covering basic load/run scenarios.

Reviewed changes

Copilot reviewed 11 out of 17 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
`src/Native/LibTorchSharp/THSExport.h`	Declares native API for loading/running AOTI `.pt2` exported programs.
`src/Native/LibTorchSharp/THSExport.cpp`	Implements the wrapper over `torch::inductor::AOTIModelPackageLoader` and marshals tensor inputs/outputs.
`src/Native/LibTorchSharp/Utils.h`	Adds ExportedProgram module typedef (and currently the AOTI header include).
`src/Native/LibTorchSharp/THSJIT.h`	Exposes helper declarations intended for sharing with export support.
`src/Native/LibTorchSharp/CMakeLists.txt`	Adds new export source/header to the native build.
`src/TorchSharp/PInvoke/LibTorchSharp.THSExport.cs`	Adds P/Invoke declarations for the new native export APIs.
`src/TorchSharp/Export/ExportedProgram.cs`	Adds managed `torch.export.load()` + `ExportedProgram` runtime wrapper and typed-return convenience API.
`test/TorchSharpTest/TestExport.cs`	Adds unit tests covering load/run with single output, multi-input, tuple output, and array output.
`test/TorchSharpTest/generate_export_models.py`	Adds a script to generate AOTInductor-packaged `.pt2` test fixtures.
`test/TorchSharpTest/TorchSharpTest.csproj`	Ensures `.pt2` fixtures are copied to test output directory.
`RELEASENOTES.md`	Notes the new torch.export support under API changes.

Comments suppressed due to low confidence (2)

test/TorchSharpTest/TestExport.cs:75

ExportedProgram<TResult> adds special handling for ValueTuple<,,> (3 tensor outputs), but the current tests only cover single output, Tensor[], and ValueTuple<,>. Add a unit test (and a small generated .pt2 fixture) that returns 3 tensors to ensure the ValueTuple<,,> path works end-to-end.

        public void TestLoadExport_TupleOutput()
        {
            // Test loading a model that returns a tuple
            using var exported = torch.export.load<(Tensor, Tensor)>(@"tuple_out.export.pt2");
            Assert.NotNull(exported);

src/Native/LibTorchSharp/Utils.h:8

Utils.h is included by most native binding files; adding torch/csrc/inductor/aoti_package/model_package_loader.h here makes the entire native build depend on this internal header even when torch.export support isn’t used. Since ExportedProgramModule is just a pointer typedef, consider forward-declaring torch::inductor::AOTIModelPackageLoader and/or moving the include + typedef into THSExport.h to keep compile dependencies localized.

#include "torch/torch.h"
#include "torch/csrc/inductor/aoti_package/model_package_loader.h"

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-24T09:25:43Z

src/TorchSharp/Export/ExportedProgram.cs

+            }
+
+            // Free the native array (tensors are now owned by managed Tensor objects)
+            Marshal.FreeHGlobal(result_ptr);
+


result_ptr is freed with Marshal.FreeHGlobal, but the native side allocates the returned pointer array with C++ new[] (new Tensor[...]). This allocator/free mismatch can crash or corrupt the heap. Expose a native free API that uses delete[] (and call it here), or change the native allocation to malloc/CoTaskMemAlloc to match FreeHGlobal.

Fixed. Added a dedicated THSExport_Module_run_free_results() native function that uses delete[] to free the array. The C# side now calls this instead of Marshal.FreeHGlobal.

Copilot · 2026-02-24T09:25:43Z

src/Native/LibTorchSharp/THSExport.cpp

+        // Allocate output array and copy results
+        *result_length = outputs.size();
+        *result_tensors = new Tensor[outputs.size()];
+


The returned pointer array is allocated with new Tensor[outputs.size()] but there is no corresponding exported API to free it from managed code (and FreeHGlobal is not compatible with new[]). Add an exported free function that delete[]s this array (or switch to a caller-provided allocator callback), and consider using size_t/int64_t for result_length to avoid truncation from outputs.size().

Fixed both issues. Added THSExport_Module_run_free_results() for proper delete[] cleanup, and changed result_length from int to int64_t to avoid truncation.

alinpahontu2912 · 2026-02-27T09:55:20Z

Hey @tolleybot can you also address the copilot comments ? Also, there are some test failures from TestExport

Add functionality to load and execute PyTorch models exported via torch.export (.pt2 files) using AOTInductor compilation, enabling .NET applications to run ExportedProgram models. Native layer: - THSExport.h/.cpp C++ wrappers using AOTIModelPackageLoader API - ExportedProgramModule typedef localized in THSExport.h - CMakeLists.txt updated to include THSExport sources - Proper memory management with dedicated free function for result arrays Managed layer: - LibTorchSharp.THSExport.cs PInvoke declarations - ExportedProgram and ExportedProgram<TResult> classes in Export namespace - torch.export.load() API following PyTorch conventions - Correct allocator pairing (native delete[] via THSExport_Module_run_free_results) Capabilities: - Load .pt2 files compiled with torch._inductor.aoti_compile_and_package() - Inference-only forward pass with type-safe generics - Single tensor, array, and 2/3-tuple output support - IDisposable resource cleanup Tests: - 8 unit tests covering load, execute, multi-input, tuple/list/3-tuple outputs - 7 test .pt2 models generated with PyTorch 2.10 - generate_export_models.py for model regeneration Fixes dotnet#1498

tolleybot · 2026-02-27T14:06:55Z

@alinpahontu2912 Thanks for the review!

Addressed the Copilot comments inline in each thread.

TestExport CI failures — fixed:
The .pt2 test models contain AOTInductor-compiled native code (.so/.dylib), which is platform-specific. The checked-in models were compiled on macOS arm64 and can't load on Linux x64 or Windows x64. Added an ExportTestFactAttribute that skips the model-loading tests on non-matching platforms. The error handling test (TestExport_LoadNonExistentFile) remains platform-independent and runs everywhere.

masaru-kimura-hacarus reviewed Oct 24, 2025

View reviewed changes

src/Native/LibTorchSharp/THSExport.cpp Show resolved Hide resolved

tolleybot force-pushed the tolleybot/1498 branch from 1f64f5b to af266bd Compare October 28, 2025 14:10

tolleybot force-pushed the tolleybot/1498 branch from f5d82b7 to b1c3dac Compare February 19, 2026 18:22

alinpahontu2912 requested a review from Copilot February 24, 2026 09:17

Copilot started reviewing on behalf of alinpahontu2912 February 24, 2026 09:17 View session

Copilot AI reviewed Feb 24, 2026

View reviewed changes

tolleybot force-pushed the tolleybot/1498 branch from b1c3dac to 9427efe Compare February 25, 2026 15:13

tolleybot force-pushed the tolleybot/1498 branch from 9427efe to 4b97275 Compare February 27, 2026 14:06

Conversation

tolleybot commented Oct 22, 2025

Add support for torch.export exported models (#1498)

Summary

Implementation

Native Layer (C++)

Managed Layer (C#)

Testing

Dependencies

Technical Details

Two .pt2 Formats

Python Model Generation

Limitations

Performance

Testing

Migration Guide

References

Uh oh!

tolleybot commented Oct 23, 2025

Uh oh!

Uh oh!

tolleybot commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

masaru-kimura-hacarus commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

masaru-kimura-hacarus commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tolleybot commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tolleybot commented Oct 31, 2025

Uh oh!

tolleybot commented Dec 1, 2025

Uh oh!

masaru-kimura-hacarus commented Dec 2, 2025

Uh oh!

tolleybot commented Feb 19, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

tolleybot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

tolleybot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

alinpahontu2912 commented Feb 27, 2026

Uh oh!

tolleybot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tolleybot commented Oct 30, 2025 •

edited

Loading

masaru-kimura-hacarus commented Oct 31, 2025 •

edited

Loading

masaru-kimura-hacarus commented Oct 31, 2025 •

edited

Loading

tolleybot commented Oct 31, 2025 •

edited

Loading

tolleybot commented Feb 27, 2026 •

edited

Loading