Add approximate parameter to GELU activation function #1548
alinpahontu2912 wants to merge 1 commit into dotnet:main
Conversation
Add support for the 'approximate' parameter in GELU, matching PyTorch's torch.nn.GELU(approximate='tanh') functionality.

Changes:
- Add GELU.Approximate enum with 'none' and 'tanh' values
- Thread the approximate parameter through all layers: native C++, P/Invoke, Tensor methods, functional API, and module factory
- Add new overloads (no breaking changes to the existing API)
- Add a test for the tanh approximation mode

Fixes dotnet#1368

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Pull request overview
Adds support for PyTorch’s approximate mode to GELU (notably "tanh"), threading the option through the native (C++), P/Invoke, Tensor, functional, and module APIs, and adding a regression test.
Changes:
- Introduces Modules.GELU.Approximate (none/tanh) and plumbs it through nn.GELU and nn.functional.gelu.
- Extends Tensor gelu/gelu_ to accept an approximation mode and updates the corresponding native/P/Invoke signatures.
- Adds a unit test validating the tanh approximation path and checking that it differs from the exact mode.
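For reference, the two modes compute genuinely different values: exact GELU is x·Φ(x) with Φ the standard normal CDF, while the tanh mode uses the well-known tanh approximation. A minimal standalone sketch in plain Python (independent of TorchSharp, for illustration only) shows the formulas and why a test can assert the two modes differ:

```python
import math

def gelu_exact(x: float) -> float:
    # Exact GELU: x * Phi(x), where Phi is the standard normal CDF.
    return x * 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def gelu_tanh(x: float) -> float:
    # Tanh approximation:
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))

x = 1.5
print(gelu_exact(x), gelu_tanh(x))
```

The two values are close but not bitwise equal, which is exactly what the regression test in this PR relies on.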
Reviewed changes
Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| test/TorchSharpTest/NN.cs | Adds a test covering GELU tanh approximation behavior. |
| src/TorchSharp/Tensor/Tensor.cs | Adds gelu/gelu_ overloads that pass approximation through to native. |
| src/TorchSharp/PInvoke/LibTorchSharp.THSTensor.cs | Updates P/Invoke signatures to accept the approximation string. |
| src/TorchSharp/NN/Activation/GELU.cs | Adds approximation enum + overloads in module factory and functional API. |
| src/Native/LibTorchSharp/THSTensor.h | Updates native exports for GELU to accept an approximation parameter. |
| src/Native/LibTorchSharp/THSTensor.cpp | Passes approximation through to torch::gelu / torch::gelu_. |
```diff
 [DllImport("LibTorchSharp")]
-internal static extern IntPtr THSTensor_gelu(IntPtr tensor);
+internal static extern IntPtr THSTensor_gelu(IntPtr tensor, [MarshalAs(UnmanagedType.LPStr)] string approximate);

 [DllImport("LibTorchSharp")]
-internal static extern IntPtr THSTensor_gelu_(IntPtr tensor);
+internal static extern IntPtr THSTensor_gelu_(IntPtr tensor, [MarshalAs(UnmanagedType.LPStr)] string approximate);
```
The new P/Invoke declarations for THSTensor_gelu/THSTensor_gelu_ introduce an LPStr string parameter but don’t specify CharSet/BestFitMapping/ThrowOnUnmappableChar like the other LPStr-based imports in this file (e.g., THSTensor_load/meshgrid/div). This can lead to inconsistent marshaling behavior across platforms and re-enables best-fit character mapping. Consider updating these DllImport attributes to match the existing pattern used for other string parameters in LibTorchSharp.THSTensor.cs.
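A sketch of what the reviewer is suggesting, assuming the other LPStr imports in this file use the usual defensive attribute set (the exact values should be copied from the existing declarations rather than from this example):

```csharp
// Hedged sketch: CharSet/BestFitMapping/ThrowOnUnmappableChar values are assumed
// to match the existing LPStr-based imports in LibTorchSharp.THSTensor.cs.
[DllImport("LibTorchSharp", CharSet = CharSet.Ansi, BestFitMapping = false, ThrowOnUnmappableChar = true)]
internal static extern IntPtr THSTensor_gelu(IntPtr tensor, [MarshalAs(UnmanagedType.LPStr)] string approximate);
```

Disabling best-fit mapping and throwing on unmappable characters makes the ANSI marshaling fail loudly instead of silently substituting characters, which is why the rest of the file follows this pattern.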
```diff
+public Tensor gelu(TorchSharp.Modules.GELU.Approximate approximate)
+{
+    var res = NativeMethods.THSTensor_gelu(Handle, approximate == TorchSharp.Modules.GELU.Approximate.tanh ? "tanh" : "none");
+    if (res == IntPtr.Zero)
+        CheckForErrors();
+    return new Tensor(res);
+}

 public Tensor gelu_()
 {
-    var res = NativeMethods.THSTensor_gelu_(Handle);
+    var res = NativeMethods.THSTensor_gelu_(Handle, "none");
     if (res == IntPtr.Zero)
         CheckForErrors();
     return new Tensor(res);
 }

+public Tensor gelu_(TorchSharp.Modules.GELU.Approximate approximate)
+{
+    var res = NativeMethods.THSTensor_gelu_(Handle, approximate == TorchSharp.Modules.GELU.Approximate.tanh ? "tanh" : "none");
```
The Tensor.gelu overloads take TorchSharp.Modules.GELU.Approximate, which is a nested enum on an nn.Module type. That makes a core Tensor API depend on the Modules layer and forces callers of tensor.gelu(...) / functional.gelu(...) to reference Modules.GELU for what is essentially an ATen algorithm option. Consider moving the approximation enum to a more neutral location (e.g., torch.nn or torch) and having the Tensor/functional overloads use that type (keeping the current overload as a forwarding shim if you want to preserve source compatibility).
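One way to realize this suggestion, sketched with illustrative names that are not part of the actual TorchSharp API (the enum name, namespace placement, and forwarding shim are all assumptions):

```csharp
// Hypothetical neutral home for the algorithm option (names are illustrative).
namespace TorchSharp
{
    public static partial class torch
    {
        public enum GeluApproximation
        {
            none,
            tanh
        }
    }
}

// The Tensor overload would then take the neutral type, and the existing
// Modules.GELU.Approximate-based overload could remain as a forwarding shim
// to preserve source compatibility:
public Tensor gelu(torch.GeluApproximation approximate)
    => gelu_native(approximate == torch.GeluApproximation.tanh ? "tanh" : "none");
```

This keeps the core Tensor API free of a dependency on the Modules layer while leaving existing call sites compiling unchanged.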