confrisk: preparatory changes by hieblmi · Pull Request #1154 · lightninglabs/loop

hieblmi · 2026-06-19T13:57:07Z

extracted commits from #1141
This PR pulls out general correctness and notification-delivery fixes ahead of the dynamic confirmation work.

It fixes a few independent issues in static address loop-in/deposit handling and hardens notification fanout so slow subscribers cannot
block unrelated notification delivery.

What

Cancel static loop-in invoices when initialization fails before the swap is stored.
Finalize withdrawn deposits asynchronously so final-state cleanup cannot block the FSM.
Fix static loop-in state queries so failed swaps are included in final-state results.
Prevent slow notification subscribers from blocking manager fanout.
Queue must-deliver notification types while keeping optional reservation notifications best-effort.

Why

These changes reduce failure modes that are not specific to dynamic confirmations, but become more important as static loop-ins rely more
heavily on notification-driven flows.

Moving them first keeps the dynamic confirmation PR focused on the actual feature behavior while landing independent fixes separately.

hieblmi · 2026-06-19T13:58:46Z

@claude please review

gemini-code-assist · 2026-06-19T14:00:31Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces preparatory fixes for dynamic confirmation work by improving the robustness of static loop-in and notification delivery flows. The changes focus on preventing system stalls caused by slow subscribers, ensuring proper cleanup of transient resources like invoices during failure scenarios, and improving the reliability of state tracking for loop-in swaps.

Highlights

Notification Fanout Hardening: Implemented a per-subscriber FIFO queue to prevent slow subscribers from blocking the notification manager, and added logic to drop reservation notifications for slow subscribers.
Static Loop-In Invoice Cleanup: Added automatic cancellation of static loop-in invoices when initialization fails before the swap is persisted, preventing orphan invoice accumulation.
Asynchronous Deposit Finalization: Modified deposit finalization to occur asynchronously, ensuring that busy manager loops do not stall withdrawal confirmation.
State Query Improvements: Updated static loop-in state queries to correctly include failed swaps in final-state results.

New Features

🧠 You can now enable Memory (public preview) to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces non-blocking notification delivery for subscribers, asynchronous deposit finalization, and automatic invoice cleanup on initialization failures. Specifically, a per-subscriber FIFO queue is added to prevent slow subscribers from blocking the main loop, and FinalizeDepositAction is updated to run asynchronously. Additionally, InitHtlcAction now ensures that pre-created invoices are canceled if an error occurs before persistence. Feedback on the changes highlights a potential memory leak in the FIFO queue implementation where sliced elements are not zeroed out, which could prevent garbage collection of pointer types.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-19T14:02:26Z

+			ntfn := q.pending[0]
+			q.pending = q.pending[1:]
+			q.Unlock()


In Go, slicing a slice (e.g., q.pending = q.pending[1:]) does not zero out the retired element. If T is a pointer type, the underlying array will retain a reference to the popped notification, preventing it from being garbage collected until that index in the underlying array is overwritten or the queue is garbage collected. To avoid this memory leak, explicitly zero out the popped element before slicing.

ntfn := q.pending[0] var zero T q.pending[0] = zero q.pending = q.pending[1:] q.Unlock()

claude · 2026-06-19T14:03:22Z

Code Review

Overall this is a well-scoped preparatory PR that addresses real failure modes. The fixes are independent and the PR description maps cleanly to the diffs. A few things worth flagging before merge:

`notifications/manager.go` — notification queue

removeSubscriber fix is required for compilation. Adding enqueue func(any) to the subscriber struct makes it non-comparable, so the old s != sub is a compile error in Go. The fix to s.recvChan != sub.recvChan is correct and necessary.

Race between queue goroutine and close(notifChan).
SubscribeStaticLoopInSweepRequests and SubscribeUnfinishedSwaps both close notifChan via context.AfterFunc. The queue goroutine concurrently sends to that same channel. There is a window where the goroutine is in case recvChan <- ntfn: while context.AfterFunc fires close(notifChan), causing a "send on closed channel" panic. The recover() block is there exactly for this reason, but using recover() to absorb a closed-channel panic is a code smell — it would silently swallow a double-close bug too. Consider using a dedicated done chan struct{} that the subscriber owns and closes independently of notifChan, or use sync.Once around the close. At minimum add a comment explaining why the recover is needed (the AfterFunc race), not just what it catches.

Unbounded queue growth.
q.pending is an unbounded slice. A subscriber that never drains (e.g. its read loop is stuck) will accumulate notifications indefinitely. For must-deliver types (StaticLoopInSweep, UnfinishedSwap) this is intentional, but worth a brief comment and perhaps a log.Warnf when the queue depth exceeds some threshold.

queueNotification fallback path is unreachable.
The enqueue == nil branch falls back to a blocking send. Looking at callers, every subscriber that goes through queueNotification (StaticLoopInSweep, UnfinishedSwap) sets enqueue. The nil branch exists for defensive reasons — a comment or a log.Warnf("enqueue is nil, falling back to blocking send") would clarify intent.

Test: TestManager_SlowSubscriberDoesNotBlock.
The test uses require.Len(t, slowChan, 1) at the end. This only works because SubscribeReservations creates a buffered-1 channel and the slow subscriber never reads. The assertion is correct but it silently depends on channel capacity being exactly 1. A brief comment would help.

`staticaddr/loopin/sql_store.go` — `toJointStringStates` fix

This is a subtle but significant bug. The SQL query uses:

(',' || $1 || ',') LIKE ('%,' || u.update_state || ',%')

With the old {...} wrapper, the parameter was e.g. ",{HtlcTimeoutSwept,Succeeded,SucceededTransitioningFailed,Failed},". The curly braces break matching for:

The first element (HtlcTimeoutSwept): surrounded by {... so ,HtlcTimeoutSwept, never appears.
The last element (Failed): trailed by } so ,Failed, never appears (only ,Failed},).

Middle elements like Succeeded worked by coincidence (flanked by real commas). The fix correctly produces a plain comma-separated string that the LIKE pattern can match uniformly.

The new test adds a Failed-state swap and asserts it appears in finalizedSwaps — good direct coverage of the bug.

Minor: result ordering. The test asserts finalizedSwaps[0] is Succeeded and finalizedSwaps[1] is Failed based on insertion order. The SQL has ORDER BY swaps.id, so insertion order should be stable, but it's worth a brief comment in the test explaining the expected order.

`staticaddr/deposit/actions.go` — async `FinalizeDepositAction`

The goroutine is correctly bounded by f.quitChan rather than a request context, and the comment explains the shutdown-race reasoning. A few notes:

If the manager's read loop of finalizedDepositChan is slow or blocked for a long time (not shutdown), the goroutine lingers. This is probably fine in practice but worth a line in the comment.
The action now always returns fsm.NoOp — the previous fsm.OnError on context cancellation is intentionally dropped. The test TestFinalizeDepositActionIgnoresRequestCancellation covers this, but the comment in FinalizeDepositAction could explicitly note this behaviour change.

`staticaddr/loopin/actions.go` — invoice cleanup

The deferred cleanup via the invoiceNeedsCleanup flag and returnError helper is clean. context.WithoutCancel + timeout in cancelSwapInvoice is the right approach to ensure cleanup runs even when the caller's context is already done. The refactoring to reuse cancelSwapInvoice in MonitorInvoiceAndHtlcTxAction is a good simplification.

One readability nit: event is set by returnError and later read inside the defer. The closed-over variable makes the flow non-obvious on first read. A short comment above the defer (e.g. // event is set by returnError below; defer reads it to decide whether to cancel) would help.

Test coverage

Coverage is solid: the three notification tests (SlowSubscriberDoesNotBlock, UnfinishedSwapNotificationWaitsForSubscriber, StaticLoopInSweepNotificationQueuesForSlowSubscriber) directly exercise the new queue semantics. The three FinalizeDepositAction tests cover the non-blocking guarantee, request-context independence, and already-canceled context. The two InitHtlcAction tests verify invoice cancellation on server error and fee-guard failure. No gaps noticed.

Summary

All five issues in the PR description appear correctly fixed. The SQL bug fix in particular was non-obvious and well-caught. Main asks before merge:

Add a comment (or fix the race more cleanly) around the recover() in newNotificationQueue.
Add the ordering comment in TestGetStaticAddressLoopInSwapsByStates.

Neither is a blocker but both would reduce confusion for future readers.

hieblmi · 2026-06-19T14:45:15Z

@claude please re-review after I added queued notification handling of reservations as well.

claude · 2026-06-19T14:49:08Z

Code Review

PR: confrisk: preparatory changes
Summary: This PR extracts correctness fixes from #1141 ahead of the dynamic confirmation feature. It addresses four independent issues: orphaned invoices on init failure, blocking deposit finalization, failed swaps missing from state queries, and slow subscribers blocking notification fanout.

The overall direction is sound — each fix targets a real failure mode, test coverage is strong, and the changes are well-scoped. A few areas worth discussing:

`notifications/manager.go` — Per-subscriber FIFO queue

Positive:

The queue design is clean: a per-subscriber goroutine drains a local []T slice, so a slow subscriber cannot stall the fanout loop.
The removeSubscriber comparison change from s != sub to s.recvChan != sub.recvChan is a required correctness fix, not optional — adding enqueue func(any) to subscriber makes the struct non-comparable in Go, so the old == comparison would be a compile error.

Concerns:

PR description vs. code mismatch — The description says "Queue must-deliver notification types while keeping optional reservation notifications best-effort." But SubscribeReservations also receives an enqueue queue. Only SubscribeHtlcConfirmed remains best-effort (using its existing timeout select). Either the description needs updating or the intent is for reservations to stay best-effort (i.e., SubscribeReservations should not set enqueue).
recover() for closed-channel panic — The goroutine guards against writing to a closed channel using recover():
```
defer func() {
    if recover() != nil {
        log.Debugf("subscriber channel closed before notification delivery")
    }
}()
```
This race arises because context.AfterFunc closes notifChan while the queue goroutine may still be writing to it. Giving the goroutine ownership of the channel's close (close it in the goroutine's deferred cleanup after ctx.Done()) would eliminate the panic path entirely. Using recover() to paper over a write-to-closed-channel panic can also mask unrelated panics in the goroutine.
Unbounded queue — q.pending grows without limit. A subscriber that is consistently slow (e.g., stuck in a long DB operation) will accumulate an ever-growing backlog. Adding a configurable cap with a drop-and-log policy would prevent unbounded memory growth in degraded conditions.
Head element not nil'd after dequeue — After q.pending = q.pending[1:] the old position still holds a reference in the backing array, delaying GC for pointer-typed notifications. Consider:
```
ntfn := q.pending[0]
q.pending[0] = nil   // release pointer for GC
q.pending = q.pending[1:]
```

`staticaddr/loopin/actions.go` — Invoice cleanup on init failure

Positive:

The deferred invoiceNeedsCleanup flag is a clean pattern: it avoids duplicating cancel calls on every error path, and the final invoiceNeedsCleanup = false after successful persistence makes the lifecycle clear.
context.WithoutCancel(ctx) with a 5-second timeout is the right idiom for cleanup that must run even after a canceled request context.

Minor:

cancelSwapInvoice accepts a ctx but immediately discards it via context.WithoutCancel(ctx). The function only uses the passed context to derive a detached one. The function could just take no context argument and use context.Background() directly, making the intent clearer and the caller simpler. Alternatively, the docstring should explain why the parent context is intentionally abandoned.

`staticaddr/deposit/actions.go` — Async finalization

Positive:

Sending on finalizedDepositChan in a goroutine prevents the FSM from stalling while the manager loop is busy holding other locks.
Using quitChan (tied to the FSM lifetime, not the request context) as the cancellation signal is correct — the shutdown comment explains the intended restart behavior clearly.

Test coverage: TestFinalizeDepositActionDoesNotBlock, TestFinalizeDepositActionIgnoresRequestCancellation, and TestFinalizeDepositActionIgnoresCanceledContext together cover the important edge cases well.

`staticaddr/loopin/sql_store.go` — State list query fix

This is a real bug fix. The SQL query uses:

(',' || $1 || ',') LIKE ('%,' || u.update_state || ',%')

The old toJointStringStates produced {State1,State2}, which after wrapping became ',{State1,State2},'. The curly braces broke the comma-separated pattern match for the first and last states (e.g., Succeeded became {Succeeded and Failed became Failed}), silently excluding them from results. The fix (removing {}) is correct.

Minor: The updated sql_store_test.go asserts on specific slice indices (finalizedSwaps[0], finalizedSwaps[1]) relying on the ORDER BY swaps.id clause preserving insertion order. This is reasonable but implicit — a brief comment noting the dependency on the SQL ordering would make the test intent clearer.

Test quality

Tests are well-structured and cover the important scenarios:

Notification queue: slow-subscriber non-blocking, queued delivery for unfinished-swap and sweep notifications.
Invoice cleanup: server error path, fee guard failure path.
Deposit finalization: blocks vs. non-blocking, context cancellation resilience.
State query: failed swap now appears in FinalStates results.

The shared assertQueuedSwapHashNotifications helper is a nice DRY abstraction across the two queue test cases.

Summary

Area	Verdict
SQL state query bug fix	✅ Correct and important
Async deposit finalization	✅ Clean fix, good tests
Invoice cleanup on init failure	✅ Correct, minor naming suggestion
Notification queue design	⚠️ Works, but see: recover() concern, unbounded growth, PR description mismatch
Test coverage	✅ Good

The main items worth resolving before merge are the PR description/code mismatch on reservation notification queuing, and the recover() approach for the closed-channel race.

starius

LGTM! 💾
A couple of comments left.

starius · 2026-06-19T15:59:40Z


 func toJointStringStates(states []fsm.StateType) string {
-	return "{" + strings.Join(toStrings(states), ",") + "}"
+	return strings.Join(toStrings(states), ",")


Good fix! Can you adjust the commit message to explain that now it covers the first and the last states as well, and add a test for first state, please?

Thanks, I added context to the commit message and added tests for it!

starius · 2026-06-19T16:05:34Z

+			return
+		}
+
+		f.cancelSwapInvoice(ctx)


This fix is correct for pre-storage situations.

What happens with an invoice in SignHtlcTx/MonitorInvoiceAndHtlcTx → OnError → UnlockDeposits paths?

good catch, I made UnlockDepositsAction call cancelSwapInvoice() so we always cancel the invoice on failed swaps.

starius · 2026-06-19T17:46:55Z

 		case <-deadlineChan:
 			// If the server didn't pay the invoice on time, we
 			// cancel the invoice and keep monitoring the htlc tx
 			// confirmation. We also need to unlock the deposits to
 			// re-enable them for loop-ins and withdrawals.
 			cancelInvoice()

 			event := f.UnlockDepositsAction(ctx, nil)
 			if event != fsm.OnError {
 				f.Errorf("unable to unlock deposits after " +
 					"payment deadline")
 			}



This guard is dead code. UnlockDepositsAction returns fsm.OnError on both paths — the success path returns fsm.OnError (line 879), and the failure path returns f.HandleError(err), which is also fsm.OnError. So event != fsm.OnError is never true and the "unable to unlock deposits after payment deadline" error is never logged, including when the unlock genuinely fails.

Concretely: when the payment deadline fires we cancel the invoice, try to unlock the deposits, and keep monitoring the htlc. If TransitionDeposits(..., deposit.Deposited) fails here we get no signal for it — the contextual log can't fire — and the FSM keeps running as though the deposits were freed while they actually stay locked until the swap resolves another way (e.g. the htlc timeout sweep). The deposits are the user's funds, so this is a "don't silently fail an operation" issue: low severity, but strictly worse than what's trivially available, since we can just check the real error.

I confirmed the premise on a live regtest run: on a failed init the loop-in FSM entered UnlockDeposits, the deposit transitioned LoopingIn -> Deposited (a successful unlock), and the FSM still emitted OnError -> Failed — i.e. UnlockDepositsAction returns fsm.OnError even on success, which is exactly why this guard can never fire.

Suggested fix: https://gist.github.com/starius/915dee5b117c07f2042170d992274ec6

Thank you, this is now fixed by

extract the actual deposit transition into unlockDeposits(ctx) error,

stop calling UnlockDepositsAction from the payment-deadline path,

call unlockDeposits(ctx) directly there so we can inspect the real error.

A static address loop-in that creates an invoice but then fails before storage cannot be recovered after restart, and persisted error paths that unlock deposits should not leave the swap invoice live either. The payment-deadline path also needs to report the real deposit unlock error instead of checking the FSM event returned by UnlockDepositsAction, because that action returns OnError on both success and failure. Cancel private swap invoices on pre-storage init failures, monitor timeouts, and stored unlock paths using a detached timeout-limited helper. Factor deposit unlocking into an error-returning helper so deadline handling can log transition failures, and add coverage for early-init cancellation plus generic unlock invoice cleanup and error propagation.

Final deposit states should not stall while deposit locks are held, because a blocked manager receive loop can otherwise hold up the deposit FSM; if shutdown happens before notification delivery, startup recovery can still resume from the final state. Send finalization notifications from a goroutine so final states are recorded without waiting on the manager receive loop, and add tests for blocked manager delivery and shutdown races.

The final-state query previously formatted the state list with braces, which broke the comma-delimited match for both boundary entries: the first state was compared with a leading brace and the last state with a trailing brace. Final-state callers need swaps at both ends of the list, including HtlcTimeoutSwept and Failed, to remain visible once they reach a terminal status. Return the final-state list as plain comma-separated state names, update the store comment to match the query format, and extend the store test so it inserts and retrieves swaps in the first final state, a middle final state, and the last final state without relying on result order.

A slow optional notification subscriber must not stall the manager lock, while recovery and sweep work requests still need reliable local delivery to keep daemon progress deterministic. Treating all subscribers as blocking makes best-effort fanout a backpressure source for unrelated required work. Make reservation fanout nonblocking for slow subscribers, keep recovery and sweep notifications on a required cancellation-aware delivery path, and cover both slow best-effort subscribers and required subscribers in manager tests.

Required notification fanout should not block the manager lock, but subscribers still need ordered delivery once brief backpressure clears. Sending must-deliver notifications directly can couple manager progress to subscriber receive timing, while queueing optional reservation notifications would contradict their best-effort delivery semantics. Add bounded per-subscriber queues for must-deliver notifications, let those queues own channel shutdown instead of relying on recover for closed-channel sends, keep reservation fanout best-effort, and cover queued delivery, queue cleanup, and capacity drops in manager tests.

hieblmi · 2026-06-22T09:05:30Z

@claude re-review this please

claude · 2026-06-22T09:09:42Z

Code Review: `confrisk: preparatory changes` (#1154)

This PR bundles five independent correctness fixes ahead of the dynamic confirmation work. The changes are well-motivated and the scope is appropriate. Overall quality is high — the generic queue implementation is clean, the tests are thorough, and the SQL fix resolves a real silent bug. A few issues are worth discussing before merge.

1. `notifications/manager.go` — Non-blocking fanout & per-subscriber queue

Good:

newNotificationQueue[T any] is a well-structured generic. The FIFO queue decouples the hot fanout path from slow subscriber delivery, and the notify chan struct{}{1} idiom efficiently wakes the drain goroutine without accumulating redundant signals.
The zero-value drain (q.pending[0] = zero; q.pending = q.pending[1:]) correctly avoids retaining stale pointers in the backing array.
Moving close(recvChan) from context.AfterFunc into the queue goroutine (defer close(recvChan)) is the right ownership model: one owner, one close.
Switching removeSubscriber to compare s.recvChan != sub.recvChan is a necessary fix now that subscriber holds a func field (making direct struct comparison invalid).

Issues:

[Bug] removeSubscriber comparator is using any interface equality — verify semantics are correct.
The comparison s.recvChan != sub.recvChan compares two any values. Since both sides hold the same channel pointer, this works by interface identity. However, this is fragile because it relies on the fact that recvChan is always set to the same pointer in both the added and removed subscriber. Suggest adding a brief comment (or a test) to document this assumption, since it's subtle. This could also be made explicit by adding a unique ID field to subscriber.

[Minor] Context-cancelled delivery in the queue goroutine.
The goroutine's top-of-loop ctx check is non-blocking:

select {
case <-ctx.Done():
    return
default:
}

This means that if items are in the queue when ctx.Done() fires, the goroutine may deliver one more item to recvChan before the next loop iteration catches the done signal. For must-deliver semantics this is fine, but the behaviour is slightly surprising. A comment noting "partial drain on cancel is intentional" would help future readers.

[Minor] queueNotification blocking fallback.
The fallback path (when enqueue == nil) does a blocking send that could stall the fanout loop:

select {
case recvChan <- ntfn:
case <-sub.subCtx.Done():
}

Since every subscriber constructed via SubscribeUnfinishedSwaps/SubscribeStaticLoopInSweepRequests will have enqueue set, this path is unreachable in practice today. Still, consider replacing the blocking send with a drop + warning, or removing the fallback entirely if there's no plan to create queue-less must-deliver subscribers.

2. `staticaddr/loopin/sql_store.go` — SQL state-query bug fix

This is a real, silent data-loss bug. The SQL filter is:

(',' || $1 || ',') LIKE ('%,' || u.update_state || ',%')

The old format {HtlcTimeoutSwept,Succeeded,...,Failed} added curly braces, which consumed the sentinel commas for the first (HtlcTimeoutSwept) and last (Failed) states in the list. Those two states were silently excluded from every GetStaticAddressLoopInSwapsByStates result. The new comma-only format HtlcTimeoutSwept,Succeeded,...,Failed correctly gets the sentinel commas added by the SQL expression.

The expanded test confirms all four final states (HtlcTimeoutSwept, Succeeded, SucceededTransitioningFailed, Failed) are now returned. This fix is correct and important.

[Suggestion] Add a one-line comment in toJointStringStates explaining why no braces:

// The SQL query wraps this with leading/trailing commas to match complete
// state names; do not add extra delimiters here.
func toJointStringStates(states []fsm.StateType) string {
    return strings.Join(toStrings(states), ",")
}

3. `staticaddr/deposit/actions.go` — Async finalization

The move from a synchronous blocking send (gated on the caller's context) to an async goroutine gated on quitChan is correct. The old code could deadlock if the manager loop was busy when the FSM tried to report completion; the new code breaks that cycle.

[Issue] Goroutine lifetime guarantee.
The spawned goroutine will only exit when either quitChan is closed or finalizedDepositChan is read. If the manager stops reading finalizedDepositChan before closing quitChan (e.g. on an unusual shutdown path), this goroutine will leak indefinitely.

Given that the deposit is already in a final state by this point, the practical risk is low — the FSM will eventually be torn down. But it's worth confirming that every code path that stops the manager also closes quitChan (or drains finalizedDepositChan).

[Minor] The parameter rename ctx context.Context → _ context.Context is consistent with the new implementation and a good cleanup.

4. `staticaddr/loopin/actions.go` — Invoice cleanup & deposit unlock refactor

InitHtlcAction deferred cleanup:
The invoiceNeedsCleanup + returnError pattern is clean and correct. The cleanup window is precisely bounded: it's active only between invoice creation and swap persistence, and the invoiceNeedsCleanup = false line at the success path is easy to follow.

[Issue] UnlockDepositsAction now cancels the invoice — but the inline call in MonitorInvoiceAndHtlcTxAction bypasses this.
The new UnlockDepositsAction calls f.cancelSwapInvoice() before unlocking deposits:

func (f *FSM) UnlockDepositsAction(...) fsm.EventType {
    f.cancelSwapInvoice()
    err := f.unlockDeposits(ctx)
    ...
}

But MonitorInvoiceAndHtlcTxAction on payment deadline calls f.unlockDeposits(ctx) directly:

cancelInvoice()   // cancels the *subscription*, not the invoice itself
err = f.unlockDeposits(ctx)

cancelInvoice() here cancels the lndclient subscription, not the HTLC invoice. The actual CancelInvoice call is skipped on this path. Is this intentional? If a payment deadline fires, the invoice presumably should also be cancelled to prevent the server from paying it after the fact. If this is a pre-existing behaviour that isn't the focus of this PR, please document it as a known gap.

[Minor] cancelSwapInvoice uses context.Background() with a 5-second timeout. This is correct — the action's own context (ctx) may already be done by the time the defer runs. The 5-second bound prevents unbounded hangs. Consider making defaultInvoiceCleanupTimeout slightly more visible (e.g. in the constant block near the other timeouts) so it's easy to tune later.

5. `notifications/manager_test.go` — Test fixes & new coverage

Good:

Fixing testReservationId2 = []byte{0x01, 0x02} → []byte{0x03, 0x04} catches a pre-existing test bug where both IDs were identical, making equality assertions vacuously pass.
assertQueuedSwapHashNotifications is a well-factored helper that avoids duplicating the timing/ordering logic across both queued-notification tests.
TestNotificationQueueDropsAtCapacity with maxPending = 0 correctly tests the always-drop boundary, though 0 is a degenerate input that bypasses the NewManager defaulting logic. A comment noting this would help.

[Minor] Timing sensitivity in TestManager_SlowReservationSubscriberDoesNotBlock.
The test uses time.After(50 * time.Millisecond) to assert a notification was not received. This works on unloaded CI but could produce a flaky false negative on a heavily loaded machine. Consider using require.Never or a short select + default to assert the absence more robustly.

Summary

Area	Verdict
SQL state-query fix (`toJointStringStates`)	✅ Real bug fix, well-tested
Notification queue implementation	✅ Correct; minor documentation gaps
Async deposit finalization	✅ Correct; goroutine leak worth confirming
Invoice cleanup on init failure	✅ Correct; inline `unlockDeposits` bypass worth clarifying
Test coverage	✅ Comprehensive; one minor timing concern
Test data bug fix (`testReservationId2`)	✅ Pre-existing bug correctly fixed

The most actionable items before merge:

Clarify whether the payment-deadline path in MonitorInvoiceAndHtlcTxAction should call cancelSwapInvoice() (currently only calls cancelInvoice() on the subscription).
Confirm quitChan is always closed on all shutdown paths to bound the goroutine from FinalizeDepositAction.
Add a comment to toJointStringStates explaining the no-braces requirement to prevent future regression.

hieblmi changed the title ~~confrisk: preparatory PR~~ confrisk: preparatory changes Jun 19, 2026

gemini-code-assist Bot reviewed Jun 19, 2026

View reviewed changes

hieblmi force-pushed the pre-dyn-conf-fixes branch from adfd82a to 283bc5f Compare June 19, 2026 14:25

hieblmi requested a review from starius June 19, 2026 14:30

hieblmi force-pushed the pre-dyn-conf-fixes branch from f321b97 to cb83ba9 Compare June 19, 2026 14:44

hieblmi self-assigned this Jun 19, 2026

starius approved these changes Jun 19, 2026

View reviewed changes

hieblmi added 4 commits June 22, 2026 10:18

hieblmi force-pushed the pre-dyn-conf-fixes branch from cb83ba9 to e1f0fd0 Compare June 22, 2026 08:26

hieblmi force-pushed the pre-dyn-conf-fixes branch from e1f0fd0 to 0bb06b0 Compare June 22, 2026 09:03

hieblmi merged commit 129d9c1 into lightninglabs:master Jun 22, 2026
10 checks passed

hieblmi deleted the pre-dyn-conf-fixes branch June 22, 2026 11:13

Conversation

hieblmi commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

Uh oh!

hieblmi commented Jun 19, 2026

Uh oh!

gemini-code-assist Bot commented Jun 19, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot commented Jun 19, 2026

Code Review

notifications/manager.go — notification queue

staticaddr/loopin/sql_store.go — toJointStringStates fix

staticaddr/deposit/actions.go — async FinalizeDepositAction

staticaddr/loopin/actions.go — invoice cleanup

Test coverage

Summary

Uh oh!

hieblmi commented Jun 19, 2026

Uh oh!

claude Bot commented Jun 19, 2026

Code Review

notifications/manager.go — Per-subscriber FIFO queue

staticaddr/loopin/actions.go — Invoice cleanup on init failure

staticaddr/deposit/actions.go — Async finalization

staticaddr/loopin/sql_store.go — State list query fix

Test quality

Summary

Uh oh!

starius left a comment

Choose a reason for hiding this comment

Uh oh!

starius Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

hieblmi Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

starius Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

hieblmi Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

starius Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

hieblmi Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

hieblmi commented Jun 22, 2026

Uh oh!

claude Bot commented Jun 22, 2026

Code Review: confrisk: preparatory changes (#1154)

1. notifications/manager.go — Non-blocking fanout & per-subscriber queue

2. staticaddr/loopin/sql_store.go — SQL state-query bug fix

3. staticaddr/deposit/actions.go — Async finalization

4. staticaddr/loopin/actions.go — Invoice cleanup & deposit unlock refactor

5. notifications/manager_test.go — Test fixes & new coverage

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

hieblmi commented Jun 19, 2026 •

edited

Loading

`notifications/manager.go` — notification queue

`staticaddr/loopin/sql_store.go` — `toJointStringStates` fix

`staticaddr/deposit/actions.go` — async `FinalizeDepositAction`

`staticaddr/loopin/actions.go` — invoice cleanup

`notifications/manager.go` — Per-subscriber FIFO queue

`staticaddr/loopin/actions.go` — Invoice cleanup on init failure

`staticaddr/deposit/actions.go` — Async finalization

`staticaddr/loopin/sql_store.go` — State list query fix

hieblmi Jun 22, 2026 •

edited

Loading

Code Review: `confrisk: preparatory changes` (#1154)

1. `notifications/manager.go` — Non-blocking fanout & per-subscriber queue

2. `staticaddr/loopin/sql_store.go` — SQL state-query bug fix

3. `staticaddr/deposit/actions.go` — Async finalization

4. `staticaddr/loopin/actions.go` — Invoice cleanup & deposit unlock refactor

5. `notifications/manager_test.go` — Test fixes & new coverage