chore: Fix flaky FlagSynchronizerSpec polling tests by kinyoklion · Pull Request #486 · launchdarkly/ios-client-sdk

kinyoklion · 2026-03-19T18:03:53Z

Requirements

I have added test coverage for new or changed functionality
I have followed the repository's pull request submission guidelines
I have validated my changes against all supported platform versions

Related issues

Follows up on chore: Fix flaky FlagSynchronizerSpec tests (SDK-2042) #483 and chore: Tweak flaky test #480 which addressed other flaky tests in this file
Fixes the build-ios (15.4.0, macos-14) CI failure: polling_timer_fires__one_second_interval__stops_polling, failed - expected to equal <2>, got <6>
Fixes the build-ios (16.4.0, macos-15) CI failure: streaming_events__event_reported_while_polling__reports_an_event_error, failed - waitUntil(..) expects its completion closure to be only called once

Describe the solution you've provided

Adds a didSignal guard to prevent onSyncComplete callbacks from re-entering and double-signaling the semaphore. For tests that verify polling stopped, captures the call count after stopping and uses Thread.sleep to confirm no further requests occur. For tests that only need to confirm polling started, relaxes exact-count assertions to >= 1. Five tests are fixed:

changeIsOnlineSpec ("stops polling") — Added didSignal guard. The isOnline = false + semaphore.signal() inside the callback was already present in v11. Changed == 1 to >= 1, then captures countAfterStop, sleeps 1.5s, and asserts count unchanged — proving polling actually stopped.
changeIsOnlineSpec ("starts polling") — Added didSignal guard to prevent double-signaling. No other changes to callback body; original test structure preserved (isOnline stays true, cleanup at the end). Changed == 1 to >= 1.
changeIsOnlineSpec ("does not stop polling") — Added didSignal guard. No isOnline = false in callback; original test structure preserved. Changed == 1 to >= 1.
streamingProcessingSpec ("event reported while polling") — Replaced the first waitUntil block with the semaphore + didSignal guard pattern. The original waitUntil { done in ... { _ in done() } ... isOnline = true } failed when polling timer ticks called done() more than once.
pollingTimerFiresSpec ("stops polling") — After stopping polling inside the callback at requestCount == 2, the test now captures the call count, waits 1.5 seconds, and asserts the count hasn't changed — directly proving polling actually stopped. Uses >= 2 instead of == 2 to tolerate in-flight callbacks.

All changes are test-code-only; no application code is modified.

Describe alternatives you've considered

An earlier revision stopped polling (isOnline = false) inside the callbacks for tests #2 and #3 as well, but this introduced a timing dependency: the count staying at exactly 1 relied on the main queue processing the callback before the next 1-second timer tick. The current approach avoids that dependency entirely — the guard prevents re-signaling and >= 1 tolerates any number of extra ticks.

Additional context

For reviewers — human review checklist:

>= 1 in "does not stop polling": This is weaker than the original == 1. The original assertion caught a restart regression (second isOnline = true triggering another immediate flag request). With >= 1, both normal polling and a restart regression satisfy the assertion. The startPolling() guard (flagRequestTimer == nil) in production code is what prevents restarts; this test no longer independently verifies that. If this is a concern, an alternative would be exposing timer state for direct inspection.

didSignal is not explicitly synchronized, matching the existing didCallDone pattern in LDTimerSpec. The guard only needs to prevent the callback body from running a second time — it does not need to be atomic to accomplish this.

Thread.sleep(forTimeInterval: 1.5) is used in both "stops polling" tests. This is generous relative to the 1-second polling interval but could theoretically be tight on very slow CI runners. Let me know if you'd prefer a longer wait or a different verification approach.

pollingTimerFiresSpec and changeIsOnlineSpec "stops polling" both use >= N then verify count stability: There is a small window where an in-flight getFeatureFlags call could land between isOnline = false and the countAfterStop read. The >= N tolerates this, and the 1.5-second sleep assertion proves no further polling occurs regardless of the exact count at stop time.

Link to Devin session: https://app.devin.ai/sessions/120fb9abea7743249e11b5bcbbcd1d8a
Requested by: @kinyoklion

Stop polling inside onSyncComplete callbacks before signaling the semaphore so that no additional timer ticks race with the assertions that follow. Three tests are fixed: 1. pollingTimerFiresSpec ("stops polling") — added an early-return guard for requestCount > 2 so that callbacks already in flight when isOnline is set to false become no-ops. 2. changeIsOnlineSpec ("starts polling") — moved isOnline = false inside the callback before signaling the semaphore, matching the pattern already used by the "stops polling" tests. 3. changeIsOnlineSpec ("does not stop polling") — same fix as #2. Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

devin-ai-integration · 2026-03-19T18:03:57Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

kinyoklion · 2026-03-19T20:25:40Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift

                                // timer ticks from racing with assertions.
                                testContext.flagSynchronizer.isOnline = false
                                semaphore.signal()
+                            } else if requestCount > 2 {


I am concerned about this one, as we want to make sure it stopped polling. But if we ignore requests counts over 2, then we aren't checking that it stopped polling.

Good call — the early-return guard didn't actually verify polling stopped since getFeatureFlagsCallCount is incremented on the mock before the callback runs.

Replaced with a stronger assertion: after stopping polling, capture the count, sleep 1.5 seconds, then verify the count hasn't changed. This actually proves no further polling occurred.

let countAfterStop = testContext.serviceMock.getFeatureFlagsCallCount expect(countAfterStop) >= 2 // Wait briefly to confirm no further polling occurs. Thread.sleep(forTimeInterval: 1.5) expect(testContext.serviceMock.getFeatureFlagsCallCount) == countAfterStop

aside: I don't particularly like this but it does seem like it accomplishes the goal better.

kinyoklion · 2026-03-19T20:27:03Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift


                    DispatchQueue.global().async {
                        testContext = TestContext(streamingMode: .polling, useReport: false) { _ in
+                            // Stop polling inside the callback to prevent further


I am concerned about this as well, if the test it supposed to check that it doesn't stop polling, then stopping polling defeats that. Does the assertion need re-framed.

It is supposed to be something like "polls more than once"?

Agreed — stopping polling inside the callback defeated the test's purpose. Reframed the test to wait for 2 callbacks (proving polling continued after the double-set), then stop and assert:

expect(testContext.serviceMock.getFeatureFlagsCallCount) == 2

If isOnline = true; isOnline = true had restarted polling each time, we'd see more requests (from two independent polling cycles starting). Seeing exactly 2 confirms a single polling cycle ran uninterrupted.

1. pollingTimerFiresSpec ("stops polling"): Remove the early-return guard which didn't actually verify polling stopped. Instead, capture the call count after stopping, wait 1.5 seconds, and assert the count hasn't changed — proving polling truly stopped. 2. changeIsOnlineSpec ("does not stop polling"): Reframe to wait for 2 callbacks (proving polling continued after the double-set of isOnline = true), then stop and verify only 2 requests (not 4, which would indicate a restart). Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion · 2026-03-19T20:45:05Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift


                    DispatchQueue.global().async {
                        testContext = TestContext(streamingMode: .polling, useReport: false) { _ in
+                            // Stop polling inside the callback to prevent further


aside: We just want to know it started polling, so once we know it did, we can stop it.

kinyoklion · 2026-03-19T20:45:31Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift


                    // polling starts by requesting flags
-                    expect(testContext.flagSynchronizer.isOnline) == true
+                    expect(testContext.flagSynchronizer.isOnline) == false


aside: Now it won't be running by this point.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift

Revert the reframing that waited for 2 callbacks and asserted == 2, since that count is ambiguous (normal polling and a restart regression both produce 2 requests). Instead, use a didSignal guard so the callback is a no-op after the first invocation, stop polling before signaling, and assert getFeatureFlagsCallCount == 1. This preserves the original test intent: the second isOnline = true must not trigger another immediate flag request. Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

…oes-not-stop tests Remove isOnline=false from 'starts polling' and 'does not stop polling' callbacks since those tests don't need to verify stop behavior. The didSignal guard prevents double-signaling, and >= 1 assertions tolerate any extra timer ticks without timing dependencies. Only the 'stops polling' test keeps isOnline=false in the callback (it was already there in v11) since that's what it's testing. Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

devin-ai-integration · 2026-03-19T21:41:05Z

You're right — my earlier explanation was wrong. I traced through the actual code and here's what's really going on:

reportSyncComplete dispatches to DispatchQueue.main.async — the callback runs asynchronously on main, not synchronously on syncQueue
LDTimer fires on RunLoop.main, then timerFired() dispatches processTimer to syncQueue.async
The mock's getFeatureFlags is synchronous — increments count and calls completion inline
So: timer fire → syncQueue: getFeatureFlags (count++) → reportSyncComplete → dispatches callback to main

The didSignal guard only prevents the callback body from re-executing. It does NOT prevent getFeatureFlags from being called again by subsequent timer ticks. So you're right — with the guard alone, the count could be > 1 if a timer tick fires before the callback processes on main.

I've simplified the approach:

"starts polling" and "does not stop polling": removed isOnline = false from the callback entirely. Just use didSignal guard (for single semaphore signal) + >= 1 assertion (tolerates extra timer ticks). Original test structure preserved — isOnline stays true, cleanup at the end.
"stops polling" (changeIsOnlineSpec): keeps isOnline = false in callback (was already in v11, and that's what it's testing) + guard + >= 1.
pollingTimerFiresSpec: unchanged (Thread.sleep verification).

kinyoklion · 2026-03-19T21:46:36Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift

                    expect(testContext.flagSynchronizer.isOnline) == false
                    expect(testContext.flagSynchronizer.streamingMode) == .polling
-                    expect(testContext.serviceMock.getFeatureFlagsCallCount) == 2
+                    let countAfterStop = testContext.serviceMock.getFeatureFlagsCallCount


I feel like there is still a race condition here. Similar to other cases we don't know that we will not get one more request to getFeatureFlags. Is there any way to know that the polling has finished stopping?

Good question. Yes, there is a window where one more getFeatureFlags call could sneak through — between isOnline = false in the callback and the countAfterStop read. That's why countAfterStop uses >= 2 instead of == 2: it tolerates in-flight calls.

The key assertion is the second one: after the 1.5-second sleep, getFeatureFlagsCallCount == countAfterStop. Whatever the count was right after stopping (2, 3, etc.), it shouldn't change during the sleep. If it does, polling didn't actually stop.

So the test doesn't claim to know the exact count at stop time — it just verifies the count stabilizes, which proves polling stopped.

Also: the CI failure on this push was a different flaky test (event_reported_while_polling) with the same root cause — waitUntil callback called multiple times by polling timer ticks. I've fixed that one too (same semaphore + didSignal guard pattern) in the latest push.

Replace waitUntil with semaphore + didSignal guard to prevent the onSyncComplete callback from calling done() multiple times when polling timer ticks fire between the initial callback and the waitUntil completion. Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion · 2026-03-19T22:05:37Z

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift

                    expect(testContext.flagSynchronizer.isOnline) == false
                    expect(testContext.flagSynchronizer.streamingMode) == .polling
-                    expect(testContext.serviceMock.getFeatureFlagsCallCount) == 1
+                    expect(testContext.serviceMock.getFeatureFlagsCallCount) >= 1


>= 1 would also be true if it kept polling. So we need some upper bounds check.

Good call — added the same capture-count-then-sleep pattern from pollingTimerFiresSpec. Now captures countAfterStop, sleeps 1.5s, and asserts getFeatureFlagsCallCount == countAfterStop. This proves polling actually stopped rather than just checking >= 1.

Capture count after stopping, sleep 1.5s, verify count unchanged. Same pattern as pollingTimerFiresSpec — proves polling actually stopped rather than just checking >= 1 which would pass if polling continued. Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion · 2026-03-19T22:52:38Z

aside: We should try to develop a more deterministic way of interacting with these components in the future. I think ideally we could have an async queue that tracks everything that happens, and then we verify things on that queue by waiting for specific conditions.

This is mostly how the JS equivalent tests work. But we would likely need some additional API surface. For example callbacks to know when something has actually stopped. Or async stop methods.

devin-ai-integration bot assigned kinyoklion Mar 19, 2026

devin-ai-integration bot added the devin-pr label Mar 19, 2026

kinyoklion commented Mar 19, 2026

View reviewed changes

devin-ai-integration bot and others added 4 commits March 19, 2026 20:29

chore: trigger CI re-run (1 of 3)

5e84839

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI re-run (2 of 3)

96376d6

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI re-run (3 of 3)

0433d4e

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion commented Mar 19, 2026

View reviewed changes

kinyoklion marked this pull request as ready for review March 19, 2026 20:51

kinyoklion requested a review from a team as a code owner March 19, 2026 20:51

cursor bot reviewed Mar 19, 2026

View reviewed changes

LaunchDarkly/LaunchDarklyTests/ServiceObjects/FlagSynchronizerSpec.swift Outdated Show resolved Hide resolved

kinyoklion marked this pull request as draft March 19, 2026 21:09

devin-ai-integration bot and others added 2 commits March 19, 2026 21:09

chore: Add didSignal guard to remaining polling tests

341b70d

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion commented Mar 19, 2026

View reviewed changes

devin-ai-integration bot and others added 5 commits March 19, 2026 21:49

chore: trigger CI re-run (1 of 3)

3684736

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI re-run (2 of 3)

6ced240

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI re-run (3 of 3)

be6a5d2

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI re-run (2 of 3)

8e87633

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion commented Mar 19, 2026

View reviewed changes

devin-ai-integration bot and others added 3 commits March 19, 2026 22:06

chore: trigger CI stress test (1 of 3)

2beb73e

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI stress test (2 of 3)

3a938ce

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

devin-ai-integration bot and others added 4 commits March 19, 2026 22:24

chore: trigger CI stress test (3 of 3)

71176ca

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI stress test round 2 (1 of 3)

630525f

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI stress test round 2 (2 of 3)

1e1a793

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

chore: trigger CI stress test round 2 (3 of 3)

5d47632

Co-Authored-By: rlamb@launchdarkly.com <kingdewman@gmail.com>

kinyoklion marked this pull request as ready for review March 19, 2026 22:55

tanderson-ld approved these changes Mar 23, 2026

View reviewed changes

kinyoklion merged commit 648921c into v11 Mar 23, 2026
16 checks passed

kinyoklion deleted the devin/1773943260-fix-flaky-flag-synchronizer-tests branch March 23, 2026 16:58

Conversation

kinyoklion commented Mar 19, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

devin-ai-integration bot commented Mar 19, 2026

🤖 Devin AI Engineer

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

devin-ai-integration bot commented Mar 19, 2026

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

kinyoklion commented Mar 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kinyoklion commented Mar 19, 2026 •

edited by devin-ai-integration bot

Loading