Revert "OCPBUGS-77947: fix(tnf): do not block on stable revision after external etcd transition" #1572
@jaypoulz: This pull request references Jira Issue OCPBUGS-77947, which is invalid:
The bug has been updated to refer to the pull request using the external bug tracker.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/jira refresh |
|
@jaypoulz: This pull request references Jira Issue OCPBUGS-77947, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (dhensel@redhat.com), skipping review request.
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: eggfoobar, jaypoulz The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/verified bypass |
|
@eggfoobar: The DetailsIn response to this:
|
/retest required |
|
/retest-required |
|
@jaypoulz: all tests passed! Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
@jaypoulz: Jira Issue Verification Checks: Jira Issue OCPBUGS-77947. Jira Issue OCPBUGS-77947 has been moved to the MODIFIED state and will move to the VERIFIED state when the change is available in an accepted nightly payload. |
Reverts #1562
The original PR (#1562) was intended to speed up recovery after node replacement. After analysis, it does accomplish that goal, with an approximate gain of 1-2 minutes while the node comes up. The cost, however, is that we've introduced a race condition: the node installer needs to run at least once on the replaced node before TNF's podman-etcd can start the etcd container. If it hasn't run, podman-etcd fails because the certs directory is missing. While this would most likely resolve itself eventually (since podman-etcd retries starting etcd multiple times), it's safer to handle this in CEO. Furthermore, the concern about this blocking TNF updates indefinitely was unfounded. That aspect (seen in the node recovery job and fixed by openshift/origin#30846) was actually broken by #1554.