From 393903835decd952508258c58c633b0fa235c1c4 Mon Sep 17 00:00:00 2001
From: AD2000X
Date: Wed, 3 Jun 2026 10:27:27 +0100
Subject: [PATCH] docs: confirm Step 7d smoke result in DEVLOG (285 OK / 1 WARN, n=286)

Replaces the predicted "should report" wording with the confirmed Colab
result: 285 OK / 1 WARN (0.35%), val_000670_table_1 sole WARN (no row
boxes). Phase 2 acceptance gate passed; ready to merge.
---
 DEVLOG.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/DEVLOG.md b/DEVLOG.md
index 76b4ff2..3e5e30f 100644
--- a/DEVLOG.md
+++ b/DEVLOG.md
@@ -202,9 +202,9 @@ Decisions outgrow this file, split them into `DECISIONS.md` (or `docs/adr/`).
   matched@0.75 recall 0.880 / precision 0.895. On 200 table-free pages, final crop false
   positives were 13/200 (6.5%) and fallback fired 0/200.
 - **Structure handoff:** n=50 crop smoke improved from 37 OK / 13 WARN before band dedup to
-  50 OK / 0 WARN after band dedup. The old n=286 full smoke exposed one empty-grid crop
-  (`val_000670_table_1`, no row boxes); the validator is now tightened so rerunning Step 7d
-  should report 285 OK / 1 WARN, still under the <=5% WARN gate.
+  50 OK / 0 WARN after band dedup. Step 7d full-crop smoke confirmed (seed=42, n=286):
+  **285 OK / 1 WARN** (0.35%); the sole WARN is `val_000670_table_1` (rows=0, no row boxes
+  detected), well under the <=5% WARN gate.
 - **Scope caveat:** these are fixed-subset Phase 2 diagnostics, not whole-DocLayNet AP. The
   crop->TATR smoke validates grid geometry compatibility, not OCR text/content quality.
 - **Files/Commits:** `src/bbox_utils.py`, `src/layout_parsing.py`, `src/layout_detector.py`,