
perf: Batch metadata query to avoid N+1 across 5 call sites#232

Open
KRRT7 wants to merge 8 commits into microsoft:main from KRRT7:perf/batch-metadata-query

Conversation

Contributor

@KRRT7 KRRT7 commented Apr 10, 2026

Stack: 4/4 — depends on #230. Merge #231, #229, #230, then this PR.


  • Five call sites used get_item() per scored ref — one SELECT and full deserialization per match (N+1 pattern)
  • Added get_metadata_multiple to ISemanticRefCollection that fetches only semref_id, range_json, knowledge_type in a single batch query
  • Replaced the N+1 loop with one get_metadata_multiple call at each site
  • Further optimized scope-filtering: binary search in contains_range, inline tuple comparisons in TextRange, skip pydantic validation in get_metadata_multiple
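A minimal sketch of what such a batched metadata query can look like, assuming an illustrative `SemanticRefs` table (the real schema and API in this PR may differ):

```python
import json
import sqlite3

def get_metadata_multiple(conn, ordinals):
    """Fetch metadata for many semantic refs in one SELECT.

    Only semref_id, range_json, and knowledge_type are read; the expensive
    knowledge_json column is never touched. Table and column names here are
    assumptions for illustration.
    """
    placeholders = ",".join("?" * len(ordinals))
    rows = conn.execute(
        f"SELECT semref_id, range_json, knowledge_type"
        f" FROM SemanticRefs WHERE semref_id IN ({placeholders})",
        list(ordinals),
    ).fetchall()
    by_id = {sid: (json.loads(range_json), ktype) for sid, range_json, ktype in rows}
    return [by_id[o] for o in ordinals]  # preserve caller ordering
```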

Call sites optimized

  1. lookup_term_filtered — batch metadata, filter by knowledge_type/range
  2. lookup_property_in_property_index — batch metadata, filter by range scope
  3. SemanticRefAccumulator.group_matches_by_type — batch metadata, group by knowledge_type
  4. SemanticRefAccumulator.get_matches_in_scope — batch metadata, filter by range scope
  5. get_scored_semantic_refs_from_ordinals_iter — two-phase: metadata filter then batch fetch

Additional optimizations

  • Binary search in TextRangeCollection.contains_range: replaced O(n) linear scan with bisect_right keyed on start, reducing scope-filtering from ~25ms to ~9ms
  • Inline tuple comparisons in TextRange: replaced TextLocation allocations in __eq__/__lt__/__contains__ with a shared _effective_end returning tuples
  • Skip pydantic validation in get_metadata_multiple: construct TextLocation/TextRange directly from JSON instead of going through __pydantic_validator__
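The `bisect_right` idea can be sketched on a simplified representation (parallel sorted start/end lists standing in for `TextRangeCollection`; assumes sorted, non-overlapping ranges):

```python
from bisect import bisect_right

def contains_range(starts, ends, probe_start, probe_end):
    """Binary-search membership test over sorted, non-overlapping ranges.

    starts/ends are parallel sorted lists describing [start, end] ranges.
    bisect_right locates the last range whose start <= probe_start in
    O(log n), replacing the old linear scan over every range.
    """
    i = bisect_right(starts, probe_start) - 1
    return i >= 0 and probe_end <= ends[i]
```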

Benchmark

Azure Standard_D2s_v5 — 2 vCPU, 8 GiB RAM, Python 3.13

Query (pytest-async-benchmark pedantic, 200 rounds)

200 matches against a 200-message indexed SQLite transcript. Only the function under test is timed.

| Function | Before (median) | After (median) | Speedup |
| --- | --- | --- | --- |
| lookup_term_filtered | 2.650 ms | 1.184 ms | 2.24x |
| group_matches_by_type | 2.428 ms | 978 μs | 2.48x |
| get_scored_semantic_refs_from_ordinals_iter | 2.541 ms | 2.946 ms | 0.86x |
| lookup_property_in_property_index | 25.306 ms | 9.365 ms | 2.70x |
| get_matches_in_scope | 25.011 ms | 9.160 ms | 2.73x |
Reproduce the benchmark locally:

```shell
pip install 'pytest-async-benchmark @ git+https://github.com/KRRT7/pytest-async-benchmark.git@feat/pedantic-mode' pytest-asyncio
python -m pytest tests/benchmarks/test_benchmark_query.py -v -s
```

Generated by codeflash optimization agent

KRRT7 added 7 commits April 10, 2026 15:49
Add add_terms_batch / add_properties_batch to the index interfaces
with executemany-based SQLite implementations. Restructure
add_metadata_to_index_from_list and add_to_property_index to collect
all items first, then batch-insert via extend() and the new batch
methods. Eliminates ~1000 individual INSERT round-trips during
indexing.
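A minimal sketch of the executemany-based batch insert this commit describes (table and column names are assumptions, not the real schema):

```python
import sqlite3

def add_terms_batch(conn, rows):
    """Insert many (term, semref_id) rows in a single executemany call
    instead of one INSERT round-trip per item. The TermIndex table here
    is illustrative only."""
    conn.executemany(
        "INSERT INTO TermIndex (term, semref_id) VALUES (?, ?)", rows
    )
```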
Rename _collect_{facet,entity,action}_{terms,properties} to drop the
leading underscore in propindex.py and semrefindex.py.
Change list to Sequence in add_terms_batch and add_properties_batch
interfaces and implementations to satisfy covariance. Add missing
add_terms_batch to FakeTermIndex in conftest.py.
lookup_term_filtered called get_item() per scored ref — one SELECT and
full deserialization per match. The filter only needs knowledge_type
(a plain column) and range (json.loads of range_json), never the
expensive knowledge_json deserialization (64% of per-row cost).

Add get_metadata_multiple to ISemanticRefCollection that fetches only
semref_id, range_json, knowledge_type in a single batch query. Replace
the N+1 loop in lookup_term_filtered with one get_metadata_multiple call.

Benchmark (200 matches, 200 rounds): 4.38ms → 1.32ms (3.3x speedup).
Apply the same get_metadata_multiple pattern from lookup_term_filtered
to four more sites that called get_item() in a loop:

- propindex.lookup_property_in_property_index: filter by .range
- SemanticRefAccumulator.group_matches_by_type: group by .knowledge_type
- SemanticRefAccumulator.get_matches_in_scope: filter by .range
- answers.get_scored_semantic_refs_from_ordinals_iter: two-phase
  metadata filter then batch get_multiple for matching full objects

All sites now use a single batch query instead of N individual SELECTs,
skipping knowledge_json deserialization where only range or
knowledge_type is needed.
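The two-phase shape used in get_scored_semantic_refs_from_ordinals_iter can be sketched generically (every name and callable below is an illustrative stand-in, not the real API):

```python
def scored_refs_two_phase(ordinals, get_metadata_multiple, get_multiple, keep):
    """Phase 1: filter on cheap metadata only. Phase 2: one batch fetch of
    full objects, limited to the survivors of the filter."""
    survivors = [
        o for o, meta in zip(ordinals, get_metadata_multiple(ordinals)) if keep(meta)
    ]
    return get_multiple(survivors)
```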
…arisons

- Use bisect_right with key=start in TextRangeCollection.contains_range
  to skip O(n) linear scan (O(log n) for non-overlapping point ranges)
- Replace TextLocation allocations in TextRange __eq__/__lt__/__contains__
  with a shared _effective_end returning tuples
- Skip pydantic validation in get_metadata_multiple by constructing
  TextLocation/TextRange directly from JSON
@KRRT7 KRRT7 force-pushed the perf/batch-metadata-query branch from f9a489b to 743d8db Compare April 10, 2026 20:50
@bmerkle bmerkle self-assigned this Apr 11, 2026
Collaborator

@bmerkle bmerkle left a comment


@KRRT7 please check my comments

```python
    assert set(rowdict) == set(arg)
    return [self._deserialize_semantic_ref_from_row(rowdict[ordl]) for ordl in arg]

async def get_metadata_multiple(
```

Missing bounds check in get_metadata_multiple

get_multiple validates ordinals are in bounds before querying. get_metadata_multiple in collections.py:344 does not — if an ordinal is missing from the DB, rowdict[o] will KeyError. Should either validate or handle missing keys gracefully, mirroring the existing get_multiple pattern.
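One way to mirror get_multiple's bounds handling, sketched with illustrative names rather than the actual collections.py API:

```python
def metadata_rows_in_order(rowdict, ordinals):
    """Validate before indexing: a missing ordinal raises a descriptive
    error instead of a bare KeyError from rowdict[o]."""
    missing = [o for o in ordinals if o not in rowdict]
    if missing:
        raise ValueError(f"semantic ref ordinal(s) out of bounds: {missing}")
    return [rowdict[o] for o in ordinals]
```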

```python
    semantic_ref_ordinal: SemanticRefOrdinal | ScoredSemanticRefOrdinal,
) -> None: ...

async def add_properties_batch(
```

In propindex.py:85 the SQLite implementation imports make_property_term_text / split_property_term_text from storage.memory.propindex. This creates a cross-layer dependency (sqlite → memory). These helpers should be extracted to a shared location (e.g., the base knowpro/propindex or a utils module).

```python
    await add_to_property_index(conversation, 0)


def collect_facet_properties(
```

collect_* functions duplicate logic from existing add_*_to_index functions

propindex.py adds collect_entity_properties, collect_action_properties, collect_facet_properties that duplicate the logic of the existing add_entity_properties_to_index, add_action_properties_to_index, add_facet_to_index. Same duplication occurs in semrefindex.py with collect_entity_terms / collect_action_terms / collect_facet_terms. The old add_* functions should be refactored to use the new collect_* functions + batch add, rather than keeping both versions.
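A sketch of the suggested refactor, with a fake index and illustrative collector logic (not the real propindex code):

```python
class FakeIndex:
    """Minimal stand-in for the property index batch API."""
    def __init__(self):
        self.rows = []

    def add_properties_batch(self, rows):
        self.rows.extend(rows)

def collect_facet_properties(entity):
    # Pure collector: derive (name, value) pairs without touching the index.
    return [("facet", f"{f['name']}={f['value']}") for f in entity.get("facets", [])]

def add_facet_to_index(index, entity, semref_id):
    # Old single-item entry point delegates to the collector plus the batch
    # API, so the facet-walking logic lives in exactly one place.
    index.add_properties_batch(
        [(name, value, semref_id) for name, value in collect_facet_properties(entity)]
    )
```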

```python
async def test_benchmark_lookup_term_filtered(async_benchmark):
    """Benchmark lookup_term_filtered with N+1 get_item pattern."""
    settings = make_settings()
    tmpdir = tempfile.mkdtemp()
```

The temp directory is created manually and only cleaned up in a finally block inside the benchmark; if create_indexed_transcript raises before that, the cleanup never runs. Use the pytest tmp_path fixture instead.
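A sketch of the tmp_path version (the body is reduced to a stand-in; the real test would call make_settings / create_indexed_transcript as before):

```python
# tmp_path is a built-in pytest fixture yielding a fresh per-test
# pathlib.Path that pytest cleans up itself, even when setup raises.
def test_benchmark_lookup_term_filtered(tmp_path):
    db_path = tmp_path / "transcript.db"  # no mkdtemp / try-finally needed
    db_path.write_text("")                # stand-in for create_indexed_transcript
    assert db_path.exists()
```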

Contributor Author

KRRT7 commented Apr 14, 2026

@bmerkle Heads up — #230 now includes a refactor that will cause merge conflicts with this PR (semrefindex.py, sqlite/propindex.py, propindex.py all overlap). We'll hold off on updating this PR until #230 lands, then rebase on top.
