Batch Open-vocabulary Detection with Grounding Models by NetZissou · Pull Request #21 · Imageomics/hpc-inference

NetZissou · 2025-11-21T04:00:10Z

Add a batch pipeline that takes

(a) an image corpus (folder or Parquet of binary images/URIs) and,
(b) one or more text labels, and returns detection boxes (with scores + optional masks) for each image/label using an open-vocabulary grounding model such as OWLv2

Close #18

…the original image

Face Detection

Merge Animal Detection

…ated authors list; updated project URL sections

- Added initial implementation using OWLv2 for zero-shot batch detection using text labels - Added SLURM scripts and config templates

- modified `ParquetImageDataset` & `ImageFolderDataset` to add option for returning image size, which will later used for OWLv2 detection parse - replaced PIL transform with Owlv2Processor wrapper, offload preprocessing from the main thread to the sub-workers - added `owlv2_collate` to collate output from Dataset object

NetZissou and others added 15 commits July 18, 2025 14:53

added face detection job scripts

cc3035e

re-modeled face detection scripts to adapt more YOLO detection tasks

8bd9587

organized config templates into folders

3e7aac8

added animal detection scripts and templates

9c33390

added detection vis utility functions to plot detection boxes on top …

81f3a77

…the original image

Merge pull request #14 from Imageomics/feature/face_detection

3d8c098

Face Detection

Merge pull request #15 from Imageomics/feature/animal_detection

eb04a2a

Merge Animal Detection

update optional dependency

1276c41

added initial draft for animal detection guide

641c1d8

optmized base detector source code

b86ffd4

updated animal detection slurm template

ae2f604

added pynvml as dependency

4ed9fb0

updated default py version to 3.10; support only py310 and above; upd…

fd38309

…ated authors list; updated project URL sections

Initial commit for zero-shot detection using OWLv2

8b40ea4

- Added initial implementation using OWLv2 for zero-shot batch detection using text labels - Added SLURM scripts and config templates

NetZissou requested review from egrace479 and thompsonmj November 21, 2025 04:00

NetZissou self-assigned this Nov 21, 2025

NetZissou added the enhancement New feature or request label Nov 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch Open-vocabulary Detection with Grounding Models#21

Batch Open-vocabulary Detection with Grounding Models#21
NetZissou wants to merge 15 commits intomainfrom
feature/detection_grounding

NetZissou commented Nov 21, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

NetZissou commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

NetZissou commented Nov 21, 2025 •

edited

Loading