Event box

Multimodal AI - Visual tool calling

Multimodal AI models can include visual tools that enable them to manipulate images or retrieve external information. A zoom tool can be used to focus on a specific section of a painting. A reverse image search tool can find similar images across the Web. This visual search can retrieve metadata that improves the recognition and interpretation of visual information. We will begin with existing visual tools and consider which additional tools could aid research.

 

Image: Elise Racine & Digit / Woven Dialogues / Licenced by CC-BY 4.0

Date:
Wednesday, February 25, 2026
Time:
3:00pm - 4:00pm
Location:
Commons Library Classroom (D112)
Campus:
Commons Library
Audience:
  Princeton Student  
Categories:
  Data & Computation  

Registration is required. There are 23 in-person seats available. There are 10 online seats available.

To request disability-related accommodations for this event, please contact pulcomm@princeton.edu at least 3 working days in advance.