This event has passed.
Animal Behavior Video Analysis Working Group
January 19 @ 3:00 pm - 5:00 pm
Title: Multimodal Learning from Pixels to People
Presenter: Carl Vondrick
Abstract: People experience the world through modalities of sight, sound, words, touch, and more. By leveraging their natural relationships and developing multimodal learning methods, my research creates artificial perception systems with diverse skills, including spatial, physical, logical, and cognitive abilities, for flexibly analyzing visual data. This multimodal approach provides versatile representations for tasks like 3D reconstruction, visual question answering, and object recognition, while offering inherent explainability and excellent zero-shot generalization across tasks. By closely integrating diverse modalities, we can overcome key challenges in machine learning and enable new capabilities for computer vision, especially for the many upcoming applications where trust is required.
Join Zoom Meeting:
https://columbiauniversity.zoom.us/j/96127949475?pwd=TWxLa3A3a3lBRjdqbDBWMkRycHFMZz09
Meeting ID: 948 4868 7512
Passcode: 446335