Challenges

The workshop will host two challenges on tasks that are crucial to enable real-world vision-based assistants. These challenges are designed to test both the low-level visual capabilities and higher-level reasoning skills of vision-based assistants.

Challenge 1: Interactive Feedback Generation

This challenge focuses on assisting users through a workout session with interactive feedback.

Details: [Coming soon!] (also see here and here) .

Challenge 2: Interactive Question Answering

This challenge tests the ability of vision-based assistants to answer questions asked by a user in a face-to-face setup.

Details: [Coming soon!]