Challenges
The workshop will host two challenges on tasks that are crucial to enable real-world vision-based assistants. These challenges are designed to test both the low-level visual capabilities and higher-level reasoning skills of vision-based assistants.
Challenge 1: Interactive Feedback Generation
This challenge focuses on assisting users through a workout session with interactive feedback.
Details: [Coming soon!] (also see here and here) .
Challenge 2: Interactive Question Answering
This challenge tests the ability of vision-based assistants to answer questions asked by a user in a face-to-face setup.
Details: [Coming soon!]