OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Adnen Abdessaied, Manuel Hochmeister, Andreas Bulling

Journal title: Proc. 31st Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING)

Journal publisher: European Language Resources Association (ELRA)

Published year: 2024