Multi-Task Video Captioning with Video and Entailment Generation

Preprint English
Pasunuru, Ramakanth; Bansal, Mohit
  • Subject: Computer Science - Computation and Language | Computer Science - Computer Vision and Pattern Recognition | Computer Science - Artificial Intelligence

Video captioning, the task of describing the content of a video, has seen promising improvements in recent years with sequence-to-sequence models, but accurately learning the temporal and logical dynamics involved in the task remains a challenge, especially g...
