VLCAP: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
More Like this
No document suggestions found
An official website of the United States government