نشرت في: 2020
In Visual Dialog, an agent has to parse temporal context in the dialog history and spatial context in the image to hold a meaningful dialog with human...
3D model retrieval has been widely utilized in numerous domains, such as computer-aided design, digital entertainment, and virtual reality. Recently,...
Cross-modal retrieval aims to retrieve data in one modality by a query in another modality, which has been a very interesting research issue in the fi...
Visual structure and syntactic structure are essential in images and texts, respectively. Visual structure depicts both entities in an image and their...
نشرت في: 2021
Image aesthetics assessment aims to endow computers with the ability to judge the aesthetic values of images, and its potential has been recognized in...