publié le: 2023
Nowadays, deep convolutional neural networks (CNNs) are mostly applied for image Super-Resolution (SR). But still, there are some disadvantages of usi...
Visual Question Answering (VQA) is a challenging task that requires a fine-grained understanding of both the visual content of images and the textual...
Network video typically contains a variety of information that is used by the video caption model to generate video tags. The process of creating vide...
Dynamic facial expression recognition is important for human-computer interaction. Learning the temporal dynamic representation of facial expressions...
The self-distillation methods can transfer the knowledge within the network itself to enhance the generalization ability of the network. However, due...