Published: 2023
A large portion of the global population generates various multimedia data such as texts, images, videos, etc. One of the most common categories which...
Infrared images are full of energy information, which can intuitively reflect the difference between objects and scenes. Visible images are full of co...
The visual commonsense reasoning (VCR) task leads to a cognitive level of understanding between the visual and linguistic domains. Three sub-tasks, i.e., , ,...
Most recent learning algorithms for single image dehazing are designed to train with paired hazy and corresponding ground truth images, typically synt...
In recent years, deep learning has become very popular and its range of applications has kept growing, but it relies heavily on a large number of label...