Published: 2023
In tile-based panoramic video streaming, the Field of View (FOV) is composed of multiple real-time synchronized visible video tiles. The common pa...
Point-level temporal action localization (PTAL) aims to locate action instances in untrimmed videos with only one timestamp annotation for each action...
Since diffusion-weighted imaging (DWI) images are high-dimensional medical images with rich texture features, existing traditional zero-watermarking a...
Nowadays, deep convolutional neural networks (CNNs) are mostly applied to image Super-Resolution (SR). However, there are some disadvantages of usi...
Visual Question Answering (VQA) is a challenging task that requires a fine-grained understanding of both the visual content of images and the textual...