Notice détaillée

Image and audio caps

automated captioning of background sounds and images using deep learning

Article Ecrit par: Poongodi, M. ; Hamdi, Mounir ; Wang, Huihui ;

Résumé: Image recognition based on computers is something human beings have been working on for many years. It is one of the most difficult tasks in the field of computer science, and improvements to this system are made when we speak. In this paper, we propose a methodology to automatically propose an appropriate title and add a specific sound to the image. Two models have been extensively trained and combined to achieve this effect. Sounds are recommended based on the image scene and the headings are generated using a combination of natural language processing and state-of-the-art computer vision models. A Top 5 accuracy of 67% and a Top 1 accuracy of 53% have been achieved. It is also worth mentioning that this is also the first model of its kind to make this forecast.

Langue: Anglais

FAQ

Quelles sont les types de documents recensés dans le catalogue de la bibliothèque CERIST?

Les documents recensés dans le catalogue sont : Les périodiques, Articles de périodiques, les livres, les thèses de post-graduation (magister et doctorat), Rapport de recherche, documents Audiovisuels.

Quels sont les différents horaires de la bibliothèque durant l’année ?

La bibliothèque vous accueille de Dimanche à jeudi de 8h30 à 16h30. Notez que la bibliothèque peut être réquisitionnée pour des raisons administratives.

Où se situe la bibliothèque du Cerist ?

La Bibliothèque se situe au réez de chaussée du bloc B Voir Google Maps du site web pour localiser l’adresse.