Open-Source Datasets For Multimodal Generative AI Models
My Books
Cybersecurity
Cloud-native Computing
IT Operations
Database Systems
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
The Flickr30k dataset has become a standard benchmark for sentence-based image description
Multimodal Stressed Emotion (MuSE) helps to study the multimodal interplay between the presence of stress and expressions of affect.
Visual Question Answering (VQA) Data set
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence /a>
RGB-D Object Dataset
Datasets for multimodal models,requiring at least images and corresponding captions, suitable for training multimodal large models.
Digital Technologies
Software Categories
Artificial Intelligence
Blockchain
The IoT