Open-Source Datasets For Multimodal Generative AI Models