high-quality, annotated datasets tailored for training, validating, and testing machine learning (ML) models. These datasets serve as the foundation for supervised learning, semi-supervised learning, and in some cases, for fine-tuning pre-trained models.