An open-source toolkit for automatic evaluation of text-to-image generation, including training and test datasets and a distilled multimodal large language model (MLLM).