Using this dataset, we train a generic CLIP model in 2 days on 1 server.
With a 10M subset of living organisms, we train domain expert CLIP models excelling at fine-grained animal, plant, and fungi classification.
Models are out now on huggingface! The dataset is coming soon.
08.05.2025 12:58 โ ๐ 0 ๐ 1 ๐ฌ 0 ๐ 0
Using Knowledge Graphs to harvest datasets for efficient CLIP model training
Training high-quality CLIP models typically requires enormous datasets, which limits the development of domain-specific models -- especially in areas that even the largest CLIP models do not cover wel...
Excited to release our models and preprint: "Using Knowledge Graphs to harvest datasets for efficient CLIP model training"
We propose a dataset collection method using knowledge graphs and web image search, and create EntityNet-33M: a dataset of 33M images paired with 46M texts.
08.05.2025 12:58 โ ๐ 0 ๐ 1 ๐ฌ 1 ๐ 0