CLIP-Based Multi-Modal Feature Learning for Cloth-

Following 11 feeds