Adapting Domain-Aware Knowledge to Vision-Language

Following 12 feeds