Logo image
Comparative Evaluation of Vision-Language Models in Skin Diseases Recognition
Conference proceeding

Comparative Evaluation of Vision-Language Models in Skin Diseases Recognition

Alexandr Ovsyannikov, Maksim Airapetian, David Barboza, Noor Jamal Alkhateeb and Luqman Ali
2025 International Conference on Computer and Applications (ICCA), pp.1-6
International Conference on Computer and Applications (ICCA2025) (Bahrain, 22/12/2025–24/12/2025)
2025

Abstract

Accuracy Cognition Dermatology Diseases Generative AI Predictive models Sensitivity Skin Skin Diseases Training Vision language model Vision Language Models Visualization
Generative AI models have seen remarkable advancements with innovations in data comprehension and reasoning. Vision-Language Models (VLMs) offer new opportunities for dermatology screening through zero-shot classification. This study provides a comparative benchmark for nine modern VLMs in skin disease classification and highlights key limitations that must be addressed before reliable clinical deployment. Three prompt-engineering strategies and two dataset variations (original and brightness-augmented images) were used to assess the models' robustness to context and visual noise. Results show that Gemini 2.5 Pro consistently outperforms all other models, while smaller and GPT-based models display reduced accuracy and sensitivity to lighting variations. Prompt refinement improves prediction stability across multiple models, confirming the importance of well-structured instructions in zero-shot dermatology tasks.

Details

Metrics

1 Record Views
Logo image