AI RESEARCH

Are Large Pre-trained Vision Language Models Effective Construction Safety Inspectors?

arXiv CS.CV

ArXi:2508.11011v2 Announce Type: replace Construction safety inspections typically involve a human inspector identifying safety concerns on-site. With the rise of powerful Vision Language Models (VLMs), researchers are exploring their use for tasks such as detecting safety rule violations from on-site images. However, there is a lack of open datasets to comprehensively evaluate and further fine-tune VLMs in construction safety inspection. Current applications of VLMs use small, supervised datasets, limiting their applicability in tasks they are not directly trained for.