Vision-language Models for Driver Monitoring Systems: A Driver Activity Description Dataset