Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition