Achieving a robust and universal semantic representation for action description remains the key challenge in natural read more language understanding. Current approaches often struggle to capture the subtlety of human actions, leading to limited representations. To address this challenge, we propose innovative framework that leverages hybrid learni