A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning | ScienceToStartup