Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation | ScienceToStartup