Learn Before Represent (LBR) is a two-stage framework designed to overcome the limitations of Large Language Models (LLMs) adapted via contrastive learning (LLM+CL) in specialized, or 'vertical', domains such as chemistry, law, and medicine. While LLM+CL performs well at general representation learning, it often struggles to acquire domain-specific terminology and knowledge. LBR addresses this by first injecting domain knowledge through an Information Bottleneck-Constrained Generative Learning stage, which maximizes knowledge acquisition while compressing semantics and preserving the LLM's causal attention. It then performs Generative-Refined Contrastive Learning on the compressed representations to achieve semantic alignment. This design maintains architectural consistency across both stages and resolves the inherent objective conflict between generative and contrastive learning. LBR matters to researchers and ML engineers building accurate, robust LLM-based systems for specialized applications where deep domain understanding is paramount, enabling LLMs to move beyond general-purpose tasks into highly technical fields.
Learn Before Represent (LBR) is a new method that helps large AI models understand specialized topics better. It works in two steps: first, it teaches the model specific knowledge, and then it refines how the model represents that knowledge. This allows AI models to perform much better in fields like medicine or chemistry where deep, specific understanding is needed.
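The two-stage objective described above can be sketched in miniature. The snippet below is illustrative only, not the paper's implementation: the IB constraint is stood in for by a simple L2 "rate" penalty on the compressed representation, the generative loss is omitted, and the Stage 2 alignment uses a standard InfoNCE contrastive loss over the compressed embeddings. All names (`compression_head`, `ib_penalty`, the dimensions) are hypothetical.

```python
import numpy as np

def info_nce_loss(anchors, positives, temperature=0.05):
    """Stage 2 sketch: InfoNCE contrastive loss. Each anchor should match
    its own positive against every other positive in the batch."""
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = (a @ p.T) / temperature                 # (B, B) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -float(np.mean(np.diag(log_probs)))       # diagonal = true pairs

def ib_penalty(compressed):
    """Toy stand-in for the information-bottleneck constraint: an L2 "rate"
    term that discourages the compressed representation from carrying
    excess information. The real constraint is more involved."""
    return float(np.mean(compressed ** 2))

rng = np.random.default_rng(0)
hidden = rng.normal(size=(8, 64))             # hypothetical LLM hidden states
compression_head = rng.normal(size=(64, 16)) / 8.0  # 64 -> 16 dims
compressed = hidden @ compression_head        # Stage 1 output: compressed semantics

# Stage 1 objective (sketch): generative loss (omitted here) + IB rate term
stage1_rate = ib_penalty(compressed)

# Stage 2 objective: contrastive alignment on the compressed representations,
# with a lightly perturbed copy standing in for a positive pair.
positives = compressed + rng.normal(scale=0.01, size=compressed.shape)
stage2_loss = info_nce_loss(compressed, positives)
```

The key point the sketch reflects is the ordering: Stage 2's contrastive loss operates on the representations already compressed in Stage 1, rather than optimizing both objectives on the raw hidden states at once.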
LBR