Options that appear in the same research papers as MoE, by co-occurrence.
| — |
| 1 |
| — |
| RAG | — | 1 | — |
| PPO | — | 1 | — |
| Transformer | — | 1 | — |
| LLaMA | — | 1 | — |
| DeepSeek-R1 | — | 1 | — |
| Llama 3 | — | 1 | — |
| RLHF | — | 1 | — |
| DPO | — | 1 | — |
| ORPO | — | 1 | — |
| ACT | — | 1 | — |
| multi-agent systems | — | 1 | — |
| o1 | — | 1 | — |
| MoE routing | — | 1 | — |
| Multi-head Latent Attention | — | 1 | — |
| Phi-4 | — | 1 | — |
| agentic systems | — | 1 | — |
| Action Chunking Transformer | — | 1 | — |
| Vision-Language-Action models | — | 1 | — |
| GPT-2 | — | 1 | — |
| LLaMA-2 | — | 1 | — |
| Nemotron-Cascade 2 | — | 1 | — |
| Qwen2.5-3B | — | 1 | — |