Alternatives to Multi-modal Large Language Model | ScienceToStartup