SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment | ScienceToStartup