GPU-Accelerated Optimization of Transformer-Based Neural Networks for Real-Time Inference | ScienceToStartup | ScienceToStartup