Who Sees the Future? A Deep Learning Language Model Demonstrates the Vision Advantage of Being Small