Transformer based architectures will never lead to high level intelligence no matter how much data it is trained on or how big it is.

[removed]