URL: https://x.com/_philschmid/status/1967887164538057083

Philipp Schmid on X:


Just read this new research paper from Google AI called "Attention is All You Need" and I think my brain is actually broken 🤯

All our best AI models are stuck processing language one word at a time, in order. It's this huge sequential bottleneck.

These researchers just... threw that out and tried something that on paper sounds completely insane. Instead of reading a sentence left-to-right, they built a model that can look at every single word at the same time and just figure out how important each one is to all the others.

That's it. No more "recurrent" or "convolutional" layers. Just this mechanism they call "attention." They're calling the whole thing a "Transformer."

The results are absolutely nuts. It’s not just a little better at machine translation, it's blowing the old models out of the water. And here’s the kicker: because it's not sequential, it can be trained way, way faster on modern hardware.

But here's where it gets really weird. They're focused on translation, but this feels like something much bigger. This isn't just an improvement, it's a completely different way of thinking about language. This architecture feels like it was built to scale.

What happens when you take this "Transformer" and make it 100x bigger? Train it on the entire internet? Does it just get better at translating, or does something else happen?

It feels like we've been trying to force AI to process information like a human, step-by-step. This paper just asks, "what if that's completely the wrong way to do it?"

I have a feeling that in 8 years, we're not going to be talking about LSTMs or RNNs anymore. We're all going to be talking about this. This feels like the start of something huge.
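
For anyone who wants to see the "look at every word at once and score how much each one matters to the others" idea in code, here's a rough sketch in plain Python. Big assumptions: the function name `self_attention` and the toy 2-d embeddings are made up for illustration, and the queries, keys, and values are just the raw input vectors. The paper uses learned projections and multiple heads, so treat this as the bare mechanism, not the actual model.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(vectors):
    """Toy self-attention: every position attends to every position.

    Assumption: queries, keys, and values are the raw input vectors
    (no learned projections, single head).
    """
    d = len(vectors[0])
    outputs, weights = [], []
    for q in vectors:
        # Scaled dot product of this word against all words in the sentence.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in vectors]
        w = softmax(scores)  # importance of each word to this one
        weights.append(w)
        # The output is the importance-weighted average of all word vectors.
        outputs.append([sum(w_j * v[i] for w_j, v in zip(w, vectors)) for i in range(d)])
    return outputs, weights

# Hypothetical 2-d embeddings for a three-word sentence.
sentence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(sentence)
```

Notice that each pass through the loop only reads the inputs and never depends on the result for a previous word. That independence is why the whole thing can run in parallel instead of step-by-step.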