Multi-head attention mechanism: “queries”, “keys”, and “values,” over and over again
*A comment added on 04/05/2022: Thanks to a comment by Mr. Maier, I found a major mistake in my visualization. Specifically, there is a mistake in expressing how […]