Was I doing MaxEnt RL all this time?

I have a quick question. I recently came across maximum entropy RL (MaxEnt RL), but I don't really understand the field. It seems that simply adding an entropy regularization term to your policy loss makes the method MaxEnt RL, as long as the coefficient is 1? Am I missing something? I thought maximum entropy RL would be a more sophisticated algorithm.
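To be concrete, here is roughly what I mean by "adding entropy regularization to the policy loss". This is only a minimal sketch for a single step with a discrete softmax policy; the function names (`softmax`, `entropy`, `policy_loss_with_entropy`) and the `entropy_coef` parameter are made up for illustration and don't come from any particular library.

```python
import math


def softmax(logits):
    """Turn raw action logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def policy_loss_with_entropy(logits, action, advantage, entropy_coef=1.0):
    """Single-sample policy-gradient loss plus an entropy bonus.

    Hypothetical helper: an A2C-style surrogate, -log pi(a|s) * advantage,
    minus entropy_coef * H(pi(.|s)), so that minimizing the loss rewards
    higher policy entropy.
    """
    probs = softmax(logits)
    pg_loss = -math.log(probs[action]) * advantage
    return pg_loss - entropy_coef * entropy(probs)


# Uniform policy over 3 actions, sampled action 0, advantage 1.0.
logits = [0.0, 0.0, 0.0]
plain = policy_loss_with_entropy(logits, action=0, advantage=1.0, entropy_coef=0.0)
regularized = policy_loss_with_entropy(logits, action=0, advantage=1.0, entropy_coef=1.0)
print(plain, regularized)
```

Setting `entropy_coef=1.0` here is the "coefficient 1" case I'm asking about; the usual A2C/PPO setups I've seen just use a much smaller value for the same term.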

TL;DR: Is adding entropy regularization with coefficient 1 to your A2C/PPO/etc. policy loss the same as doing maximum entropy RL?