Trying to make responses generate faster
So I'm using oobabooga with tavernAI as a front for all the characters, and responses always take like a minute to generate. I want it to take far less time. I have a 3060 TI with 8 gigs of VRAM. I'm wondering if a different model make it go faster or what settings I should change. I'm pretty much a newbie for all this.