Paper below tested a variety of base LLMs (no TTA) on genera · AI HOT