LLMs are still not consistent judges of qualitative work, an · AI HOT