Cypher rlhf

How do you guys judge model failure in creative writing and chatbot?