immediate deal-breaker
Right? Might as well have just used Ask Jeeves or some nonsense.
Exactly what you’d think: getting a decent answer took more work than doing it yourself.
Cheng was part of a research team that compared two datasets, each comprising personal advice: one dataset written by humans responding to real-world situations and the second dataset consisting of judgments made by LLMs in response to posts on Reddit’s AITA (“Am I the A**hole?”) advice forum.
Hahahahahahahahahahahahhahahahahahahahahaha