Ask HN: How do you personally evaluate new LLM models?

3 months ago 2
Ask HN: How do you personally evaluate new LLM models?
1 point by _samjarman 15 minutes ago | hide | past | favorite | discuss

Hey folks, how do you personally evaluate new HN models? Vibes? Or do you have some tests you like to run? Or do you just use them in your IDE/text iterface for a bit and see how it feels? I know we could probably trust some more public benchmarks but I'm curious on personal evaluation techniques. Thanks!


Read Entire Article