Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
How do we actually evaluate LLMs?It’s a simple question, but one that tends to open up a much bigger discussion.When advising or collaborating on projects, one of the things I get asked most often is how to choose between different models and how to make sense of the evaluation results out there. (And, of course, how to measure progress when fine-tuning or developing our own.)Since this comes up so often, I thought it might be helpful to share a short overview of the main evaluation methods peop...
Read full article →