Achhina's Digital Garden
Tag: evaluation
7 items with this tag.
May 05, 2026
Designing GenAI Evaluations - Process and Metrics
source
llm
evaluation
methodology
benchmark
metrics
May 05, 2026
GEPA - Reflective Prompt Evolution Can Outperform Reinforcement Learning
source
llm
ai
evaluation
Apr 04, 2026
Dimensions of LLM Quality
evaluation
ai
Apr 04, 2026
LLM Pairwise Preference Judging
information-retrieval
evaluation
Apr 04, 2026
LLM as a Judge for Preference Annotation
information-retrieval
evaluation
Apr 04, 2026
LLM Benchmark Reference
llm
evaluation
ai
benchmark
Apr 04, 2026
LLM Comparison Sources
llm
leaderboards
evaluation
ai