Achhina's Digital Garden

Tag: evaluation

7 items with this tag.

  • May 05, 2026

    Designing GenAI Evaluations - Process and Metrics

    • source
    • llm
    • evaluation
    • methodology
    • benchmark
    • metrics
  • May 05, 2026

    GEPA - Reflective Prompt Evolution Can Outperform Reinforcement Learning

    • source
    • llm
    • ai
    • evaluation
  • Apr 04, 2026

    Dimensions of LLM Quality

    • evaluation
    • ai
  • Apr 04, 2026

    LLM Pairwise Preference Judging

    • information-retrieval
    • evaluation
  • Apr 04, 2026

    LLM as a Judge for Preference Annotation

    • information-retrieval
    • evaluation
  • Apr 04, 2026

    LLM Benchmark Reference

    • llm
    • evaluation
    • ai
    • benchmark
  • Apr 04, 2026

    LLM Comparison Sources

    • llm
    • leaderboards
    • evaluation
    • ai