Achhina's Digital Garden

Tag: llm

6 items with this tag.

  • May 05, 2026

    Designing GenAI Evaluations - Process and Metrics

    • source
    • llm
    • evaluation
    • methodology
    • benchmark
    • metrics
  • May 05, 2026

    GEPA - Reflective Prompt Evolution Can Outperform Reinforcement Learning

    • source
    • llm
    • ai
    • evaluation
  • Apr 04, 2026

    LLM Benchmark Reference

    • llm
    • evaluation
    • ai
    • benchmark
  • Apr 04, 2026

    LLM Comparison Sources

    • llm
    • leaderboards
    • evaluation
    • ai
  • Apr 04, 2026

    Autoresearch - Agent-Driven Autonomous ML Experimentation

    • ai-agents
    • llm
    • research-automation
    • karpathy
    • nanochat
  • Mar 05, 2026

    Small Local LLMs as Judges

    • research
    • llm
    • machine-learning
    • claude-code