Achhina's Digital Garden
Search
Search
Dark mode
Light mode
Explorer
Tag: llm
6 items with this tag.
May 05, 2026
Designing GenAI Evaluations - Process and Metrics
source
llm
evaluation
methodology
benchmark
metrics
May 05, 2026
GEPA - Reflective Prompt Evolution Can Outperform Reinforcement Learning
source
llm
ai
evaluation
Apr 04, 2026
LLM Benchmark Reference
llm
evaluation
ai
benchmark
Apr 04, 2026
LLM Comparison Sources
llm
leaderboards
evaluation
ai
Apr 04, 2026
Autoresearch - Agent-Driven Autonomous ML Experimentation
ai-agents
llm
research-automation
karpathy
nanochat
Mar 05, 2026
Small Local LLMs as Judges
research
llm
machine-learning
claude-code