A production-oriented comparison of LLM sampling parameters: how temperature, top-p, top-k, and min-p reshape the output distribution, which combinations actually work, and when not to use them.
Sampling strategies compared: temperature, top-p, top-k, min-p, and what actually works in production