module
sampler.profiles
Functions
M.balanced
General-purpose default. Mild top-k / top-p / min-p with a moderate temperature; a reasonable choice when you have no specific reason to deviate.
M.precise
Tighter cut-offs, lower temperature. Best when you want plausible, on-topic continuations without much variability.
M.creative
Wider candidate window, higher temperature. Suited to brainstorming, creative writing, and idea generation.
M.code
Code-tuned. Mild repetition penalty, low temperature, narrow top-k. Reduces the rate of accidental loops in code completions.
M.fast
Pure greedy. Deterministic and the fastest possible step; best for grammar-constrained workloads where any randomness would conflict with the constraint.
M.thinking
Tuned for reasoning models: wider top-p, slightly cooler temperature, still some entropy. Pair with `engine.chat({ thinking = true })`.
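The profiles above differ mainly in how aggressively they narrow the candidate set before sampling. The following is a minimal, self-contained sketch of that idea on a toy distribution, not the module's implementation: the helper names (`softmax`, `candidates`, `greedy`), the option fields (`temperature`, `top_k`, `top_p`, `min_p`), the order in which the filters are applied, and every numeric value are assumptions chosen for illustration. The actual values behind each profile are not stated in this reference.

```lua
-- Hypothetical illustration only: names, option fields and numbers are
-- assumptions, not values taken from sampler.profiles.

-- Temperature-scaled softmax over a list of logits.
local function softmax(logits, temperature)
  local max = -math.huge
  for _, l in ipairs(logits) do
    if l > max then max = l end
  end
  local probs, sum = {}, 0
  for i, l in ipairs(logits) do
    probs[i] = math.exp((l - max) / temperature)
    sum = sum + probs[i]
  end
  for i = 1, #probs do
    probs[i] = probs[i] / sum
  end
  return probs
end

-- Returns the token indices that survive top-k, min-p and top-p,
-- in descending probability order.
local function candidates(logits, opts)
  local probs = softmax(logits, opts.temperature)
  local order = {}
  for i = 1, #probs do order[i] = i end
  table.sort(order, function(a, b)
    if probs[a] ~= probs[b] then return probs[a] > probs[b] end
    return a < b
  end)

  local kept, cumulative = {}, 0
  local top = probs[order[1]]
  for rank, idx in ipairs(order) do
    if rank > opts.top_k then break end              -- top-k: keep at most k tokens
    if probs[idx] < opts.min_p * top then break end  -- min-p: drop tokens far below the best
    kept[#kept + 1] = idx
    cumulative = cumulative + probs[idx]
    if cumulative >= opts.top_p then break end       -- top-p: stop once the nucleus is covered
  end
  return kept
end

-- Pure greedy: always take the highest logit, no randomness involved.
local function greedy(logits)
  local best = 1
  for i = 2, #logits do
    if logits[i] > logits[best] then best = i end
  end
  return best
end

local logits = { 2.0, 1.0, 0.5, -1.0 }
-- A "tight" setting and a "wide" setting (made-up values).
local narrow = { temperature = 0.5, top_k = 2,  top_p = 0.90, min_p = 0.10 }
local wide   = { temperature = 1.2, top_k = 40, top_p = 0.98, min_p = 0.02 }

-- The narrow setting keeps 2 candidates, the wide one keeps all 4,
-- and greedy picks index 1.
print(#candidates(logits, narrow), #candidates(logits, wide), greedy(logits))
```

On the same logits, the tight setting leaves only the two most likely tokens to sample from, while the wide setting keeps all four. Greedy skips sampling entirely, which is why it is deterministic.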