You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| mirostat_eta | Influences how quickly the algorithm responds to feedback from the generated text. A lower learning rate will result in slower adjustments, while a higher learning rate will make the algorithm more responsive. | `0.1` |
38
+
| mirostat_tau | Controls the balance between coherence and diversity of the output. A lower value will result in more focused and coherent text. | `5` |
39
+
| num_ctx | Sets the size of the context window used to generate the next token. | `2048` |
40
+
| repeat_last_n | Sets how far back for the model to look back to prevent repetition. (0 = disabled, -1 = num_ctx) | `64` |
41
+
| repeat_penalty | Sets how strongly to penalize repetitions. A higher value (e.g., 1.5) will penalize repetitions more strongly, while a lower value (e.g., 0.9) will be more lenient. | `1.1` |
42
+
| temperature | The temperature of the model. Increasing the temperature will make the model answer more creatively. | `0.8` |
43
+
| seed | Sets the random number seed to use for generation. Setting this to a specific number will make the model generate the same text for the same prompt. | `0` |
44
+
| num_predict | Maximum number of tokens to predict when generating text. (-1 = infinite generation) | `-1` |
45
+
| top_k | Reduces the probability of generating nonsense. A higher value (e.g. 100) will give more diverse answers, while a lower value (e.g. 10) will be more conservative. | `40` |
46
+
| top_p | Works together with top-k. A higher value (e.g., 0.95) will lead to more diverse text, while a lower value (e.g., 0.5) will generate more focused and conservative text. | `0.9` |
47
+
| min_p | Alternative to the top_p, and aims to ensure a balance of quality and variety. The parameter p represents the minimum probability for a token to be considered, relative to the probability of the most likely token. For example, with p=0.05 and the most likely token having a probability of 0.9, logits with a value less than 0.045 are filtered out. | `0.0` |
0 commit comments