retrieval fusion
@@ -202,7 +202,9 @@ Returns `503` if llama-server is unreachable.
 |---|---|---|---|
 | `recentEpisodeLimit` | integer | 1–20 | Recent episodes injected into prompt |
 | `semanticLimit` | integer | 1–20 | Max semantic search results |
-| `scoreThreshold` | float | 0–1 | Minimum similarity score |
+| `scoreThreshold` | float | 0–1 | Minimum similarity score for Qdrant results |
+| `semanticWeight` | float | 0–5 | RRF weight for Qdrant semantic results |
+| `keywordWeight` | float | 0–5 | RRF weight for FTS5 keyword results (`0` = disabled) |
 | `modelsFolderPath` | string | — | Path to folder containing .gguf files |
 | `temperature` | float | 0–2 | Inference randomness |
 | `repeatPenalty` | float | 1–2 | Repeat token penalty |
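The new `semanticWeight` and `keywordWeight` rows describe weights for reciprocal rank fusion (RRF) of Qdrant and FTS5 results. The commit does not show the fusion code itself, so the following is only a minimal sketch of how such settings could interact, assuming the common weighted RRF form `weight / (k + rank)`. The function name `fuse`, the constant `RRF_K = 60`, the default values, and the input shapes are all hypothetical and not taken from the project.

```python
from collections import defaultdict

RRF_K = 60  # assumed smoothing constant; the source does not state one


def fuse(semantic_hits, keyword_ids, *, score_threshold=0.5,
         semantic_weight=1.0, keyword_weight=1.0, limit=5):
    """Hypothetical weighted RRF over two ranked lists.

    semantic_hits: list of (doc_id, similarity) pairs, best first.
    keyword_ids:   list of doc_id values, best first.
    """
    # Per the table, scoreThreshold filters only the semantic results.
    semantic_ids = [doc_id for doc_id, score in semantic_hits
                    if score >= score_threshold]

    fused = defaultdict(float)
    for weight, ranked in ((semantic_weight, semantic_ids),
                           (keyword_weight, keyword_ids)):
        if weight == 0:
            continue  # a weight of 0 disables that source entirely
        for rank, doc_id in enumerate(ranked, start=1):
            fused[doc_id] += weight / (RRF_K + rank)

    # Highest fused score first, truncated to the requested limit.
    return sorted(fused, key=fused.get, reverse=True)[:limit]


semantic_hits = [("a", 0.9), ("b", 0.8), ("c", 0.2)]
keyword_ids = ["b", "d"]
both = fuse(semantic_hits, keyword_ids)
semantic_only = fuse(semantic_hits, keyword_ids, keyword_weight=0)
```

In this sketch a document found by both sources (`"b"`) outranks one found by only one of them, and setting `keyword_weight=0` reduces the result to the thresholded semantic list, which matches the "`0` = disabled" note in the table.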