# Ollama Cloud Provider Plugin

Overrides the bundled `ollama-cloud` provider profile to properly forward
`reasoning_effort` to the ollama.com API.

## Problem

The bundled profile is a bare `ProviderProfile` with no `build_api_kwargs_extras`
override. The gateway parses `reasoning_effort` from config, but the base class
returns `({}, {})`, so nothing reaches the API. Agents start every session at
the provider's default reasoning level (medium) regardless of their config.

## Fix

This plugin provides `OllamaCloudProfile`, which overrides
`build_api_kwargs_extras` to forward `reasoning_effort` as a top-level API
parameter.

**Value mapping:** `xhigh → high` (ollama.com has no `xhigh`; its `max` is heavier
than `high` and typically overkill).
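
The override described above might look roughly like the following. This is a minimal sketch, not the plugin's actual source: the `ProviderProfile` class here is a hypothetical stand-in for the gateway's base class, the keyword-only signature is assumed, and so is the reading of the returned pair as `(extra body, top-level kwargs)`. Leaving an unset value unforwarded is likewise an assumption.

```python
from typing import Any, Dict, Optional, Tuple

Extras = Tuple[Dict[str, Any], Dict[str, Any]]


class ProviderProfile:
    """Hypothetical stand-in for the gateway's base class (real signature may differ)."""

    name = "base"

    def build_api_kwargs_extras(
        self, *, reasoning_effort: Optional[str] = None, **_: Any
    ) -> Extras:
        # Base behaviour from the Problem section: nothing is forwarded.
        return {}, {}


# ollama.com has no `xhigh`, so it is mapped down to `high`.
EFFORT_MAP = {"xhigh": "high"}


class OllamaCloudProfile(ProviderProfile):
    name = "ollama-cloud"

    def build_api_kwargs_extras(
        self, *, reasoning_effort: Optional[str] = None, **_: Any
    ) -> Extras:
        if not reasoning_effort:
            # Assumption: an unset value leaves the provider default in place.
            return {}, {}
        effort = EFFORT_MAP.get(reasoning_effort, reasoning_effort)
        # Assumption: the second dict carries top-level request parameters.
        return {}, {"reasoning_effort": effort}
```

Under these assumptions, `OllamaCloudProfile().build_api_kwargs_extras(reasoning_effort="xhigh")` yields `({}, {"reasoning_effort": "high"})`, while the base class returns `({}, {})` for any input.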

## Deploy

```bash
hermes-plugin-deploy <agent> --plugin ollama-cloud --restart
```

Or manually: copy `__init__.py` to
`/opt/data/plugins/model-providers/ollama-cloud/__init__.py` and restart the gateway.

User plugins load after bundled ones with last-writer-wins, so this overrides
the stub without modifying the bundled file.
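
The last-writer-wins behaviour can be pictured as a registry keyed by provider name. This is only an illustration: the `PROFILES` dict and `register` helper are hypothetical and do not reflect the gateway's real loader.

```python
from typing import Dict

# Hypothetical registry: provider name -> where the active profile came from.
PROFILES: Dict[str, str] = {}


def register(name: str, source: str) -> None:
    # A later registration under the same name replaces the earlier one.
    PROFILES[name] = source


# Bundled profiles are loaded first, user plugins afterwards.
register("ollama-cloud", "bundled stub")
register("ollama-cloud", "user plugin")

active = PROFILES["ollama-cloud"]
```

Because the user plugin registers second, `active` ends up pointing at it, and the bundled file stays untouched on disk.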

## Requires

- `model.reasoning_effort: high` (or `medium`/`low`/`max`/`none`) in `config.yaml`
  under the `model:` section.

## Applies to

Any agent using `provider: ollama-cloud` (Bastion, Atlas, and any future fleet
agent on this provider).