Ollama Cloud Provider Plugin
Overrides the bundled ollama-cloud provider profile to properly forward
reasoning_effort to the ollama.com API.
Problem
The bundled profile is a bare ProviderProfile with no build_api_kwargs_extras
override. The gateway parses reasoning_effort from config but the base class
returns ({}, {}) — nothing reaches the API. Agents start every session at
the provider's default reasoning level (medium) regardless of their config.
Fix
This plugin provides OllamaCloudProfile which overrides
build_api_kwargs_extras to forward reasoning_effort as a top-level API
parameter.
Value mapping: xhigh → high. ollama.com has no xhigh level; its max level is
heavier than high and typically overkill, so xhigh maps down to high rather
than up to max.
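A minimal sketch of the override described above. The ProviderProfile stand-in, the single-dict method signature, and the meaning of the two returned dicts are assumptions for illustration; the real gateway base class may differ. Only the ({}, {}) default and the xhigh → high mapping come from this README.

```python
from typing import Any, Dict, Tuple


class ProviderProfile:
    """Hypothetical stand-in for the gateway's base class."""

    def build_api_kwargs_extras(
        self, model_config: Dict[str, Any]
    ) -> Tuple[Dict[str, Any], Dict[str, Any]]:
        # Base behavior described in "Problem": nothing is forwarded.
        return {}, {}


# Config values ollama.com does not accept, mapped to a supported level.
EFFORT_MAP = {"xhigh": "high"}


class OllamaCloudProfile(ProviderProfile):
    def build_api_kwargs_extras(
        self, model_config: Dict[str, Any]
    ) -> Tuple[Dict[str, Any], Dict[str, Any]]:
        effort = model_config.get("reasoning_effort")
        if not effort:
            # No value configured: leave the provider default in place.
            return {}, {}
        effort = EFFORT_MAP.get(effort, effort)
        # Assumption: the first dict carries top-level API parameters.
        return {"reasoning_effort": effort}, {}
```

With this shape, a config value of xhigh would reach the API as high, and any other configured value would pass through unchanged.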
Deploy
hermes-plugin-deploy <agent> --plugin ollama-cloud --restart
Or manually: copy __init__.py to
/opt/data/plugins/model-providers/ollama-cloud/__init__.py and restart the gateway.
User plugins load after bundled ones, and the last writer wins, so this plugin overrides the stub without modifying the bundled file.
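The load-order behavior can be pictured with a plain dictionary keyed by provider name. This is only an illustration of last-writer-wins; the registry and register names are hypothetical and do not describe the gateway's actual loader.

```python
from typing import Dict

# Hypothetical registry: provider name -> where the active profile came from.
registry: Dict[str, str] = {}


def register(provider: str, source: str) -> None:
    # A later registration under the same key replaces the earlier one.
    registry[provider] = source


register("ollama-cloud", "bundled")  # bundled profiles load first
register("ollama-cloud", "user")     # user plugins load afterwards

print(registry["ollama-cloud"])  # user
```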
Requires
model.reasoning_effort: high (or medium/low/max/none) in config.yaml under the model: section.
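As a rough illustration of the expected config shape, the sketch below mirrors config.yaml as a Python dict and checks the value before it is used. The read_reasoning_effort helper and the ACCEPTED set are hypothetical; the gateway's own parsing may behave differently.

```python
from typing import Any, Dict, Optional

# Values named in this README; xhigh is included because the plugin maps it to high.
ACCEPTED = {"none", "low", "medium", "high", "max", "xhigh"}


def read_reasoning_effort(config: Dict[str, Any]) -> Optional[str]:
    """Return model.reasoning_effort, or None when it is not set."""
    value = config.get("model", {}).get("reasoning_effort")
    if value is None:
        return None
    if value not in ACCEPTED:
        raise ValueError(f"unsupported reasoning_effort: {value!r}")
    return value


# Dict equivalent of a config.yaml with reasoning_effort under model:
config = {"model": {"reasoning_effort": "high"}}
print(read_reasoning_effort(config))  # high
```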
Applies to
Any agent using provider: ollama-cloud (Bastion, Atlas, and any future fleet
agent on this provider).