2026-05-30 12:32:33 -07:00

1.3 KiB

Ollama Cloud Provider Plugin

Overrides the bundled ollama-cloud provider profile to properly forward reasoning_effort to the ollama.com API.

Problem

The bundled profile is a bare ProviderProfile with no build_api_kwargs_extras override. The gateway parses reasoning_effort from config but the base class returns ({}, {}) — nothing reaches the API. Agents start every session at the provider's default reasoning level (medium) regardless of their config.

Fix

This plugin provides OllamaCloudProfile which overrides build_api_kwargs_extras to forward reasoning_effort as a top-level API parameter.

Value mapping: xhigh → high (ollama.com has no xhigh; its max is heavier than high and typically overkill).

Deploy

hermes-plugin-deploy <agent> --plugin ollama-cloud --restart

Or manually: copy __init__.py to /opt/data/plugins/model-providers/ollama-cloud/__init__.py and restart the gateway.

User plugins load after bundled ones with last-writer-wins, so this overrides the stub without modifying the bundled file.

Requires

  • model.reasoning_effort: high (or medium/low/max/none) in config.yaml under the model: section.

Applies to

Any agent using provider: ollama-cloud (Bastion, Atlas, and any future fleet agent on this provider).