MechaMental
Admin Guide

Model Configuration

Set up provider accounts, logical models, entitlements, and health monitoring at the organization level.

Organization admins manage the model catalog that workspace users consume. This involves configuring provider accounts with API keys, creating logical models that abstract away specific providers, controlling access through entitlements, and monitoring provider health.

Provider Accounts

Provider accounts store the API credentials for each model provider your organization uses. MechaMental supports providers including OpenAI, Anthropic, Azure OpenAI, Google, Cohere, and others.

Go to Admin -> Provider Accounts in the sidebar. This page lists all configured provider accounts with their current status.

Add a Provider Account

Click Add Account and select the provider from the dropdown.

Enter Credentials

Fill in the authentication details for the selected provider.

  • Account Name -- a label for this account (e.g., "Production OpenAI")
  • API Key -- your OpenAI API key
  • Account Name -- a label for this account (e.g., "Anthropic Primary")
  • API Key -- your Anthropic API key
  • Account Name -- a label for this account
  • Endpoint -- your Azure OpenAI resource endpoint URL
  • API Key -- your Azure API key
  • API Version -- the Azure API version string
  • Account Name -- a label for this account
  • API Key -- your Google AI API key

Set Limits

Optionally configure rate limits and spending caps for this provider account:

  • Requests per minute -- maximum API calls per minute
  • Tokens per minute -- maximum token throughput
  • Monthly spending cap -- hard limit on monthly costs from this account

Test Connection

Click Test Connection to verify that MechaMental can reach the provider with the supplied credentials. A successful test shows the available models on this account.

Multiple accounts per provider

You can add multiple accounts for the same provider. This is useful for separating production and development workloads, or for distributing load across API keys with different rate limits.

Logical Models

Logical models are provider-agnostic abstractions that workspace users reference in their pipelines. Instead of referencing a specific provider and model name, users select a logical model like fast-chat or high-quality-reasoning. This decouples pipeline definitions from provider details, making it easy to swap providers without updating every pipeline.

Go to Admin -> Models in the sidebar. This page shows all logical models with their current targets and status.

Create a Logical Model

Click New Logical Model and fill in:

  • Name -- a descriptive, use-case-oriented name (e.g., fast-chat, code-generation, document-embedding)
  • Type -- the model type: chat, embedding, image, or audio
  • Description -- explain the intended use case

Add Targets

Each logical model has one or more targets -- specific provider + model combinations. Targets form the fallback chain.

For each target, configure:

  • Provider Account -- select from your configured provider accounts
  • Model -- the specific model name on that provider (e.g., gpt-4o, claude-sonnet-4-20250514, gemini-2.0-flash)
  • Priority -- the order in which targets are tried (1 = first choice)

Save

Click Save to create the logical model. It immediately becomes available for workspace users to select in pipeline configurations.

Naming convention

Name logical models by their intended use rather than the underlying provider model. This makes it easy to swap providers without updating every pipeline that references the model.

Managing Targets and Fallback Chains

Each logical model can have multiple targets that form a fallback chain. When a request is made, targets are tried in priority order:

  1. Primary target (priority 1) -- the first choice, usually the best balance of quality and cost
  2. Fallback targets (priority 2, 3, ...) -- used when the primary target fails or hits rate limits
  3. Targets are tried in sequence until one succeeds

Model Entitlements

Entitlements control which workspaces can use which logical models. This prevents teams from accidentally using expensive models and lets you allocate resources appropriately.

Go to Admin -> Model Entitlements in the sidebar.

Select a Logical Model

Choose the logical model you want to configure access for.

Assign Workspaces

Select the workspaces that should have access to this model. Unselected workspaces will not see the model in their model picker.

Set Per-Workspace Limits (Optional)

For each entitled workspace, you can optionally set:

  • Requests per minute -- workspace-level rate limit for this model
  • Tokens per minute -- workspace-level token throughput limit
  • Monthly token cap -- maximum tokens this workspace can consume per month

No entitlement = no access

Workspaces without an entitlement for a given logical model cannot use it. Newly created logical models have no entitlements by default -- you must explicitly grant access.

Provider Health Monitoring

Monitor the health and availability of your provider accounts from Admin -> Provider Health. The health dashboard shows:

MetricDescription
StatusCurrent availability: healthy, degraded, or unreachable
Response LatencyAverage and p95 response times
Error RatePercentage of failed requests over the last hour
Rate Limit HitsNumber of requests rejected due to provider rate limits
Token UsageInput and output tokens consumed
CostEstimated spending per provider account

Health data updates in real time. If a provider's status degrades, the model router automatically shifts traffic to fallback targets (if configured in the logical model's fallback chain).

On this page