hugo/bDS

Files

2026-03-01 14:06:03 +01:00

48 KiB

Raw Blame History

Plan: Add Mistral AI as Alternative Chat Provider

Context

bDS currently routes all AI chat through the OpenCode Zen gateway (opencode.ai/zen/v1/...) with two code paths: Anthropic Messages API and OpenAI-compatible. The user wants Mistral AI added as a direct alternative provider with frontier models that support chat completion, tool use, and vision. Mistral's API is OpenAI-compatible (api.mistral.ai/v1/chat/completions), making integration straightforward.

Important architecture facts:

HTTP requests are currently non-streaming (full response body collected, text emitted after each complete call) — to be converted to SSE streaming in a separate prerequisite PR (PR 1)
API keys are stored in plain-text SQLite — to be migrated to Electron safeStorage (OS keychain) for all providers in a separate prerequisite PR (PR 2)
Neither sendAnthropicMessage() nor sendOpenAIMessage() currently sets tool_choice
sendOpenAIMessage() does not convert view_image results to image_url format — they are JSON-stringified
generateConversationTitle() is hardcoded to claude-haiku-4-5 via ZEN_ANTHROPIC_URL
analyzeMediaImage() is hardcoded to claude-sonnet-4-5 via ZEN_ANTHROPIC_URL
checkReady() only checks the OpenCode key — blocks sendMessage() for keyless users
Internal ModelInfo type (returned by getAvailableModels()) is { id, name, provider } — same shape as ChatModel in electronApi.ts; ensure they stay aligned when adding vision field

PR Structure

This work is split into 3 sequential PRs to reduce risk:

PR	Scope	Key Changes
PR 1 — SSE Streaming ✅	Standalone feature, no Mistral dependency	`httpRequestStream()`, SSE parsers (Anthropic + OpenAI formats), `stream: true` in request bodies, tool-call accumulation during streaming
PR 2 — Keychain Migration ✅	Standalone security improvement	Migrate OpenCode API key from plain-text SQLite to `safeStorage`; add encryption/decryption wrappers; delete old plain-text keys (no migration); cross-platform (macOS Keychain, Windows DPAPI, Linux libsecret)
PR 3 — Mistral Integration	Builds on PR 1 + PR 2	Mistral constants, model detection, key storage (using keychain from PR 2), parameterized `sendOpenAIMessage()`, vision fix, provider-aware routing, UI changes, i18n

Target Models

Use latest aliases (not dated IDs) so models auto-update when Mistral releases new versions. getAvailableModels() fetches the actual model list from the API; MODEL_DISPLAY_NAMES provides human-readable names for known models.

Model ID (latest alias)	Display Name	Vision	Tools	Context Window	Context Budget
`mistral-large-latest`	Mistral Large	yes	yes	40k	35,000
`mistral-medium-latest`	Mistral Medium	yes	yes	40k	35,000
`mistral-small-latest`	Mistral Small	yes	yes	128k	120,000
`devstral-small-latest`	Devstral Small	no	yes	128k	120,000
`devstral-large-latest`	Devstral Large	no	yes	256k	240,000

Files to Modify

1. `src/main/engine/OpenCodeManager.ts` - Core provider logic

A. Add Mistral constants (near lines 23-25)

MISTRAL_API_URL = 'https://api.mistral.ai/v1/chat/completions'
MISTRAL_MODELS_URL = 'https://api.mistral.ai/v1/models'

B. Add Mistral models to MODEL_DISPLAY_NAMES (lines 28-69)

'mistral-large-latest': 'Mistral Large'
'mistral-medium-latest': 'Mistral Medium'
'mistral-small-latest': 'Mistral Small'
'devstral-small-latest': 'Devstral Small'
'devstral-large-latest': 'Devstral Large'

C. Update detectProvider() (lines 1839-1845)

Add: if (id.startsWith('mistral') || id.startsWith('ministral') || id.startsWith('devstral') || id.startsWith('codestral') || id.startsWith('pixtral')) return 'mistral';
This covers all current and foreseeable Mistral model prefixes (Mistral, Ministral, Devstral, Codestral, Pixtral)

C2. Update formatModelName() and UPPERCASE_PREFIXES

formatModelName() (L1869) first checks MODEL_DISPLAY_NAMES, then auto-formats via hyphen splitting + capitalization
All 5 Mistral models are in MODEL_DISPLAY_NAMES, so auto-format is a fallback for future unknown models — no changes needed
UPPERCASE_PREFIXES (L72) contains ['gpt', 'glm'] — no Mistral prefixes need uppercasing, so no changes needed

D. Add Mistral API key storage (using keychain from PR 2)

New field: private mistralApiKey: string = ''
New methods: setMistralApiKey(), getMistralApiKey(), validateMistralApiKey()
Load on init via SecureKeyStore.retrieve() (keychain infrastructure from PR 2)
Store/retrieve using the same SecureKeyStore wrapper that PR 2 introduces for the OpenCode key
No plain-text fallback — safeStorage is required

E. Update checkReady()

Return ready: true if either OpenCode key or Mistral key is set
Extend ChatReadyStatus to report per-provider availability, e.g. providers: { opencode: boolean, mistral: boolean }
Callers (Sidebar.tsx, sendMessage()) must gate on the relevant provider, not a single boolean

F. Parameterize sendOpenAIMessage() for Mistral (no separate method)

Mistral uses the identical OpenAI-compatible chat/completions format — creating a separate sendMistralRequest() would be a near-duplicate
Instead, parameterize sendOpenAIMessage() to accept URL, API key, and provider-specific options:
- Add params: apiUrl: string, apiKey: string, providerOptions?: { parallelToolCalls?: boolean }
- sendMessage() determines provider via detectProvider() and calls sendOpenAIMessage() with the correct URL/key/options
- For OpenCode OpenAI path: URL = ZEN_OPENAI_URL, key = this.apiKey
- For Mistral: URL = MISTRAL_API_URL, key = this.mistralApiKey, parallelToolCalls: false
tool_choice: omit entirely for all OpenAI-compatible providers (default "auto" is correct)
parallel_tool_calls: false — set explicitly for Mistral only; our tool executor runs tools sequentially, so parallel tool calls would break the execution loop

F1b. Update requestProvider closure in sendMessage()

The requestProvider lambda (~line 362) dispatches to sendAnthropicMessage() or sendOpenAIMessage() based on detectProvider()
The else branch must pass provider-specific URL/key/options when calling sendOpenAIMessage():
- provider === 'mistral': URL = MISTRAL_API_URL, key = this.mistralApiKey, options = { parallelToolCalls: false }
- All other non-Anthropic providers: URL = ZEN_OPENAI_URL, key = this.apiKey (existing behavior)
Helper method getProviderConfig(provider) could return { apiUrl, apiKey, options } to keep requestProvider clean

F2. Add MODEL_CONTEXT_BUDGETS map

New constant map MODEL_CONTEXT_BUDGETS: Record<string, number> with per-model token budgets
truncateToTokenBudget() (L1654) currently defaults to maxContextTokens = 150000
In sendAnthropicMessage() and sendOpenAIMessage(): pass the model's context budget from the map (defaulting to 150,000 for OpenCode models)
The parameterized sendOpenAIMessage() looks up MODEL_CONTEXT_BUDGETS[modelId] for Mistral models and passes to truncation
Values (keyed by latest aliases):
- 'mistral-large-latest': 35_000
- 'mistral-medium-latest': 35_000
- 'mistral-small-latest': 120_000
- 'devstral-small-latest': 120_000
- 'devstral-large-latest': 240_000

G. Fix tool-call message history in OpenAI-compatible path

Within a single sendMessage() call, the tool loop correctly tracks tool results across rounds
However, tool role messages are not persisted to DB-backed conversation history — on conversation resume, the model loses context about prior tool results
Ensure tool role messages are included when persisting conversation history so cross-session continuity works
This affects all OpenAI-compatible providers (OpenCode OpenAI path + Mistral)

H. Fix vision in OpenAI-compatible path (affects Mistral too)

sendOpenAIMessage() currently JSON-stringifies view_image results — no image_url conversion
Add image_url format conversion for __isImageResult objects in the OpenAI path: { type: 'image_url', image_url: { url: 'data:image/webp;base64,...' } }
This fixes vision for all OpenAI-compatible providers, not just Mistral

I. Update getAvailableModels() — merge from both providers

Model list merge strategy: fetch models from each configured provider's API endpoint and merge into a single list. When both keys are configured, return models from both; when only one key is set, return only that provider's models; when no key is set, return an empty list (UI disables the dropdown)
OpenCode models: fetched from existing OpenCode API (as today)
Mistral models: fetched from GET https://api.mistral.ai/v1/models when Mistral key is set; cross-reference returned IDs with MODEL_DISPLAY_NAMES to use display names + static vision/contextBudget metadata
Every model entry carries provider: 'opencode' | 'mistral' so the UI and engine can resolve the correct API URL + key
Invalidate cachedModels/cachedModelsAt when any provider key is added or removed

I2. Filter fallback model list by available keys

getAvailableModels() currently falls back to the full MODEL_DISPLAY_NAMES map when the API call fails
With Mistral models added to MODEL_DISPLAY_NAMES, the fallback would show Mistral models even without a Mistral key
Filter fallback: fallback.filter(m => this.isProviderKeySet(m.provider)) — only include models whose provider has a configured key
Add helper isProviderKeySet(provider: string): boolean that checks the relevant key field
Same issue in validateApiKey(): currently returns models from the full MODEL_DISPLAY_NAMES map regardless of which provider key was validated. Once Mistral models are added, a successful OpenCode validation would incorrectly include Mistral models. Apply the same isProviderKeySet() filter to validateApiKey() results

I3. Add MODEL_CAPABILITIES map for vision flags

The Mistral API's /v1/models endpoint does NOT include a vision field — vision capability must come from a local static map
New constant: MODEL_CAPABILITIES: Record<string, { vision: boolean }> keyed by model ID
Entries for Mistral models:
- 'mistral-large-latest': { vision: true }
- 'mistral-medium-latest': { vision: true }
- 'mistral-small-latest': { vision: true }
- 'devstral-small-latest': { vision: false }
- 'devstral-large-latest': { vision: false }
OpenCode models also need vision flags (e.g., 'claude-sonnet-4-5': { vision: true }, 'o3': { vision: false }) for the image analysis model dropdown filter
getAvailableModels() attaches vision from this map to each returned model
Falls back to vision: false for unknown models (safe default; prevents non-vision models from appearing in the image analysis dropdown)

I4. Reconcile ModelInfo and ChatModel types

Internal ModelInfo (returned by getAvailableModels()) is { id, name, provider } — same shape as ChatModel in electronApi.ts
When adding vision: boolean to ChatModel, also update ModelInfo (or alias/merge them) so the engine and renderer use the same type
Simplest approach: remove ModelInfo, use ChatModel everywhere (engine + IPC + renderer)

J. Update generateConversationTitle() — make configurable in Preferences

Currently hardcoded to claude-haiku-4-5 via ZEN_ANTHROPIC_URL with OpenCode key
Add a "Title generation model" preference in Settings so users can pick the cheapest model for this task
Default: claude-haiku-4-5 (OpenCode) or mistral-small-latest (Mistral) based on available keys
Route to the correct provider URL + API key based on the selected model's provider
Must work with any configured provider, not just OpenCode

K. Update analyzeMediaImage() — make configurable in Preferences (lines 2066-2192)

Currently hardcoded to claude-sonnet-4-5 via ZEN_ANTHROPIC_URL
Add an "Image analysis model" preference in Settings so users can select a dedicated vision model independent of their chat model (e.g. use Devstral for chat but Mistral Large 3 for images)
Only vision-capable models may be offered — filter model list by a vision capability flag (e.g. Devstral models have no vision and must be excluded)
Default: claude-sonnet-4-5 (OpenCode) or first vision-capable Mistral model based on available keys
Route to the correct provider URL + API key based on the selected model's provider
When routed to Mistral: use image_url format with base64 data URI
When routed to OpenCode/Anthropic: keep current Anthropic-native image block format

L. Update analyzeTaxonomy()

Currently uses this.apiKey (OpenCode) for both Anthropic and OpenAI paths
Has an early-return guard if (!this.apiKey) that must become provider-aware — check Mistral key when provider is Mistral
When a Mistral model is selected: use Mistral API key + MISTRAL_API_URL
Must branch on provider to select correct key and URL
Note: the OpenAI branch inside analyzeTaxonomy() also hardcodes ZEN_OPENAI_URL and this.apiKey — both must become provider-aware (use getProviderConfig(provider) helper from F1b)

L2. Update analyzeMediaImage() API key guard

Same issue: has if (!this.apiKey) early-return guard
Must become provider-aware — check the relevant provider's key based on the selected image analysis model
When routed to Mistral: check this.mistralApiKey instead of this.apiKey

M. Convert chat HTTP calls to SSE streaming (PR 1 — separate prerequisite PR)

This entire section (M1–M6) is implemented in PR 1, before the Mistral PR. The Mistral PR (PR 3) inherits streaming support and only needs to pass the correct URL/key/options to the already-streaming sendOpenAIMessage().

Currently httpRequest() buffers the entire response body before any text reaches the UI. Users wait 5–30s per API round with only a bouncing-dots indicator. All three providers (Anthropic, OpenAI, Mistral) support stream: true with SSE.

M1. Core streaming infrastructure — httpRequestStream()

New method (~100 lines) — uses Node.js https.request() but reads res as a readable stream
Returns an async iterable of parsed SSE events (or accepts an onEvent callback)
SSE line protocol: lines separated by \n\n, each line prefixed with event: or data:
Must handle:
- Buffering partial lines across data chunks (TCP may split mid-line)
- Empty data: lines (keep-alive pings)
- data: [DONE] sentinel — terminates the stream for OpenAI/Mistral (do NOT try to JSON.parse this)
- Multiple data: lines between double-newlines (concatenate per SSE spec)
Supports AbortSignal — calls req.destroy() to terminate immediately
120-second timeout matching existing httpRequest()
On non-2xx status: collect the error body (not streamed) and throw with parsed error message

M2. SSE parser for OpenAI/Mistral format (~50 lines) OpenAI and Mistral use identical SSE event structure:

data: {"id":"...","choices":[{"delta":{"content":"Hello"}}]}
data: {"id":"...","choices":[{"delta":{"tool_calls":[{"index":0,"id":"call_...","function":{"name":"search_posts","arguments":""}}]}}]}
data: {"id":"...","choices":[{"delta":{"tool_calls":[{"index":0,"function":{"arguments":"{\"query\""}}]}}]}
...
data: {"id":"...","usage":{"prompt_tokens":150,"completion_tokens":42,"total_tokens":192}}
data: [DONE]

Text deltas: choices[0].delta.content — emit via onDelta(content) immediately
Tool call start: delta.tool_calls[i] with id + function.name — begin accumulating arguments for tool call at index i
Tool call argument fragments: delta.tool_calls[i].function.arguments — append to argument accumulator string for index i
Finish reason: choices[0].finish_reason === 'tool_calls' or 'stop' — signals end of this chunk
Token usage: arrives in the final chunk before [DONE] only if stream_options: { include_usage: true } is set in the request body — parse usage.prompt_tokens, usage.completion_tokens, usage.total_tokens
[DONE] sentinel: stop iteration, do NOT JSON.parse
After stream ends: if tool calls were accumulated, JSON.parse each tool's assembled arguments string and execute

M3. SSE parser for Anthropic format (~60 lines) Anthropic uses named event types:

event: message_start
data: {"type":"message_start","message":{"id":"...","model":"...","usage":{"input_tokens":150}}}

event: content_block_start
data: {"type":"content_block_start","index":0,"content_block":{"type":"text","text":""}}

event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"Hello"}}

event: content_block_start
data: {"type":"content_block_start","index":1,"content_block":{"type":"tool_use","id":"toolu_...","name":"search_posts"}}

event: content_block_delta
data: {"type":"content_block_delta","index":1,"delta":{"type":"input_json_delta","partial_json":"{\"query\""}}

event: content_block_stop
data: {"type":"content_block_stop","index":1}

event: message_delta
data: {"type":"message_delta","delta":{"stop_reason":"tool_use"},"usage":{"output_tokens":42}}

event: message_stop
data: {"type":"message_stop"}

message_start: extract usage.input_tokens (prompt tokens) + usage.cache_read_input_tokens + usage.cache_creation_input_tokens
content_block_start with type: 'text': no-op (empty initial text)
content_block_start with type: 'tool_use': record tool call id and name at block index
content_block_delta with type: 'text_delta': emit via onDelta(delta.text) immediately
content_block_delta with type: 'input_json_delta': append delta.partial_json to argument accumulator
content_block_stop: if tool block, JSON.parse the accumulated arguments for that block
message_delta: extract usage.output_tokens (completion tokens), delta.stop_reason
message_stop: stream complete
ping: ignore (keep-alive)
error: throw with data.error.message — handles mid-stream server errors (e.g. overloaded)

M4. Request body changes

sendAnthropicMessage(): add "stream": true to request body
sendOpenAIMessage() (used for both OpenCode OpenAI and Mistral): add "stream": true and "stream_options": { "include_usage": true } to request body — this is required to receive token usage in streaming mode (without it, usage is omitted from streamed responses)

M5. Tool call accumulation during streaming

Tool call arguments arrive as partial JSON fragments across many SSE events
Maintain a per-stream accumulator: Map<number, { id: string, name: string, arguments: string }> keyed by tool call index
Append each arguments fragment to the accumulator string
On stream completion (finish_reason tool_calls/tool_use, or content_block_stop for Anthropic): JSON.parse the full accumulated arguments string and execute the tool
If JSON.parse fails on accumulated arguments, report a tool error to the model and continue

M6. Error handling during streaming

Non-2xx status on connection: do NOT stream; collect the full error body and throw (same as current httpRequest() behavior)
Mid-stream TCP disconnect / network error: res.on('error') handler — emit whatever text was accumulated so far, then throw so the tool-call loop can surface the error to the user
Mid-stream API error event: Anthropic sends event: error with error details; OpenAI/Mistral return an error JSON in a data: line — detect and throw with parsed error message
Abort during streaming: req.destroy() triggers res.on('error') or res.on('close') — handle gracefully without surfacing as an error to the user (it's intentional cancellation)

M7. Retry with exponential backoff for transient errors

Applies to all providers (Anthropic, OpenAI, Mistral) for both streaming and non-streaming calls
Retry on HTTP status codes: 429 (rate limit), 503 (service unavailable), 502 (bad gateway)
Max 3 retries with exponential backoff: ~1s, ~2s, ~4s (with jitter)
For 429: respect Retry-After header if present (use as minimum delay)
For streaming: retry the entire request (cannot resume a partial SSE stream)
Do NOT retry on 4xx errors other than 429 (client errors like 400, 401, 403 are not transient)
Do NOT retry on abort (intentional cancellation)
Emit a brief status via onDelta or logging so the user knows a retry is in progress (e.g., "[Retrying...]") — or silently retry if preferred
Wrap in a helper: withRetry(fn, { maxRetries: 3, retryableStatuses: [429, 502, 503] })

What does NOT change:

The renderer pipeline — onDelta → IPC chat-stream-delta → appendStreamDelta → React state → live Markdown rendering already works token-by-token; it just receives one big chunk today
AbortController abort support — req.destroy() stops the stream immediately instead of wasting a buffered response
The tool-call loop structure — still max 10 rounds, still sequential

What to keep non-streaming:

generateConversationTitle() — small one-shot request, buffering is fine
analyzeMediaImage() — one-shot, no UI streaming needed
analyzeTaxonomy() — one-shot, no UI streaming needed
validateApiKey() / validateMistralApiKey() — small validation requests
Note: validateMistralApiKey() must call GET https://api.mistral.ai/v1/models with Authorization: Bearer ${key}. Mistral returns { data: [{ id, object, created, owned_by }] } — check for HTTP 200 + non-empty data array. On 401, return invalid. On success, optionally cross-reference returned model IDs with MODEL_DISPLAY_NAMES to verify expected models are available

Estimated scope: ~350 lines of new code in OpenCodeManager.ts (streaming ~200 lines + retry ~50 lines + parsers ~100 lines)

1b. Keychain Migration (PR 2 — separate prerequisite PR)

This section is implemented in PR 2, before the Mistral PR. PR 3 (Mistral) uses the keychain infrastructure introduced here.

Scope: Migrate all API keys from plain-text SQLite to Electron safeStorage (OS keychain). Cross-platform: macOS Keychain, Windows DPAPI, Linux libsecret. No legacy fallback — old plain-text keys are deleted on startup; users re-enter keys.

1b-A. SecureKeyStore utility class (~60 lines)

New module: src/main/engine/SecureKeyStore.ts
store(key: string, value: string) — encrypts with safeStorage.encryptString(), stores encrypted Buffer in SQLite settings table (as base64 string under a __encrypted_ prefixed key)
retrieve(key: string): string | null — reads encrypted base64 from SQLite, decrypts with safeStorage.decryptString()
remove(key: string) — deletes the encrypted entry
isAvailable(): boolean — wraps safeStorage.isEncryptionAvailable()
No plain-text fallback — store() throws if safeStorage is unavailable

1b-B. Cleanup of old plain-text keys (~10 lines)

On app startup (in getOpenCodeManager() init): delete plain-text opencode_api_key from settings if it exists
No migration — users re-enter their API key after the update
Simple and secure: no window where both plain-text and encrypted keys coexist

1b-C. Update setApiKey() / getApiKey() in chatHandlers

Use SecureKeyStore.store() / SecureKeyStore.retrieve() instead of direct getSetting()/setSetting()
getApiKey() returns masked key as before (for UI display)
validateApiKey() unchanged — works with the decrypted key in memory

1b-D. Tests

SecureKeyStore unit tests: encrypt/decrypt round-trip, error when safeStorage unavailable, cleanup of old plain-text keys
Mock safeStorage in tests (it's an Electron API, not available in Node)

Estimated scope: ~120 lines of new code + ~80 lines of tests

2. `src/main/engine/ChatEngine.ts` - Settings persistence

A. Add Mistral key helpers

Use existing generic getSetting()/setSetting() with key 'mistral_api_key' — no dedicated methods needed, avoids unnecessary boilerplate
ChatEngine already exposes generic helpers for reading/writing the settings table
Note: the actual encrypted key storage goes through SecureKeyStore (PR 2) — getSetting()/setSetting() is used only for non-sensitive preferences

B. Default model is user-driven

getSelectedModel() defaults to 'claude-sonnet-4-5'
When user configures providers in Preferences, they explicitly select their default model — no automatic fallback logic needed
All surfaces (ChatPanel, AssistantSidebar, ImportAnalysisView) use this preference as default
If selected model's provider key is later removed:
- sendMessage() returns a clear error string: "The selected model requires a {provider} API key. Configure it in Settings."
- checkReady() still returns ready: true if any other provider is available
- ChatPanel shows an inline error banner (not a toast) with a link/button to open Settings
- i18n key: chat.providerKeyMissing — "The model '{model}' requires a {provider} API key. Go to Settings to configure it."
- Add this key to all 5 locale files
- This applies equally to existing open conversations whose model belongs to the removed provider — the next sendMessage() in those conversations shows the same inline error, not a silent failure

C. Add per-purpose model preferences

getTitleModel() / setTitleModel(modelId) — settings key 'chat_title_model'
getImageAnalysisModel() / setImageAnalysisModel(modelId) — settings key 'chat_image_analysis_model'
Both default to null (= use hardcoded defaults per provider)

3. `src/main/ipc/chatHandlers.ts` - IPC bridge

A. Add Mistral-specific handlers

chat:setMistralApiKey - validate + persist Mistral key, invalidate model cache
chat:getMistralApiKey - return masked key
chat:validateMistralApiKey - test key against Mistral API

B. Update chat:getAvailableModels

Include Mistral models when Mistral key is configured
Return provider info per model

C. Update chat:checkReady

Report readiness for both providers independently

D. Update getOpenCodeManager() init

Load Mistral API key via SecureKeyStore.retrieve('mistral_api_key') on first call (alongside OpenCode key)
Call manager.setMistralApiKey() during init

E. Add per-purpose model preference handlers

chat:setTitleModel / chat:getTitleModel — persist + load title generation model preference
chat:setImageAnalysisModel / chat:getImageAnalysisModel — persist + load image analysis model preference

4. `src/main/shared/electronApi.ts` - Type definitions

A. Extend ChatModel interface

Add provider: 'opencode' | 'mistral' field (already optional, ensure populated)
Add vision: boolean field — indicates whether the model supports image inputs (used to filter the image analysis model dropdown)

B. Extend ChatReadyStatus interface

Add providers?: { opencode: boolean; mistral: boolean } for per-provider status

C. Add Mistral IPC methods to ElectronAPI.chat

setMistralApiKey(key: string)
getMistralApiKey()
validateMistralApiKey(key: string)

D. Add per-purpose model preference methods to ElectronAPI.chat

setTitleModel(modelId: string | null) / getTitleModel()
setImageAnalysisModel(modelId: string | null) / getImageAnalysisModel()

5. `src/renderer/components/SettingsView/SettingsView.tsx` - UI settings

A. Add Mistral API key section

Separate input field for Mistral API key (below OpenCode key)
Same pattern: masked display, change button, validation on save

B. Update model selector

SettingsView uses a native <select> element — group models by provider using <optgroup> labels ("OpenCode Zen", "Mistral AI")
When no API key is configured for any provider, disable the <select> dropdown
When both keys configured, show merged list from both providers; when only one key set, show only that provider's models
Note: availableModels state is currently typed as {id: string; name: string}[] — must be updated to ChatModel[] (which includes provider and vision fields) so provider grouping and vision filtering work

C. Add per-purpose model preferences

"Title generation model" dropdown — select cheapest/fastest model for auto-titling conversations
"Image analysis model" dropdown — select a dedicated vision model for media metadata (independent of chat model, e.g. use Devstral for chat but Mistral Large 3 for images); only show vision-capable models (filter out models without vision support like Devstral)
Both show available models from all configured providers, grouped by provider
Both allow a "Default" option that auto-selects per provider defaults

6. `src/renderer/components/ChatPanel/ChatPanel.tsx` - Chat UI

A. Update model selector in chat

ChatPanel uses a custom dropdown (CSS model-dropdown with <button> elements, not a native <select>) — add provider group headers (non-clickable divider labels) within the dropdown to visually separate providers
Only show models for configured providers; when no keys configured, hide the model selector entirely
When both providers configured, merge models from both with visual grouping

7. `src/renderer/components/AssistantSidebar/` - Assistant UI

A. No model selector changes needed

AssistantSidebar has no model selector and no checkReady() call of its own
It uses whatever default model is set in Preferences (via getSelectedModel())
No code changes needed here — provider-awareness is handled at the Preferences and engine level

8. `src/renderer/components/Sidebar.tsx` - Navigation

A. Update readiness check

Calls chat.checkReady() to show/hide chat features
Must handle multi-provider readiness (show chat if any provider is ready)
Note: Zustand store (src/renderer/store/appStore.ts) currently only tracks chatTokenUsage — no provider/readiness state is stored there. Provider readiness is ephemeral (fetched on mount via checkReady()), so no Zustand changes needed. If future features need reactive provider state, consider adding it then

9. `src/renderer/components/ImportAnalysisView/ImportAnalysisView.tsx` - Taxonomy analysis UI

A. Update model selector

Has its own model selector (ChatModel[] state + getAvailableModels() call) for taxonomy analysis
Currently renders a flat model list with no provider grouping
Apply provider grouping matching the component's existing dropdown pattern
Default to whatever is set in Preferences as default model (via getSelectedModel())

9b. Model selector UI approach

Two different dropdown patterns exist — keep each surface consistent with its current UX:

SettingsView uses native <select> elements → use <optgroup> for provider grouping (standard HTML pattern)
ChatPanel uses a custom CSS dropdown (model-dropdown with <button> elements) → add non-clickable provider group headers as dividers
ImportAnalysisView uses a custom CSS dropdown (taxonomy-model-dropdown with <button> elements, same pattern as ChatPanel) → add non-clickable provider group headers as dividers
Shared logic (filtering by vision, adding "Default" option, provider grouping) can be extracted into a utility function or hook rather than a full component, since the rendering pattern differs per surface
Props for the shared utility: models: ChatModel[], filterVisionOnly?: boolean, includeDefault?: boolean → returns grouped/filtered model list

9c. `src/renderer/navigation/useChatMessageSender.ts` - Shared chat hook

A. Verify no provider assumptions

Used by both ChatPanel and AssistantSidebar to send messages
Currently delegates to sendConversationMessage() from chatSession.ts — verify neither has hardcoded provider/model assumptions
No code changes expected, but must be verified during implementation

10. Preload/IPC registration

A. src/main/preload.ts

Register new Mistral IPC channels in preload bridge
All chat IPC channels are bridged 1:1; new methods need entries here

11. i18n - All locale files

A. Add Mistral-specific i18n keys in all 5 locale files:

src/renderer/i18n/locales/en.json
src/renderer/i18n/locales/de.json
src/renderer/i18n/locales/fr.json
src/renderer/i18n/locales/es.json
src/renderer/i18n/locales/it.json

Keys needed:

settings.ai.mistralApiKeyLabel — "Mistral API Key"
settings.ai.mistralApiKeyDescription — description text
settings.ai.mistralApiKeyPlaceholder — placeholder text
settings.ai.titleModelLabel — "Title generation model"
settings.ai.titleModelDescription — description text
settings.ai.imageAnalysisModelLabel — "Image analysis model"
settings.ai.imageAnalysisModelDescription — description text
settings.ai.defaultOption — "Default" (for per-purpose model selectors)
settings.ai.providerGroupOpenCode — "OpenCode Zen" (provider group label)
settings.ai.providerGroupMistral — "Mistral AI" (provider group label)
chat.providerKeyMissing — "The model '{model}' requires a {provider} API key. Go to Settings to configure it."
chat.apiKeyRequiredTitle — make generic or multi-provider (currently hardcoded to "OpenCode Zen API Key Required")
chat.apiKeyRequiredDescription — make generic or multi-provider (currently hardcoded to OpenCode-specific text)

12. MCP Server - `src/main/engine/MCPServer.ts`

No changes needed — MCP server exposes tools for external AI agents to call; no bDS-side AI runs during MCP requests

13. Python API - `src/main/shared/pythonApiContractV1.ts`

No changes needed — AI/chat features are explicitly not exposed via Python API

14. Main-process i18n locales - `src/main/shared/i18n/locales/`

No changes expected — chat-related strings are renderer-only
Verify no main-process strings reference "OpenCode" in a way that needs updating for multi-provider support

Tests to Update

New tests

PR 1 (SSE Streaming):

SSE line parsing (both OpenAI/Mistral and Anthropic formats)
[DONE] sentinel handling
Tool-call argument accumulation during streaming
Mid-stream error handling
stream_options in request bodies
Partial line buffering across TCP chunks
Abort during streaming (graceful cancellation)
Retry with exponential backoff: 429/502/503 retries, Retry-After header parsing, no retry on 4xx/abort

PR 2 (Keychain Migration):

SecureKeyStore encrypt/decrypt round-trip
Error when safeStorage unavailable (no plain-text fallback)
Cleanup of old plain-text keys on startup
chatHandlers integration with SecureKeyStore

PR 3 (Mistral Integration):

OpenCodeManager: Mistral key storage, detectProvider('mistral-*') + detectProvider('devstral-*') + detectProvider('codestral-*') + detectProvider('pixtral-*'), parameterized sendOpenAIMessage() with Mistral URL/key, vision image conversion in OpenAI path, tool-call message persistence in OpenAI path, generateConversationTitle() Mistral routing, model cache merge (both providers), MODEL_CONTEXT_BUDGETS correctness, MODEL_CAPABILITIES correctness, isProviderKeySet() helper, getProviderConfig() helper, fallback model list filtering by available keys, provider-aware API key guards in analyzeTaxonomy()/analyzeMediaImage()
ChatEngine: getTitleModel()/setTitleModel(), getImageAnalysisModel()/setImageAnalysisModel(), default model fallback
chatHandlers: new Mistral IPC handlers, per-purpose model preference handlers

Existing tests to update

tests/engine/OpenCodeManagerTools.test.ts — if mocked manager gains new required fields
tests/engine/ChatEngine.test.ts — default model fallback logic
tests/ipc/chatHandlers.test.ts — new handler registration, init flow
electronApiContract.test.ts — ElectronAPI.chat shape now includes Mistral methods
10 renderer test files that mock window.electronAPI.chat (12 mock blocks total) — add Mistral method stubs to mocks:
- tests/renderer/components/SidebarChat.test.tsx
- tests/renderer/components/SettingsView.test.tsx
- tests/renderer/components/SettingsView.i18n.test.tsx
- tests/renderer/components/TabBar.test.tsx
- tests/renderer/components/EditorDashboardTimeline.test.tsx
- tests/renderer/components/AssistantSidebar.wiring.test.tsx
- tests/renderer/navigation/chatSurfaceUsageGuards.test.ts
- tests/renderer/navigation/chatSurfaceModeUsageGuards.test.ts
- tests/renderer/navigation/assistantSidebarGuards.test.ts
- tests/renderer/a2ui/surfaceActionWiring.test.tsx

Implementation Order

PR 1 — SSE Streaming (prerequisite)

Tests first (per AGENTS.md) — SSE parsing, streaming error handling, tool accumulation
Core streaming infrastructure (httpRequestStream(), SSE line parser)
Anthropic SSE parser (sendAnthropicMessage() → streaming)
OpenAI/Mistral SSE parser (sendOpenAIMessage() → streaming)
stream: true + stream_options in request bodies
Update existing test mocks if needed
Build verification (npm run build)

PR 2 — Keychain Migration (prerequisite)

Tests first — SecureKeyStore unit tests
SecureKeyStore utility class
Delete old plain-text opencode_api_key in getOpenCodeManager() init
Update chatHandlers setApiKey() / init to use SecureKeyStore
Add deleteSetting() to ChatEngine for cleanup
Build verification (npm run build)

PR 3 — Mistral Integration (builds on PR 1 + PR 2)

Tests first (per AGENTS.md)
Types (electronApi.ts — ChatModel with vision, ChatReadyStatus, ElectronAPI.chat; unify ModelInfo/ChatModel)
Engine (OpenCodeManager.ts — constants, MODEL_DISPLAY_NAMES, MODEL_CONTEXT_BUDGETS, MODEL_CAPABILITIES, detectProvider(), key storage via SecureKeyStore, checkReady(), getProviderConfig(), isProviderKeySet(), parameterized sendOpenAIMessage(), vision fix, provider-aware guards, title generation fallback, model cache merge + fallback filtering)
Persistence (ChatEngine.ts — per-purpose model preferences, default model fallback)
IPC (chatHandlers.ts — new handlers, init flow update)
Preload (preload.ts — bridge new channels)
i18n (all 5 locale files)
Shared utilities (model grouping/filtering utility for provider-aware dropdowns)
UI (SettingsView/SettingsView.tsx, ChatPanel.tsx, ImportAnalysisView.tsx, Sidebar.tsx)
Update existing test mocks (10 renderer test files + engine/IPC tests)
Build verification (npm run build)

Key Differences to Handle

Aspect	OpenCode/Anthropic	OpenCode/OpenAI-compat	Mistral
Base URL	`opencode.ai/zen/v1/messages`	`opencode.ai/zen/v1/chat/completions`	`api.mistral.ai/v1/chat/completions`
Auth header	`Bearer ${openCodeKey}`	`Bearer ${openCodeKey}`	`Bearer ${mistralKey}`
Request method	`sendAnthropicMessage()`	`sendOpenAIMessage(url, key)`	`sendOpenAIMessage(url, key, opts)` (same method, parameterized)
Tool choice	not set	not set	not set (default `"auto"`)
Parallel tools	not set	not set	`parallel_tool_calls: false`
Context budget	150k tokens	150k tokens	per-model (see Target Models table)
Stream options	`"stream": true`	`"stream": true, "stream_options": {"include_usage": true}`	`"stream": true, "stream_options": {"include_usage": true}`
Vision in tool results	Anthropic `image` block (native)	BUG: JSON-stringified	`image_url` block (fix needed)
HTTP mode	SSE streaming (`stream: true`)	SSE streaming (`stream: true`)	SSE streaming (`stream: true`)
Title generation	`claude-haiku-4-5` default	N/A	`mistral-small-latest` default
Image analysis	`claude-sonnet-4-5` default	N/A	user-selected vision model
Model source	fetched from OpenCode API	fetched from OpenCode API	fetched from `api.mistral.ai/v1/models`
Key storage	`safeStorage` (keychain)	`safeStorage` (keychain)	`safeStorage` (keychain)

Verification

Run npm test — all existing + new tests pass
Run npm run build — clean build
Manual: set Mistral API key in Settings, verify validation
Manual: select Mistral Large, send chat message, verify response completes
Manual: use view_image tool in chat with Mistral model, verify vision works
Manual: verify tool calling works (search_posts, list_posts, etc.)
Manual: verify OpenCode models still work unchanged
Manual: verify Mistral-only mode (no OpenCode key) — chat works, title generates, readiness shows correctly
Manual: verify analyzeMediaImage() and analyzeTaxonomy() with Mistral model
Manual: configure title generation model in Settings, verify titles use selected model
Manual: configure image analysis model in Settings, verify media analysis uses selected model (independent of chat model)
Manual: verify SSE streaming — text appears token-by-token (not as a single block after long wait)
Manual: verify abort during streaming — text stops immediately, no wasted response
Manual: verify keychain storage — API keys are encrypted, not stored as plain text in SQLite
Manual: verify old plain-text key is deleted on first launch after update (user re-enters key)

Resolved Decisions

analyzeMediaImage() — Configurable via Settings preference; user selects a dedicated vision model independent of chat model; dropdown only shows vision-capable models
generateConversationTitle() — Configurable via Settings preference; user selects cheapest/fastest model for auto-titling
checkReady() — Returns true if any provider key is set; reports per-provider availability
Default model — User-driven; set explicitly in Preferences when configuring provider + model; all surfaces (ChatPanel, AssistantSidebar, ImportAnalysisView) use this preference
Vision in OpenAI path — Fix image_url conversion for all OpenAI-compatible providers (not just Mistral)
MCP server — N/A; only exposes tools, no bDS-side AI runs
Python API — N/A; AI/chat not exposed via Python API
Model dropdown grouping — SettingsView uses native <select> with <optgroup>; ChatPanel uses custom CSS dropdown with provider header dividers; shared utility extracts grouping/filtering logic while each surface keeps its own rendering pattern
SSE streaming — Convert all chat HTTP calls to stream: true + SSE parsing; keep one-shot requests (title, image analysis, taxonomy, validation) non-streaming. Renderer needs zero changes — existing onDelta pipeline already supports incremental tokens. Token usage requires stream_options: { include_usage: true } for OpenAI/Mistral format; Anthropic provides usage in message_start + message_delta events
OpenAI tool-call history — Within a single sendMessage() call, tool results are tracked correctly across rounds. The fix is about persisting tool role messages to DB-backed conversation history so cross-session resume works
ImportAnalysisView — Has its own model selector; apply same provider grouping; default to Preferences model
AssistantSidebar — No model selector of its own; uses Preferences default model; no code changes needed
tool_choice — Do NOT set tool_choice: "any" for Mistral (this forces tool use every turn). Omit it entirely; Mistral defaults to "auto", same as OpenCode. Set parallel_tool_calls: false explicitly since our tool executor is sequential
No separate sendMistralRequest() — Parameterize sendOpenAIMessage() with URL/key/options instead of creating a near-duplicate method; Mistral uses the identical OpenAI-compatible format
detectProvider() prefixes — Cover all Mistral model families: mistral, ministral, devstral, codestral, pixtral
formatModelName() / UPPERCASE_PREFIXES — No changes needed; all 5 Mistral models are in MODEL_DISPLAY_NAMES; auto-format fallback handles future unknown models correctly
Context budgets — Stored in MODEL_CONTEXT_BUDGETS map; passed explicitly to truncateToTokenBudget() per provider path; OpenCode defaults to 150k, Mistral per-model (see Target Models table)
Error UX for removed provider key — Inline error banner in ChatPanel (not a toast) with link to Settings; sendMessage() returns descriptive error string; checkReady() stays true if any provider available
Zustand store — No changes needed; provider readiness is ephemeral (fetched on mount), token usage tracking is already in store and is provider-agnostic
validateMistralApiKey() — Calls GET https://api.mistral.ai/v1/models with Bearer token; checks for HTTP 200 + non-empty data array; Mistral returns { data: [{ id, object, created, owned_by }] } format
Model cache merge — getAvailableModels() fetches from both provider endpoints when both keys are set, merges into a single list with provider field on each model; when only one key is set, only that provider's models are returned; when no keys are set, returns empty list and UI disables the model dropdown
Provider-aware API key guards — analyzeTaxonomy() and analyzeMediaImage() have if (!this.apiKey) early-return guards that must become provider-aware (check the relevant provider's key based on the selected model)
useChatMessageSender hook — Shared by ChatPanel and AssistantSidebar; verify no provider assumptions exist (expected: no changes needed)
ChatEngine generic settings — Use existing getSetting()/setSetting() for non-sensitive preferences; API keys use SecureKeyStore (keychain)
SettingsView model state type — Currently {id: string; name: string}[]; must be updated to ChatModel[] to include provider and vision fields for grouping and filtering
PR structure — Split into 3 PRs: PR 1 (SSE streaming), PR 2 (keychain migration), PR 3 (Mistral integration). Reduces risk and allows independent review/testing
Model IDs — Use "latest" aliases (mistral-large-latest, etc.) not dated IDs. Models auto-update when Mistral releases new versions; getAvailableModels() fetches actual model list from API
API key storage — All API keys (OpenCode + Mistral) stored via Electron safeStorage (OS keychain). Cross-platform: macOS Keychain, Windows DPAPI, Linux libsecret. No plain-text fallback — old plain-text keys are deleted on startup; users re-enter keys after upgrade
Model fallback filtering — getAvailableModels() fallback list (from MODEL_DISPLAY_NAMES) filtered by available provider keys. Only shows models whose provider has a configured key, even in fallback mode
requestProvider routing — The requestProvider lambda in sendMessage() must pass provider-specific URL/key/options to sendOpenAIMessage() via getProviderConfig() helper
Vision capability map — MODEL_CAPABILITIES static map provides vision: boolean per model ID, since neither Mistral nor OpenCode APIs expose this field. OpenCode models also need vision flags for the image analysis dropdown filter
ModelInfo / ChatModel unification — Remove internal ModelInfo type; use ChatModel (with vision field) everywhere: engine, IPC, renderer
Retry logic — All providers get retry-with-exponential-backoff for transient HTTP errors (429, 502, 503). Max 3 retries, ~1s/2s/4s with jitter. Respects Retry-After header for 429. Implemented in PR 1 as part of the HTTP infrastructure
validateApiKey() model filtering — Filter returned models by isProviderKeySet() to avoid showing Mistral models on OpenCode key validation (and vice versa). Same pattern as getAvailableModels() fallback filtering
ImportAnalysisView dropdown — Uses custom CSS dropdown with <button> elements (same pattern as ChatPanel, not native <select>); apply provider group headers as dividers
Removed-key error for existing conversations — Existing open conversations whose model belongs to a removed provider show the same inline error banner on next sendMessage(), not a silent failure

48 KiB Raw Blame History Unescape Escape