added new story model with 12B size

This commit is contained in:
2026-03-21 16:08:40 +01:00
parent 8ebf19408b
commit 84a6b2229f
3 changed files with 12 additions and 0 deletions

View File

@@ -55,6 +55,7 @@ open "build/Debug/MLX Server.app"
| `qwen` | `mlx-community/Qwen3.5-4B-MLX-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
| `qwen3.5-0.8b` | `mlx-community/Qwen3.5-0.8B-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
| `qwen3.5-9b` | `mlx-community/Qwen3.5-9B-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
| `violet-lotus` | `hobaratio/MN-Violet-Lotus-12B-mlx-4Bit` | Text-only Mistral-based model (32k context) |
Any model in MLX format on HuggingFace can be added — no restriction on uploader or architecture.

View File

@@ -69,6 +69,16 @@ struct ModelConfig: Identifiable, Hashable {
supportsTools: false,
defaultGenerationSettings: .roleplayDefault
),
ModelConfig(
id: "violet-lotus",
repoId: "hobaratio/MN-Violet-Lotus-12B-mlx-4Bit",
displayName: "Violet Lotus 12B",
contextLength: 32_768,
loaderKind: .llm,
supportsImages: false,
supportsTools: false,
defaultGenerationSettings: .roleplayDefault
),
]
static let `default` = availableModels[0]

View File

@@ -11,6 +11,7 @@ Native macOS app for running local LLMs on Apple Silicon via [MLX](https://githu
| `qwen3.5-0.8b` | `mlx-community/Qwen3.5-0.8B-4bit` | 256k | `VLMModelFactory` | Vision, thinking mode, tool use (`<tool_call>` tags) |
| `qwen3.5-9b` | `mlx-community/Qwen3.5-9B-4bit` | 256k | `VLMModelFactory` | Vision, thinking mode, tool use (`<tool_call>` tags) |
| `stheno` | `synk/L3-8B-Stheno-v3.2-MLX` | 8k | `LLMModelFactory` | Text-only, llama-based |
| `violet-lotus` | `hobaratio/MN-Violet-Lotus-12B-mlx-4Bit` | 32k | `LLMModelFactory` | Text-only, Mistral-based |
Any model in MLX format on HuggingFace can be added — there is no restriction on uploader or architecture.