Add Violet Lotus 12B story model (text-only, Mistral-based, 32k context)

2026-03-21 16:08:40 +01:00
parent 8ebf19408b
commit 84a6b2229f
3 changed files with 12 additions and 0 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -55,6 +55,7 @@ open "build/Debug/MLX Server.app"
 | `qwen` | `mlx-community/Qwen3.5-4B-MLX-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
 | `qwen3.5-0.8b` | `mlx-community/Qwen3.5-0.8B-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
 | `qwen3.5-9b` | `mlx-community/Qwen3.5-9B-4bit` | Vision + thinking mode + tool use via `<tool_call>` tags (256k context) |
+| `violet-lotus` | `hobaratio/MN-Violet-Lotus-12B-mlx-4Bit` | Text-only Mistral-based model (32k context) |

 Any model in MLX format on HuggingFace can be added — no restriction on uploader or architecture.

--- a/MLXServer/Models/ModelConfig.swift
+++ b/MLXServer/Models/ModelConfig.swift
@@ -69,6 +69,16 @@ struct ModelConfig: Identifiable, Hashable {
            supportsTools: false,
            defaultGenerationSettings: .roleplayDefault
        ),
+        ModelConfig(
+            id: "violet-lotus",
+            repoId: "hobaratio/MN-Violet-Lotus-12B-mlx-4Bit",
+            displayName: "Violet Lotus 12B",
+            contextLength: 32_768,
+            loaderKind: .llm,
+            supportsImages: false,
+            supportsTools: false,
+            defaultGenerationSettings: .roleplayDefault
+        ),
    ]

    static let `default` = availableModels[0]
--- a/README.md
+++ b/README.md
@@ -11,6 +11,7 @@ Native macOS app for running local LLMs on Apple Silicon via [MLX](https://githu
 | `qwen3.5-0.8b` | `mlx-community/Qwen3.5-0.8B-4bit` | 256k | `VLMModelFactory` | Vision, thinking mode, tool use (`<tool_call>` tags) |
 | `qwen3.5-9b` | `mlx-community/Qwen3.5-9B-4bit` | 256k | `VLMModelFactory` | Vision, thinking mode, tool use (`<tool_call>` tags) |
 | `stheno` | `synk/L3-8B-Stheno-v3.2-MLX` | 8k | `LLMModelFactory` | Text-only, llama-based |
+| `violet-lotus` | `hobaratio/MN-Violet-Lotus-12B-mlx-4Bit` | 32k | `LLMModelFactory` | Text-only, Mistral-based |

 Any model in MLX format on HuggingFace can be added — there is no restriction on uploader or architecture.