MLXServer

Author	SHA1	Message	Date
Chili Palmer	e40a2f3c45	feat: implement phase 2 of session-cache-upgrade.md	2026-03-20 08:57:54 +01:00
Chili Palmer	e98e5fd88b	Implement session cache upgrade phase 1 foundation	2026-03-20 08:35:37 +01:00
Chili Palmer	41199cb9bc	chore: added an implementation plan to rework the OpenAI API server to be more performant	2026-03-19 19:07:31 +01:00
Chili Palmer	577fdf8950	feat: more visibility of prefilling	2026-03-19 11:36:46 +01:00
Chili Palmer	49bd165ce7	fix: more telemetry and tighter implementation of cache	2026-03-19 11:30:18 +01:00
Chili Palmer	c2e80e4066	chore: small update on qwen 3.5 9b actually having vision	2026-03-19 11:20:26 +01:00
Chili Palmer	25df02daa1	feat: auto-save on close and auto-load on reopen	2026-03-19 09:44:35 +01:00
Chili Palmer	b52633a301	image support for qwen3.5 9b activated again	2026-03-19 08:56:03 +01:00
Chili Palmer	f0db0c0938	feat: document saving/loading	2026-03-18 15:29:02 +01:00
Chili Palmer	09b94b32d0	feat: scene management for RP settings	2026-03-18 14:57:29 +01:00
Chili Palmer	ed1c91cd2b	chore: added a plan for proper document support	2026-03-18 13:22:12 +01:00
Chili Palmer	27849ccbd7	feat: added stheno (Llama-based) text-only model, too	2026-03-18 13:08:21 +01:00
Chili Palmer	6a87fe6f08	fix: export finally works	2026-03-18 11:59:51 +01:00
Chili Palmer	82a77fdb0a	feat: first tries at save dialog, so far failing	2026-03-18 11:40:43 +01:00
Chili Palmer	af8b8c9532	feat: copy-paste image files from finder	2026-03-18 09:25:26 +01:00
Chili Palmer	07b71f90ec	feat: start of support for thinking mode, qwen 3.5 9b addition and better idle time handling	2026-03-18 09:16:47 +01:00
Chili Palmer	ed6cc5f5d1	fix: better handling of API stuff, still not where internal chat is	2026-03-17 21:24:04 +01:00
Chili Palmer	20f9c0bcc4	feat: settings for default model	2026-03-17 20:20:54 +01:00
Chili Palmer	aa2712555a	feat: idle-unload of models	2026-03-17 20:01:44 +01:00
Chili Palmer	033443589c	feat: added a proper icon	2026-03-17 19:57:39 +01:00
Chili Palmer	1a67311874	feat: inference visualisation	2026-03-17 19:30:09 +01:00
Chili Palmer	5313b7175e	feat: complete rewrite to swift	2026-03-17 19:12:54 +01:00
Chili Palmer	c80fe97f41	feat: added gemma 3n E4B as another model for fast response	2026-03-17 13:24:43 +01:00
Chili Palmer	cc4f937d9a	feat: proper support for context size	2026-03-17 12:34:11 +01:00
Chili Palmer	540b187593	chore: added README	2026-03-17 12:07:45 +01:00
Chili Palmer	ef83c24b0b	feat: hot swapping of models	2026-03-17 11:58:24 +01:00
Chili Palmer	cc6e761ed4	feat: qwen now works, too	2026-03-17 11:44:24 +01:00
Chili Palmer	bdfbd14577	fix: trying to do kv prefix caching	2026-03-17 10:04:14 +01:00
Chili Palmer	5bf170cedb	removed kv quantization due to incompatibility with gemma3	2026-03-17 09:20:35 +01:00
Chili Palmer	df81afe8d7	initial commit	2026-03-17 09:14:27 +01:00

30 Commits