Commit Graph

30 Commits

Author SHA1 Message Date
e40a2f3c45 feat: implement phase 2 of session-cache-upgrade.md 2026-03-20 08:57:54 +01:00
e98e5fd88b Implement session cache upgrade phase 1 foundation 2026-03-20 08:35:37 +01:00
41199cb9bc chore: added an implementation plan to rework the OpenAI API server to be more performant 2026-03-19 19:07:31 +01:00
577fdf8950 feat: more visibility of prefilling 2026-03-19 11:36:46 +01:00
49bd165ce7 fix: more telemetry and tighter implementation of cache 2026-03-19 11:30:18 +01:00
c2e80e4066 chore: small update on qwen 3.5 9b actually having vision 2026-03-19 11:20:26 +01:00
25df02daa1 feat: auto-save on close and auto-load on reopen 2026-03-19 09:44:35 +01:00
b52633a301 image support for qwen3.5 9b activated again 2026-03-19 08:56:03 +01:00
f0db0c0938 feat: document saving/loading 2026-03-18 15:29:02 +01:00
09b94b32d0 feat: scene management for RP settings 2026-03-18 14:57:29 +01:00
ed1c91cd2b chore: added a plan for proper document support 2026-03-18 13:22:12 +01:00
27849ccbd7 feat: added stheno (llambda based) text-only model, too 2026-03-18 13:08:21 +01:00
6a87fe6f08 fix: export finally works 2026-03-18 11:59:51 +01:00
82a77fdb0a feat: first tries at save dialog, so far failing 2026-03-18 11:40:43 +01:00
af8b8c9532 feat: copy-paste image files from finder 2026-03-18 09:25:26 +01:00
07b71f90ec feat: start of support for thinking mode, qwen 3.5 9b addition and better idle time handling 2026-03-18 09:16:47 +01:00
ed6cc5f5d1 fix: better handling of API stuff, still not where internal chat is 2026-03-17 21:24:04 +01:00
20f9c0bcc4 feat: settings for default model 2026-03-17 20:20:54 +01:00
aa2712555a feat: idle-unload of models 2026-03-17 20:01:44 +01:00
033443589c fea: added a proper icon 2026-03-17 19:57:39 +01:00
1a67311874 feat: inference visualisation 2026-03-17 19:30:09 +01:00
5313b7175e feat: complete rewrite to swift 2026-03-17 19:12:54 +01:00
c80fe97f41 feat: added gemma 3n E4B as another model for fast response 2026-03-17 13:24:43 +01:00
cc4f937d9a feat: proper support for context size 2026-03-17 12:34:11 +01:00
540b187593 chore: added README 2026-03-17 12:07:45 +01:00
ef83c24b0b feat: hot swapping of models 2026-03-17 11:58:24 +01:00
cc6e761ed4 feat: qwen now works, too 2026-03-17 11:44:24 +01:00
bdfbd14577 fix: trying to do kv prefix caching 2026-03-17 10:04:14 +01:00
5bf170cedb removed kv quantization due to incompatibility with gemma3 2026-03-17 09:20:35 +01:00
df81afe8d7 initial commit 2026-03-17 09:14:27 +01:00