hugo/MLXServer
8 Commits 2 Branches 0 Tags
c80fe97f41efa02a8d742f32d18ac425b6de6414

7 Commits

Author | SHA1 | Message | Date
Chili Palmer | c80fe97f41 | feat: added gemma 3n E4B as another model for fast response | 2026-03-17 13:24:43 +01:00
Chili Palmer | cc4f937d9a | feat: proper support for context size | 2026-03-17 12:34:11 +01:00
Chili Palmer | ef83c24b0b | feat: hot swapping of models | 2026-03-17 11:58:24 +01:00
Chili Palmer | cc6e761ed4 | feat: qwen now works, too | 2026-03-17 11:44:24 +01:00
Chili Palmer | bdfbd14577 | fix: trying to do kv prefix caching | 2026-03-17 10:04:14 +01:00
Chili Palmer | 5bf170cedb | removed kv quantization due to incompatibility with gemma3 | 2026-03-17 09:20:35 +01:00
Chili Palmer | df81afe8d7 | initial commit | 2026-03-17 09:14:27 +01:00