ollama

mirror of https://github.com/ollama/ollama.git synced 2026-05-13 14:27:00 +00:00

Daniel Hiltgen 87288ced4f New models (#15861)	2026-04-28 11:50:12 -07:00

* mlx: add laguna model support
* convert: support fp8 safetensors import

  Decode HF F8_E4M3 safetensors with block scale companions into GGUF-supported tensor types, and record which output tensors came from FP8 source weights.

  Use that source-precision metadata during create quantization: default FP8-sourced GGUFs to Q8_0, keep non-FP8 tensors at their original precision for Q8_0, and promote non-FP8 quantizable tensors to Q8_0 for Q4_K requests.
* ggml: add laguna model support
* server: preserve generate logprobs with builtin parsers

  Generate requests were dropping logprob-only chunks whenever a builtin parser buffered visible content. Chat already handled this case, but generate only forwarded chunks with visible response, thinking, or tool-call output.

  Keep generate chunks that carry logprobs even when the builtin parser has not flushed visible content yet, and add a regression test that exercises the behavior with a generic thinking parser.
* review comments - perf improvements
* ggml: implement nemotron 3 nano omni
* add poolside integration
* update poolside doc
* adapt to new cache setup
* fix test
* fix test

Co-authored-by: Eva Ho <hoyyeva@gmail.com>
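
The per-tensor quantization policy described in the commit message for the fp8 safetensors import can be sketched roughly as follows. This is a minimal illustration and not the code in `convert.go`: the `tensorInfo` type, the `chooseType` function, and the tensor names are hypothetical. The treatment of FP8-sourced tensors under a Q4_K request, and of any request type other than Q8_0 and Q4_K, is an assumption, since the commit message does not state it.

```go
package main

import "fmt"

// tensorInfo is a hypothetical record for one output tensor.
// FromFP8 stands in for the source-precision metadata the commit
// message says is recorded during conversion.
type tensorInfo struct {
	Name        string
	Original    string // type after decoding, e.g. "BF16" or "F32"
	FromFP8     bool   // true if the tensor was decoded from FP8 source weights
	Quantizable bool   // false for tensors that are never quantized
}

// chooseType returns the target type for one tensor.
// requested is the quantization asked for at create time;
// the empty string means no explicit request.
func chooseType(t tensorInfo, requested string) string {
	if !t.Quantizable {
		return t.Original
	}
	switch requested {
	case "", "Q8_0":
		// FP8-sourced tensors default to Q8_0; everything else
		// keeps its original precision.
		if t.FromFP8 {
			return "Q8_0"
		}
		return t.Original
	case "Q4_K":
		// Non-FP8 quantizable tensors are promoted to Q8_0.
		// Assumption: FP8-sourced tensors take the requested Q4_K.
		if t.FromFP8 {
			return "Q4_K"
		}
		return "Q8_0"
	default:
		// Assumption: other requests apply as-is to quantizable tensors.
		return requested
	}
}

func main() {
	// Hypothetical tensors used only for illustration.
	tensors := []tensorInfo{
		{Name: "blk.0.ffn_up.weight", Original: "BF16", FromFP8: true, Quantizable: true},
		{Name: "blk.0.attn_q.weight", Original: "BF16", FromFP8: false, Quantizable: true},
		{Name: "blk.0.attn_norm.weight", Original: "F32", FromFP8: false, Quantizable: false},
	}
	for _, req := range []string{"", "Q8_0", "Q4_K"} {
		for _, t := range tensors {
			fmt.Printf("request=%q %s -> %s\n", req, t.Name, chooseType(t, req))
		}
	}
}
```

Keeping the decision in one pure function of the tensor record and the request makes each of the three rules in the commit message easy to check in isolation.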
sentencepiece	chore(all): replace instances of interface with any (#10067)	2025-04-02 09:44:27 -07:00
testdata
convert.go	New models (#15861)	2026-04-28 11:50:12 -07:00
convert_bert.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_commandr.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_deepseek2.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_deepseekocr.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma2.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma2_adapter.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma3.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma3n.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_gemma4.go	Add support for gemma4 (#15214)	2026-04-02 11:33:33 -07:00
convert_gemma4_test.go	Add support for gemma4 (#15214)	2026-04-02 11:33:33 -07:00
convert_glm4moelite.go	Re-apply "model: add MLA absorption for glm4moelite" with fix (#13870)	2026-01-23 18:40:28 -08:00
convert_glmocr.go	Add GLM-OCR vision model support (#14024)	2026-02-02 15:39:18 -08:00
convert_gptoss.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_laguna.go	New models (#15861)	2026-04-28 11:50:12 -07:00
convert_laguna_test.go	New models (#15861)	2026-04-28 11:50:12 -07:00
convert_lfm2.go	model: improvements to LFM architectures (#14368)	2026-02-23 14:38:10 -08:00
convert_lfm2_test.go	model: improvements to LFM architectures (#14368)	2026-02-23 14:38:10 -08:00
convert_lfm2_vl.go	model: improvements to LFM architectures (#14368)	2026-02-23 14:38:10 -08:00
convert_lfm2_vl_test.go	model: improvements to LFM architectures (#14368)	2026-02-23 14:38:10 -08:00
convert_llama.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_llama4.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_llama_adapter.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_mistral.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_mistral_causal.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_mixtral.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_mllama.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_nemotron_h.go	New models (#15861)	2026-04-28 11:50:12 -07:00
convert_nemotron_h_test.go	New models (#15861)	2026-04-28 11:50:12 -07:00
convert_nomicbert.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_olmo.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_phi3.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_qwen2.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_qwen3.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_qwen3next.go	model: support for qwen3.5 architecture (#14378)	2026-02-24 20:08:05 -08:00
convert_qwen3next_test.go	model: support for qwen3.5 architecture (#14378)	2026-02-24 20:08:05 -08:00
convert_qwen3vl.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_qwen25vl.go	Add experimental MLX backend and engine with imagegen support (#13648)	2026-01-08 16:18:59 -08:00
convert_test.go	Add support for gemma4 (#15214)	2026-04-02 11:33:33 -07:00
json_compat.go	models: add nemotronh architecture support (#14356)	2026-02-22 15:09:14 -08:00
json_compat_test.go	models: add nemotronh architecture support (#14356)	2026-02-22 15:09:14 -08:00
reader.go	New models (#15861)	2026-04-28 11:50:12 -07:00
reader_safetensors.go	New models (#15861)	2026-04-28 11:50:12 -07:00
reader_test.go	New models (#15861)	2026-04-28 11:50:12 -07:00
reader_torch.go	llama4	2025-04-25 16:59:20 -07:00
sentencepiece_model.proto
tensor.go	New models (#15861)	2026-04-28 11:50:12 -07:00
tensor_test.go	New models (#15861)	2026-04-28 11:50:12 -07:00
tokenizer.go	New models (#15861)	2026-04-28 11:50:12 -07:00
tokenizer_spm.go	parsers/renderers: functiongemma (#13521)	2025-12-18 07:55:37 -08:00
tokenizer_test.go	model: support for qwen3.5 architecture (#14378)	2026-02-24 20:08:05 -08:00