Skip to content

Actions: ggml-org/llama.cpp

Server

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
11,776 workflow runs
11,776 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Prefilling assistant message in openai compatible API
Server #13272: Pull request #13174 synchronize by matteoserva
April 29, 2025 07:50 10m 34s matteoserva:prefill
April 29, 2025 07:50 10m 34s
llama : set qwen3 model type sizes
Server #13271: Pull request #13175 opened by CISC
April 29, 2025 07:38 11m 32s cisc/qwen3-sizes
April 29, 2025 07:38 11m 32s
Prefilling assistant message in openai compatible API
Server #13270: Pull request #13174 synchronize by matteoserva
April 29, 2025 07:28 9m 10s matteoserva:prefill
April 29, 2025 07:28 9m 10s
Prefilling assistant message in openai compatible API
Server #13269: Pull request #13174 opened by matteoserva
April 29, 2025 07:22 6m 48s matteoserva:prefill
April 29, 2025 07:22 6m 48s
sampling : when top-k <= 0 -> noop
Server #13268: Pull request #13173 opened by ggerganov
April 29, 2025 07:02 15m 58s gg/top-k-fix
April 29, 2025 07:02 15m 58s
llama-graph : fix text position for mrope (#13159)
Server #13267: Commit b6ce743 pushed by ggerganov
April 29, 2025 06:45 9m 44s master
April 29, 2025 06:45 9m 44s
llama-graph : fix text position for mrope
Server #13264: Pull request #13159 synchronize by ngxson
April 28, 2025 22:08 8m 22s ngxson:xsn/qwen2vl_fix_text_pos
April 28, 2025 22:08 8m 22s
llama-graph : fix text position for mrope
Server #13263: Pull request #13159 synchronize by ngxson
April 28, 2025 22:07 1m 29s ngxson:xsn/qwen2vl_fix_text_pos
April 28, 2025 22:07 1m 29s
model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architectur…
Server #13260: Commit 5f5e39e pushed by ggerganov
April 28, 2025 19:52 37m 21s master
April 28, 2025 19:52 37m 21s
clip : fix model size display (#13153)
Server #13259: Commit eaea325 pushed by ngxson
April 28, 2025 19:23 36m 50s master
April 28, 2025 19:23 36m 50s
fix(rpc): Improve input validation and error handling (#13069)
Server #13258: Commit 43ddab6 pushed by rgerganov
April 28, 2025 18:00 1h 33m 26s master
April 28, 2025 18:00 1h 33m 26s
Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture
Server #13256: Pull request #12466 synchronize by cebtenzzre
April 28, 2025 16:10 2h 17m 7s manyoso:nomic_embed_v2
April 28, 2025 16:10 2h 17m 7s
clip : fix model size display
Server #13255: Pull request #13153 opened by ngxson
April 28, 2025 15:19 2h 26m 35s ngxson:xsn/clip_fix_model_size_display
April 28, 2025 15:19 2h 26m 35s
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
Server #13254: Pull request #12858 synchronize by Alcpz
April 28, 2025 14:57 2h 29m 14s Alcpz:Alcpz/mmvq_q4_0_reorder
April 28, 2025 14:57 2h 29m 14s
mtmd : add qwen2vl and qwen2.5vl
Server #13251: Pull request #13141 synchronize by ngxson
April 28, 2025 14:54 1h 10m 35s ngxson:xsn/mtmd_qwen2vl
April 28, 2025 14:54 1h 10m 35s
llama-bench: add -d depth arg (#13096)
Server #13250: Commit 1831f53 pushed by JohannesGaessler
April 28, 2025 14:50 2h 2m 19s master
April 28, 2025 14:50 2h 2m 19s
mtmd : add qwen2vl and qwen2.5vl
Server #13248: Pull request #13141 synchronize by ngxson
April 28, 2025 14:13 41m 8s ngxson:xsn/mtmd_qwen2vl
April 28, 2025 14:13 41m 8s