Releases · ggml-org/llama.cpp

05 Jul 06:41

6681688

b5831 Latest

opencl: add GELU_ERF (#14476)

Assets 15

cudart-llama-bin-win-cuda-12.4-x64.zip
sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6
373 MB 2025-07-05T06:41:34Z

llama-b5831-bin-macos-arm64.zip
sha256:e2243cd7f0bf2f142912996d6954b8628d66ac8c89340ff8d93b42c11ca08a29
10.5 MB 2025-07-05T06:41:47Z

llama-b5831-bin-macos-x64.zip
sha256:05fdad2fe1ed455c21db3bbdc9714dca88a3c24160a075f23007367bf0a94263
26.3 MB 2025-07-05T06:41:48Z

llama-b5831-bin-ubuntu-vulkan-x64.zip
sha256:6296e8f50d870c1805cb9d0e077cd5c92f5860fbba6adc13d5916fdb15b82bf5
20.1 MB 2025-07-05T06:41:50Z

llama-b5831-bin-ubuntu-x64.zip
sha256:0436db686e37cdf68bebab8384d8b3c1b218c650e6f0a16046bde0227d96a12a
12.4 MB 2025-07-05T06:41:51Z

llama-b5831-bin-win-cpu-arm64.zip
sha256:29df0dc321411a41d82db2ce0e0d60a92ab9c0c1af07b0053eeb8affacad922e
10.8 MB 2025-07-05T06:41:52Z

llama-b5831-bin-win-cpu-x64.zip
sha256:bb33b7802b04975e9723c11b539cac41950b37b86b2aeb530f510f1acbc04a24
13.6 MB 2025-07-05T06:41:53Z

llama-b5831-bin-win-cuda-12.4-x64.zip
sha256:570aaccb767a60d57e2073b1871f61a15b5ad0b7214f929d02ff2ec63fb9b6a3
128 MB 2025-07-05T06:41:54Z

llama-b5831-bin-win-hip-radeon-x64.zip
sha256:da858a35e7bbeab64ac60c3638cecc6c487bc2e4ec33a93a0adaf18d4a90b412
298 MB 2025-07-05T06:41:59Z

llama-b5831-bin-win-opencl-adreno-arm64.zip
sha256:866aafc89dd4ebcd4b8db10137ed16f104dc6bb251476affa9930918b7dd1f45
11.1 MB 2025-07-05T06:42:13Z

Source code (zip)
2025-07-05T06:24:56Z

Source code (tar.gz)
2025-07-05T06:24:56Z
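
Each binary asset above is published with a sha256 digest, so a downloaded archive can be checked before it is unpacked. The following is a minimal sketch of such a check, not part of the release itself. The helper names `sha256_of_file` and `matches_published` are hypothetical, and it assumes the archive has already been saved locally.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns an empty bytes object
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, published):
    """Compare a file's digest with a published 'sha256:<hex>' string."""
    # Accept the value with or without the 'sha256:' prefix used in the listing
    expected = published.split(":", 1)[-1].strip().lower()
    return sha256_of_file(path) == expected


# Example call (the local file name is an assumption about where the asset was saved):
# matches_published(
#     "llama-b5831-bin-ubuntu-x64.zip",
#     "sha256:0436db686e37cdf68bebab8384d8b3c1b218c650e6f0a16046bde0227d96a12a",
# )
```

Reading in chunks keeps memory use flat, which matters for the larger archives such as the 373 MB CUDA runtime package.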

05 Jul 05:15

github-actions

b5830

bac8bed

b5830

eval-callback : check for empty input (#14539)

Assets 15

05 Jul 05:08

github-actions

b5829

b81510a

b5829

test-backend-ops: add support for specifying output format (#14368)

* test-backend-ops: add support for specifying output format

Signed-off-by: Xiaodong Ye <[email protected]>

* Address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

* Add build_commit and build_number in test_result

Signed-off-by: Xiaodong Ye <[email protected]>

* Address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

* refactor

Signed-off-by: Xiaodong Ye <[email protected]>

* Get build commit from ggml_commit()

Signed-off-by: Xiaodong Ye <[email protected]>

* Merge errors into test_operation_info && address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

* Address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

* Address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

* remove visitor nonsense

* remove visitor comment

Signed-off-by: Xiaodong Ye <[email protected]>

* Address review comments

Signed-off-by: Xiaodong Ye <[email protected]>

---------

Signed-off-by: Xiaodong Ye <[email protected]>
Co-authored-by: slaren <[email protected]>

Assets 15

04 Jul 16:39

github-actions

b5828

ef797db

b5828

metal : disable fast math in all quantize kernels (#14528)

ggml-ci

Assets 15

04 Jul 06:58

github-actions

b5827

67d1ef2

b5827

batch : add optional for sequential equal split (#14511)

ggml-ci

Assets 15

04 Jul 06:54

github-actions

b5826

7b50f7c

b5826

graph : prepare for 4D mask (#14515)

ggml-ci

Assets 15

04 Jul 06:37

github-actions

b5825

c79184d

b5825

batch : add n_used count (#14512)

ggml-ci

Assets 15

04 Jul 04:04

github-actions

b5824

499a8f5

b5824

CANN: Replace aclrtMemsetSync with aclnnInplaceZero operator (#14002)

Co-authored-by: luyuhong <[email protected]>

Assets 15

03 Jul 22:14

github-actions

b5823

28657a8

b5823

ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)

Assets 15
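
For readers unfamiliar with the op names in b5823, the sketch below illustrates the usual definitions behind them: GELU in its exact erf form, the "quick" sigmoid approximation, and a gated (GLU-style) combination of two halves of a row. It is a plain-Python illustration under stated assumptions, not the ggml implementation. The function names are hypothetical, and the choice of which half is activated is an assumption.

```python
import math


def gelu_erf(x):
    """Exact GELU: 0.5 * x * (1 + erf(x / sqrt(2)))."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_quick(x):
    """Sigmoid-based approximation: x * sigmoid(1.702 * x)."""
    return x / (1.0 + math.exp(-1.702 * x))


def geglu(row, act):
    """Gated unit: activate the first half of the row, multiply by the second.

    Assumption: the first half is passed through the activation; a real
    backend may swap the halves or take them as separate tensors.
    """
    half = len(row) // 2
    return [act(a) * b for a, b in zip(row[:half], row[half:])]


# GEGLU_ERF- and GEGLU_QUICK-style results for one row of four values
erf_out = geglu([0.0, 1.0, 2.0, 3.0], gelu_erf)
quick_out = geglu([0.0, 1.0, 2.0, 3.0], gelu_quick)
```

The two variants differ only in the activation applied to the gated half, which is why they can share one gating routine.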

03 Jul 19:24

github-actions

b5822

bee2842

b5822

opencl : broadcast for soft_max (#14510)

Assets 15
