Releases: ggml-org/llama.cpp
b5831
b5830
eval-callback : check for empty input (#14539)
b5829
test-backend-ops: add support for specifying output format (#14368)
* test-backend-ops: add support for specifying output format
* Address review comments
* Add build_commit and build_number in test_result
* Address review comments
* refactor
* Get build commit from ggml_commit()
* Merge errors into test_operation_info && address review comments
* Address review comments
* Address review comments
* remove visitor nonsense
* remove visitor comment
* Address review comments
Signed-off-by: Xiaodong Ye
Co-authored-by: slaren
b5828
metal : disable fast math in all quantize kernels (#14528) ggml-ci
b5827
batch : add optional for sequential equal split (#14511) ggml-ci
b5826
graph : prepare for 4D mask (#14515) ggml-ci
b5825
batch : add n_used count (#14512) ggml-ci
b5824
CANN: Replace aclrtMemsetSync with aclnnInplaceZero operator (#14002)
Co-authored-by: luyuhong
b5823
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
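For readers unfamiliar with the two variants named in this entry, the following is a minimal reference sketch of the underlying math, not the ggml implementation. It assumes the usual GEGLU definition (an activated half multiplied elementwise by a gate half), that the ERF variant uses the exact erf-based GELU, and that the QUICK variant uses the sigmoid approximation x * sigmoid(1.702 * x). The function names and the split-in-half row layout are hypothetical and chosen for illustration only.

```python
import math


def gelu_erf(x: float) -> float:
    # Exact GELU: x * Phi(x), where Phi is the standard normal CDF.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_quick(x: float) -> float:
    # Sigmoid-based approximation of GELU: x * sigmoid(1.702 * x).
    return x / (1.0 + math.exp(-1.702 * x))


def geglu(row, act):
    # Assumed layout (illustrative): the first half of the row is passed
    # through the activation, the second half is the elementwise multiplier.
    half = len(row) // 2
    return [act(a) * g for a, g in zip(row[:half], row[half:])]


if __name__ == "__main__":
    row = [0.0, 1.0, 2.0, 3.0]
    print(geglu(row, gelu_erf))
    print(geglu(row, gelu_quick))
```

The two variants differ only in the activation applied to the first half, so they agree closely for small inputs and diverge slightly as the magnitude grows.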
b5822
opencl : broadcast for soft_max (#14510)