Tags · prosyslab-classroom/llama.cpp

b3639

vulkan : fix build (#0)

ggml-ci

Aug 27, 2024
20f1789
zip
tar.gz

b3417

convert-*.py: add general.name kv override (ggml-org#8571)

Jul 19, 2024
3d0e436
zip
tar.gz

b3281

convert-hf : print output file name when completed (ggml-org#8181)

* convert-hf : print output file name when completed

This commit adds the output file name to the log message when the
conversion is completed.

The motivation for this change is that when the `--outfile` option is not
specified, it might not be obvious where the output file is written.

With this change the output of running the script will be something like
the following:
```console
INFO:hf-to-gguf:Model successfully exported to models/gemma-2-9b-it.gguf.
```

Signed-off-by: Daniel Bevenius <[email protected]>

* squash! convert-hf : print output file name when completed

Updates the output to support printing the directory if the output is
split into multiple files. Also, the output file name is now retrieved
from the model_instance object.

Signed-off-by: Daniel Bevenius <[email protected]>

* squash! convert-hf : print output file name when completed

Use parent attribute of Path object and string interpolation.

Signed-off-by: Daniel Bevenius <[email protected]>

* squash! convert-hf : print output file name when completed

Use os.sep instead of hardcoding the path separator.

Signed-off-by: Daniel Bevenius <[email protected]>

---------

Signed-off-by: Daniel Bevenius <[email protected]>
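
The behaviour described in this commit message (log the output file, or the containing directory when the export is split, using the `parent` attribute of a `Path` and `os.sep`) could be sketched roughly as follows. This is only an illustration, not the code from the conversion script: the function `describe_output` and its parameters `fname_out` and `is_split` are hypothetical names, and the message wording is modelled on the sample log line above.

```python
import logging
import os
from pathlib import Path

# Logger name taken from the sample log line in the commit message.
logger = logging.getLogger("hf-to-gguf")


def describe_output(fname_out: Path, is_split: bool) -> str:
    """Build a completion message for a finished export.

    Hypothetical helper: `fname_out` is the output path and `is_split`
    says whether the export was written as several files.
    """
    if is_split:
        # A split export produces several files, so report the directory.
        # os.sep avoids hardcoding "/" as the path separator.
        return f"Model successfully exported to {fname_out.parent}{os.sep}"
    return f"Model successfully exported to {fname_out}"


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    logger.info(describe_output(Path("models") / "gemma-2-9b-it.gguf", is_split=False))
    logger.info(describe_output(Path("models") / "gemma-2-9b-it.gguf", is_split=True))
```

Returning the message as a string, rather than logging inside the helper, keeps the path handling easy to check in isolation.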

Jul 2, 2024
023b880
zip
tar.gz

b3218

CUDA: fix matrix multiplication algorithm choice (ggml-org#8102)

Jun 24, 2024
2df373a
zip
tar.gz

b3214

CUDA: optimize MMQ int8 tensor core performance (ggml-org#8062)

* CUDA: optimize MMQ int8 tensor core performance

* only a single get_mma_tile_x_k function

* simplify code, make functions constexpr

Jun 24, 2024
9a590c8
zip
tar.gz

b3189

[SYCL] Fix windows build and inference (ggml-org#8003)

* add sycl preset

* fix debug link error. fix windows crash

* update README

Jun 20, 2024
de391e4
zip
tar.gz

b3080

Per token attributes (ggml-org#7685)

* Add per token attributes enum
* Using phi-3 for testing 'rstrip'
* Using jina-v2 for testing 'lstrip'
* Brute force test for 'lstrip' and 'rstrip'
* Implement 'rstrip' and 'lstrip'
* Update phi-3 GGUF file (obsolete since 917dc8c)
* Replace llama_token_type with llama_token_attribs
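
As a rough illustration of what per-token 'lstrip' and 'rstrip' attributes can mean, the sketch below models them as bit flags and applies them to the text on either side of a special token. It assumes the common tokenizer convention that 'lstrip' removes whitespace immediately to the left of the token and 'rstrip' removes whitespace immediately to its right. The names `TokenAttr` and `apply_strip` are hypothetical and are not the llama.cpp API.

```python
from enum import IntFlag


class TokenAttr(IntFlag):
    """Hypothetical per-token attribute flags (illustrative names only)."""
    NONE = 0
    LSTRIP = 1  # strip whitespace to the left of the token
    RSTRIP = 2  # strip whitespace to the right of the token


def apply_strip(left: str, right: str, attrs: TokenAttr) -> tuple[str, str]:
    """Return the text surrounding a special token after applying its flags.

    `left` is the text before the token and `right` the text after it.
    """
    if attrs & TokenAttr.LSTRIP:
        # Whitespace at the end of the left-hand text touches the token.
        left = left.rstrip()
    if attrs & TokenAttr.RSTRIP:
        # Whitespace at the start of the right-hand text touches the token.
        right = right.lstrip()
    return left, right


if __name__ == "__main__":
    print(apply_strip("Hello ", " world", TokenAttr.LSTRIP | TokenAttr.RSTRIP))
```

Using a flag type lets a single token carry several attributes at once, which a one-value-per-token type enum cannot express.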

Jun 4, 2024
3b38d48
zip
tar.gz
