Skip to content

Instantly share code, notes, and snippets.

View philipturner's full-sized avatar
🏠
Working from home

Philip Turner philipturner

🏠
Working from home
View GitHub Profile
@philipturner
philipturner / llama-7B.ggml
Created June 4, 2023 17:55 — forked from ggerganov/llama-7B.ggml
LLaMA 7B ggml computation graph
20:47:34 ▶ metal ▶ 12⎘ ▶ $ ▶ ./bin/main -m ../models/7B/ggml-model-q4_0.bin --export
main: build = 652 (b252acb)
main: seed = 1685900854
llama.cpp: loading model from ../models/7B/ggml-model-q4_0.bin
llama_model_load_internal: format = ggjt v3 (latest)
llama_model_load_internal: n_vocab = 32000
llama_model_load_internal: n_ctx = 512
llama_model_load_internal: n_embd = 4096
llama_model_load_internal: n_mult = 256
llama_model_load_internal: n_head = 32
CCV_NNC_GEMM_FORWARD [1]: [3] -> [1] (0)
|-> 1. 0x1438bd420 (0x285d90fc0:0) [2x320] 0.517578 0.953613 -0.921875 ..
|-> 2. 0x1438bd570 (0x285d841c0:0) [1280x320] -0.001888 0.001598 0.001110 ..
|-> 3. 0x1438bd5e0 (0x285d84280:0) [1280] -0.019775 0.008278 0.010788 ..
|<- 1. 0x1438a0000 (0x285da5600:0) [2x1280] 0.044556 -0.020798 0.078064 ..
CCV_NNC_SWISH_FORWARD [2]: [1] -> [1] (0)
|-> 1. 0x1438a0000 (0x285da5600:0) [2x1280] 0.044556 -0.020798 0.078064 ..
|<- 1. 0x1438a0000 (0x285da5600:0) [2x1280] 0.022781 -0.010292 0.040558 ..
CCV_NNC_GEMM_FORWARD [3]: [3] -> [1] (0)
|-> 1. 0x1438a0000 (0x285da5600:0) [2x1280] 0.022781 -0.010292 0.040558 ..