Documentation

Curated notes and guides for getting the best out of Llama‑Build and llama.cpp.

GPU sweet spots

Coming soon. Planned: a quick matrix of VRAM/throughput trade‑offs and suggested quantization for popular NVIDIA cards.

Configs

Coming soon. Planned: example cmake flags, batch snippets, and perf‑tuning recipes.