Documentation
Curated notes and guides for getting the best out of Llama‑Build and llama.cpp.
GPU sweet spots
Coming soon. Planned: a quick matrix of VRAM/throughput trade‑offs and suggested quantization for popular NVIDIA cards.
Configs
Coming soon. Planned: example cmake flags, batch snippets, and perf‑tuning recipes.