53
1
Prompt processing vs. generation: two phases, opposite bottlenecks (vettedconsumer.com)
3
Why long context eats your VRAM: the KV cache explained (vettedconsumer.com)
3
Show HN: Quant Picker – which GGUF file fits your model and machine (vettedconsumer.com)
2
Mixture-of-Experts (Moe), Explained: Why "Active Parameters" Decide What Runs (vettedconsumer.com)
2
GGUF vs. GPTQ vs. AWQ: The Plain-English Guide to LLM Quantization (vettedconsumer.com)
1
GGUF vs. GPTQ vs. AWQ: The Plain-English Guide to LLM Quantization (vettedconsumer.com)
5
Death by AI: Israel using machine learning to designate and kill suspects (agoraroad.com)
2
The death of the Yesterweb and "The Web Revival" movement (agoraroad.com)
2
What I Learned from Citadel’s Training Software (nairachan.com)
3