1
Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of AI (arxiviq.substack.com)
4 days ago | che_shr_cat | substack.com | newest
2
Fantastic Pretraining Optimizers and Where to Find Them (arxiviq.substack.com)
a week ago | che_shr_cat | substack.com | newest
2
Solving the compute crisis with physics-based ASICs (arxiviq.substack.com)
a week ago | che_shr_cat | substack.com | frontpage
2
Critiques of World Models (arxiviq.substack.com)
2 weeks ago | che_shr_cat | substack.com | newest
65
DeepConf: Scaling LLM reasoning with confidence, not just compute (arxiviq.substack.com)
2 weeks ago | che_shr_cat | substack.com | frontpage
2
V-JEPA 2: Scaling V-JEPA (gonzoml.substack.com)
2 weeks ago | che_shr_cat | substack.com | newest
1
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models (arxiviq.substack.com)
3 weeks ago | che_shr_cat | substack.com | newest
27
Tversky Neural Networks (gonzoml.substack.com)
3 weeks ago | che_shr_cat | substack.com | frontpage
1
Einstein Fields: A Neural Perspective to Computational General Relativity (arxiviq.substack.com)
a month ago | che_shr_cat | substack.com | newest
2
Tversky Neural Networks: Psychologically Plausible Deep Learning With (arxiviq.substack.com)
a month ago | che_shr_cat | substack.com | newest
39
GEPA: Reflective prompt evolution can outperform reinforcement learning (arxiviq.substack.com)
a month ago | che_shr_cat | substack.com | frontpage
2
Subliminal Learning: Language models transmit behavioral traits via hidden (arxiviq.substack.com)
a month ago | che_shr_cat | substack.com | newest
1
AlphaGo Moment for Model Architecture Discovery (arxiviq.substack.com)
a month ago | che_shr_cat | substack.com | newest
1
Paper FOMO and ICML 2025 Outstanding Papers (gonzoml.substack.com)
a month ago | che_shr_cat | substack.com | newest
1
Early Signs of Steganographic Capabilities in Frontier LLMs (arxiviq.substack.com)
2 months ago | che_shr_cat | substack.com | newest
1
Musicality in Animals (substack.com)
3 months ago | che_shr_cat | substack.com | newest
3
Text-to-LoRA Enables On-the-Fly Model Adaptation (arxiviq.substack.com)
3 months ago | che_shr_cat | substack.com | newest
1
Quantum computing and artificial intelligence: status and perspectives (arxiv.org)
3 months ago | che_shr_cat | arxiv.org | newest
1
The Most Misunderstood Feature of the Sound (substack.com)
3 months ago | che_shr_cat | substack.com | newest
1
Darwin Gödel Machine (gonzoml.substack.com)
3 months ago | che_shr_cat | substack.com | newest
2
Are Deeper LLMs Smarter, or Just Longer? (gonzoml.substack.com)
4 months ago | che_shr_cat | substack.com | newest
8
Muon Optimizer Accelerates Grokking (gonzoml.substack.com)
5 months ago | che_shr_cat | substack.com | newest
2
ThoughtTerminator (gonzoml.substack.com)
5 months ago | che_shr_cat | substack.com | newest
3
Chain of Continuous Thought (Coconut) (gonzoml.substack.com)
5 months ago | che_shr_cat | substack.com | newest
1
Intuitive Physics Emergence in V-JEPA (gonzoml.substack.com)
5 months ago | che_shr_cat | substack.com | newest
2
Sound physics and basics of sound perception (substack.com)
5 months ago | che_shr_cat | substack.com | newest
3
BLT: Byte Latent Transformer (gonzoml.substack.com)
9 months ago | che_shr_cat | substack.com | newest
1
A Single 'Super Weight' Can Break Your Billion-Parameter Model (gonzoml.substack.com)
10 months ago | che_shr_cat | substack.com | newest
1
Jax Things to Watch for in 2025 (gonzoml.substack.com)
10 months ago | che_shr_cat | substack.com | newest
13
Diffusion models are evolutionary algorithms (gonzoml.substack.com)
10 months ago | che_shr_cat | substack.com | best
1
Make Softmax Great Again (gonzoml.substack.com)
10 months ago | che_shr_cat | substack.com | newest
1
Deep Learning Frameworks: The Fourth Pillar of Deep Learning Revolution (gonzoml.substack.com)
10 months ago | che_shr_cat | substack.com | newest
3
TextGrad: Automatic "Differentiation" via Text (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Superconducting Supercomputers (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Decoder-decoder architecture is coming (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Chronos: Using Pretrained LLMs for Probabilistic Time Series Forecasting (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
12
Big Post About Big Context (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | best
1
Neural Network Diffusion (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
8
Thermodynamic AI is getting hotter (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | best
1
Training LLMs with AMD GPUs on Frontier Supercomputer (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Law (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Project CETI (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
GonzoML on Mamba and S6 (+previous post on S4) (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
2
Conway's Game of Life Is Omniperiodic (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
2
GonzoML on Gemini (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Matryoshka Representation Learning (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
Mindstorms in Natural Language-Based Societies of Mind (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
42
The convolution empire strikes back (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | best
2
Sparse Universal Transformer (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
1
MemWalker: An alternative way for working with long documents using transformers (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
17
"Building Machines That Learn and Think Like People", 7 Years Later (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | best
1
Chain-of-Thought → Tree-of-Thought (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | newest
9
Mortal Computers (gonzoml.substack.com)
a year ago | che_shr_cat | substack.com | frontpage
1
Levanter – Legible, Scalable, Reproducible Foundation Models with Jax (stanford.edu)
2 years ago | che_shr_cat | stanford.edu | newest
1
LM-3 – resurrecting the MIT CADR (tumbleweed.nu)
2 years ago | che_shr_cat | tumbleweed.nu | newest