Next >
1
Deep Laziness (ribbonfarm.com)
10 hours ago | jxmorris12 | ribbonfarm.com | newest
199
[dupe] Embeddings are underrated (2024) (technicalwriting.dev)
a day ago | jxmorris12 | technicalwriting.dev | best
1
What Physics Can Teach Us About AI (OpenAI's Dan Roberts) (sequoiacap.com)
3 days ago | jxmorris12 | sequoiacap.com | newest
1
Deep Learning for Natural Language Processing (Without Magic) (2013) (stanford.edu)
6 days ago | jxmorris12 | stanford.edu | newest
2
Robotics Predictions for 2025 (bolte.cc)
a week ago | jxmorris12 | bolte.cc | newest
2
Giant Inscrutable Matrices: Not Worse Than Anything Else
a week ago | jxmorris12 | ycombinator.com | newest
4
AI crawler wars threaten to make the web more closed for everyone (technologyreview.com)
a week ago | jxmorris12 | technologyreview.com | newest
2
Why GRPO Is Important and How It Works (oxen.ai)
a week ago | jxmorris12 | oxen.ai | newest
10
The Speed of VITs and CNNs (lucasb.eyer.be)
a week ago | jxmorris12 | eyer.be | frontpage
1
Good Research Takes Are Not Sufficient for Good Strategic Takes (neelnanda.io)
2 weeks ago | jxmorris12 | neelnanda.io | newest
27
Thank you for holding my duck (2021) (naml.us)
2 weeks ago | jxmorris12 | naml.us | frontpage
1
The Maintenance Race (worksinprogress.co)
2 weeks ago | jxmorris12 | worksinprogress.co | newest
3
Product Quantization: Compressing high-dimensional vectors by 97% (pinecone.io)
3 weeks ago | jxmorris12 | pinecone.io | frontpage
2
Notes on "An Observation on Generalization" (sumanthrh.com)
3 weeks ago | jxmorris12 | sumanthrh.com | newest
1
Boring numbers, complexity and Chaitin's incompleteness theorem (ejenner.com)
3 weeks ago | jxmorris12 | ejenner.com | newest
40
Surprises in Logic (2016) (ucr.edu)
3 weeks ago | jxmorris12 | ucr.edu | frontpage
5
Agency Is Eating the World (giansegato.com)
3 weeks ago | jxmorris12 | giansegato.com | newest
1
Is a PhD on Language Models Worth It in 2025? (ruiqizhong.substack.com)
3 weeks ago | jxmorris12 | substack.com | newest
1
Embzip (github.com/jxmorris12)
3 weeks ago | jxmorris12 | github.com | newest
2
Healthy Obsession (eatonphil.com)
3 weeks ago | jxmorris12 | eatonphil.com | newest
6
To Make Language Models Work Better, Researchers Sidestep Language (quantamagazine.org)
3 weeks ago | jxmorris12 | quantamagazine.org | frontpage
1
Modifying Custom Matmul CUDA Kernels (demoriarty.github.io)
3 weeks ago | jxmorris12 | github.io | newest
1
Reading the Llama Code (adrian.idv.hk)
3 weeks ago | jxmorris12 | adrian.idv.hk | newest
2
What It Feels Like to Get Stronger (troynikov.io)
3 weeks ago | jxmorris12 | troynikov.io | frontpage
35
How I Don't Use LLMs (gleech.org)
4 weeks ago | jxmorris12 | gleech.org | frontpage
2
A vision researcher's guide to some RL stuff: PPO and GRPO (yugeten.github.io)
4 weeks ago | jxmorris12 | github.io | newest
2
There Are No New Ideas in AI... Only New Datasets (substack.com)
4 weeks ago | jxmorris12 | substack.com | newest
5
Introduction to Theoretical Computer Science (2023) (introtcs.org)
4 weeks ago | jxmorris12 | introtcs.org | frontpage
1
An intuitive introduction to text embeddings (stackoverflow.blog)
a month ago | jxmorris12 | stackoverflow.blog | newest
2
There Are No New Ideas in AI... Only New Datasets (substack.com)
a month ago | jxmorris12 | substack.com | newest
3
There Are No New Ideas in AI – Only New Datasets (substack.com)
a month ago | jxmorris12 | substack.com | newest
2
Understanding Flash Attention: Writing the Algorithm from Scratch in Triton (alexdremov.me)
a month ago | jxmorris12 | alexdremov.me | newest
2
The Scaling Hypothesis (2020) (gwern.net)
a month ago | jxmorris12 | gwern.net | newest
1
Optimizing Matrix Multiplication (coffeebeforearch.github.io)
a month ago | jxmorris12 | github.io | frontpage
1
Phylogeny of all the plants I can remember eating (ribo.zone)
a month ago | jxmorris12 | ribo.zone | newest
2
The Experience of Using a Guest Pass at an Elite Gym (applieddivinitystudies.com)
a month ago | jxmorris12 | applieddivinitystudies.com | newest
1
Book Review: The PhD Grind (cesarsotovalero.net)
a month ago | jxmorris12 | cesarsotovalero.net | newest
89
The Egg (2009) (galactanet.com)
a month ago | jxmorris12 | galactanet.com | best
1
Language Modeling with 3D Parallelism (uvadlc-notebooks.readthedocs.io)
a month ago | jxmorris12 | readthedocs.io | newest
3
The Mean-Ing of Loss Functions (jiha-kim.github.io)
a month ago | jxmorris12 | github.io | newest
2
The Serendipity Machine (Notes on Using Twitter) (nabeelqu.co)
a month ago | jxmorris12 | nabeelqu.co | newest
1
Attention Is Logarithmic (supaiku.com)
a month ago | jxmorris12 | supaiku.com | newest
9
What Fruits and Vegetables Looked Like Before We Domesticated Them (2016) (businessinsider.com)
a month ago | jxmorris12 | businessinsider.com | frontpage
32
The PhD Metagame: Don't try to reform science – not yet (maxwellforbes.com)
a month ago | jxmorris12 | maxwellforbes.com | frontpage
2
Seeing Circles, Sines, and Signals: A Compact Primer on Digital Signal Processin (jackschaedler.github.io)
a month ago | jxmorris12 | github.io | newest
3
Kill Math (worrydream.com)
a month ago | jxmorris12 | worrydream.com | newest
1
Inference Characteristics of Llama (cursor.com)
2 months ago | jxmorris12 | cursor.com | newest
1
Math –> GPU: My Transition from Academic Math to GPU Programming (ericauld.github.io)
2 months ago | jxmorris12 | github.io | newest
1
Poking Around Claude Code (leehanchung.github.io)
2 months ago | jxmorris12 | github.io | newest
3
Deriving Muon (jeremybernste.in)
2 months ago | jxmorris12 | jeremybernste.in | frontpage
3
The Ones Who Stay and Fight (lightspeedmagazine.com)
2 months ago | jxmorris12 | lightspeedmagazine.com | newest
1
How to hire ML engineers/researchers (artfintel.com)
2 months ago | jxmorris12 | artfintel.com | newest
1
A Case of Plagarism in Machine Learning Research (carlini.com)
2 months ago | jxmorris12 | carlini.com | newest
2
Read "Harry Potter and the Methods of Rationality" (turntrout.com)
2 months ago | jxmorris12 | turntrout.com | newest
1
Laws of Tech: Commoditize Your Complement (gwern.net)
2 months ago | jxmorris12 | gwern.net | newest
1
AI Blindspots (ezyang.github.io)
2 months ago | jxmorris12 | github.io | newest
69
Copilot for Everything: Training your AI replacement one keystroke at a time (substack.com)
2 months ago | jxmorris12 | substack.com | best
3
Thoughts on Cursor (vncntt.github.io)
3 months ago | jxmorris12 | github.io | newest
4
Quantity Always Trumps Quality (2008) (codinghorror.com)
3 months ago | jxmorris12 | codinghorror.com | newest
1
Colophon (joodaloop.com)
3 months ago | jxmorris12 | joodaloop.com | newest
2
Training a 70B Model from Scratch (imbue.com)
3 months ago | jxmorris12 | imbue.com | newest
2
A Meticulous Guide to Advances in Deep Learning Efficiency (alexzhang13.github.io)
3 months ago | jxmorris12 | github.io | newest
92
I think Yann Lecun was right about LLMs (but perhaps only by accident) (substack.com)
3 months ago | jxmorris12 | substack.com | best
51
Please Commit More Blatant Academic Fraud (2021) (jacobbuckman.com)
3 months ago | jxmorris12 | jacobbuckman.com | best
1
Demystifying Noise Contrastive Estimation (jxmo.io)
3 months ago | jxmorris12 | jxmo.io | newest
60
It's time to become an ML engineer (2022) (gregbrockman.com)
3 months ago | jxmorris12 | gregbrockman.com | frontpage
2
Approximating KL Divergence (2020) (joschu.net)
3 months ago | jxmorris12 | joschu.net | newest
10
The Ultra-Scale Playbook: Training LLMs on GPU Clusters (huggingface.co)
3 months ago | jxmorris12 | huggingface.co | frontpage
38
Implementing LLaMA3 in 100 Lines of Pure Jax (saurabhalone.com)
3 months ago | jxmorris12 | saurabhalone.com | best
2
Outperforming cuBLAS on H100: A Worklog (cudaforfun.substack.com)
3 months ago | jxmorris12 | substack.com | newest
26
Gravel Map (gravelmap.com)
3 months ago | jxmorris12 | gravelmap.com | frontpage
2
Gravel
3 months ago | jxmorris12 | ycombinator.com | newest
1
Stochastic Integration for Poets (slater.works)
3 months ago | jxmorris12 | slater.works | newest
1
NASA writes space-proof code [video] (youtube.com)
3 months ago | jxmorris12 | youtube.com | newest
1
Large Lambda Model (theopolis.net)
3 months ago | jxmorris12 | theopolis.net | newest
25
Softmax forever, or why I like softmax (kyunghyuncho.me)
3 months ago | jxmorris12 | kyunghyuncho.me | best
23
Diffusion Without Tears (baincapitalventures.notion.site)
3 months ago | jxmorris12 | notion.site | frontpage
1
AGI Safety Course Workbook (docs.google.com)
3 months ago | jxmorris12 | google.com | newest
3
AI Nationalism – Ian Hogarth (ianhogarth.com)
3 months ago | jxmorris12 | ianhogarth.com | newest
1
A Beginners' Guide to Misprints in Magic (misprintedmtg.com)
3 months ago | jxmorris12 | misprintedmtg.com | newest
1
Flow with What You Know (drscotthawley.github.io)
3 months ago | jxmorris12 | github.io | newest
1
Going with the Flow: An Introduction to Normalizing Flows (gebob19.github.io)
3 months ago | jxmorris12 | github.io | newest
2
Diffusion Meets Flow Matching: Two Sides of the Same Coin (diffusionflow.github.io)
3 months ago | jxmorris12 | github.io | newest
2
Muon: An optimizer for hidden layers in neural networks (kellerjordan.github.io)
3 months ago | jxmorris12 | github.io | newest
2
Honeycrisp: An Apple-First Deep Learning Framework (aqnichol.com)
3 months ago | jxmorris12 | aqnichol.com | frontpage
1
My Gear (paulstamatiou.com)
3 months ago | jxmorris12 | paulstamatiou.com | newest
4
DOGE for AI (allen-zhu.com)
3 months ago | jxmorris12 | allen-zhu.com | newest
1
How to Backpack (moultano.wordpress.com)
3 months ago | jxmorris12 | wordpress.com | newest
2
GRPO with Verifiable Rewards Is Contrastive Loss (ymroueh.me)
3 months ago | jxmorris12 | ymroueh.me | newest
1
Bit Prediction (argmin.net)
3 months ago | jxmorris12 | argmin.net | newest
3
How Could Telepathy Work? (hinterlander.substack.com)
3 months ago | jxmorris12 | substack.com | newest
1
How to Scale Your Model (jax-ml.github.io)
3 months ago | jxmorris12 | github.io | newest
85
RLHF Book (rlhfbook.com)
3 months ago | jxmorris12 | rlhfbook.com | best
1
Reading notes: unsupervised word translation (yourdomain.com)
4 months ago | jxmorris12 | yourdomain.com | newest
2
The Art of Debugging (github.com/stas00)
4 months ago | jxmorris12 | github.com | newest
2
What a $500k grant proposal looks like (austinhenley.com)
4 months ago | jxmorris12 | austinhenley.com | newest
1
Deep Reinforcement Learning Doesn't Work Yet (alexirpan.com)
4 months ago | jxmorris12 | alexirpan.com | newest
61
How far can you get in 40 minutes from each subway station in NYC? (subwaysheds.com)
4 months ago | jxmorris12 | subwaysheds.com | best
1
Attention Sinks in LLMs for endless fluency (huggingface.co)
4 months ago | jxmorris12 | huggingface.co | newest
3
Flow with What You Know: An Introduction to Flow-Based Models (drscotthawley.github.io)
4 months ago | jxmorris12 | github.io | newest
Next >