Next >
2
Learning to Model the World with Language (dynalang.github.io)
a day ago | jxmorris12 | github.io | newest
2
A hitchhiker's guide to CUDA programming (seanzhang.me)
a week ago | jxmorris12 | seanzhang.me | frontpage
46
Estimating the perceived 'claustrophobia' of New York City's streets (2024) (mfranchi.net)
a week ago | jxmorris12 | mfranchi.net | best
166
Tinkering is a way to acquire good taste (seated.ro)
a week ago | jxmorris12 | seated.ro | best
2
Modern LLM Training (A Summary) (lesswrong.com)
a week ago | jxmorris12 | lesswrong.com | newest
2
Yes it's just doing compression. No it's not the diss you think it is (blog.wtf.sg)
a week ago | jxmorris12 | wtf.sg | newest
2
Good developer relations is about being a celebrity for dorks (pfiffer.org)
2 weeks ago | jxmorris12 | pfiffer.org | newest
2
Prompt Baking (arxiv.org)
2 weeks ago | jxmorris12 | arxiv.org | frontpage
1
Offline "Studying" Shrinks the Cost of Contextually Aware AI (stanford.edu)
2 weeks ago | jxmorris12 | stanford.edu | newest
5
The State of Machine Learning Frameworks in 2019 (thegradient.pub)
2 weeks ago | jxmorris12 | thegradient.pub | frontpage
1
Many AI Safety Orgs Have Tried to Criminalize Open-Source AI (2024) (1a3orn.com)
3 weeks ago | jxmorris12 | 1a3orn.com | newest
1
Neural Networks and Deep Learning (neuralnetworksanddeeplearning.com)
3 weeks ago | jxmorris12 | neuralnetworksanddeeplearning.com | newest
105
America's future could hinge on whether AI slightly disappoints (noahpinion.blog)
3 weeks ago | jxmorris12 | noahpinion.blog | best
3
Self-Respect (By Joan Didion) (1961) (gatech.edu)
3 weeks ago | jxmorris12 | gatech.edu | newest
12
Read your way through Hà Nội (vietnamesetypography.com)
3 weeks ago | jxmorris12 | vietnamesetypography.com | frontpage
59
How hard do you have to hit a chicken to cook it? (2020) (james-simon.github.io)
4 weeks ago | jxmorris12 | github.io | best
2
Start a Blog (guzey.com)
4 weeks ago | jxmorris12 | guzey.com | newest
1
Computable Babylonian Diaries Project (christopherwolfram.com)
4 weeks ago | jxmorris12 | christopherwolfram.com | frontpage
1
Survival of the Best Fit (survivalofthebestfit.com)
a month ago | jxmorris12 | survivalofthebestfit.com | newest
3
Breath of the Wild Decompilation (botw.link)
a month ago | jxmorris12 | botw.link | newest
2
The Politics of Contagion (emilybynight.com)
a month ago | jxmorris12 | emilybynight.com | newest
8
A PhD in Snapshots (rbharath.github.io)
a month ago | jxmorris12 | github.io | frontpage
31
Memory access is O(N^[1/3]) (vitalik.eth.limo)
a month ago | jxmorris12 | eth.limo | frontpage
1
Highrises (hythacg.com)
a month ago | jxmorris12 | hythacg.com | newest
30
How does gradient descent work? (centralflows.github.io)
a month ago | jxmorris12 | github.io | best
3
Small Products That Improved My Life (moultano.wordpress.com)
a month ago | jxmorris12 | wordpress.com | newest
1
Whispers of A.I.'s Modular Future (2023) (newyorker.com)
a month ago | jxmorris12 | newyorker.com | newest
1
An Age of AI Enlightenment (xiangfu.co)
a month ago | jxmorris12 | xiangfu.co | newest
1
A vision researcher's guide to some RL stuff: PPO and GRPO (yugeten.github.io)
a month ago | jxmorris12 | github.io | newest
3
LLMs are strangely-shaped tools (near.blog)
a month ago | jxmorris12 | near.blog | newest
1
Learned Structures (nonint.com)
a month ago | jxmorris12 | nonint.com | newest
1
LoRA-XS: Low-Rank Adaptation with Small Number of Parameters (arxiv.org)
a month ago | jxmorris12 | arxiv.org | newest
8
Evals in 2025: going beyond simple benchmarks to build models people can use (github.com/huggingface)
a month ago | jxmorris12 | github.com | frontpage
1
Dissecting Batching Effects in GPT Inference (qun.ch)
a month ago | jxmorris12 | qun.ch | newest
2
My (speculative) master plan for immortality (maxwellnye.com)
a month ago | jxmorris12 | maxwellnye.com | newest
3
Richard Feynman and the Connection Machine (1989) (longnow.org)
a month ago | jxmorris12 | longnow.org | frontpage
98
Defeating Nondeterminism in LLM Inference (thinkingmachines.ai)
a month ago | jxmorris12 | thinkingmachines.ai | best
12
Perceived Age (2024) (sdan.io)
a month ago | jxmorris12 | sdan.io | frontpage
2
Don't Build an RL Environment Startup (benanderson.work)
2 months ago | jxmorris12 | benanderson.work | newest
1
Shifting Bits in Company History (williamyeny.github.io)
2 months ago | jxmorris12 | github.io | newest
1
ML Systems: Motivating Dense Models (jacobkahn.me)
2 months ago | jxmorris12 | jacobkahn.me | newest
4
The "it" in AI models is the dataset (nonint.com)
2 months ago | jxmorris12 | nonint.com | newest
1
The Paradigm (nonint.com)
2 months ago | jxmorris12 | nonint.com | frontpage
1
Personalization, measuring with taste, and intrinsic interfaces (thesephist.com)
3 months ago | jxmorris12 | thesephist.com | newest
3
Long Term Memory in AI (Princeton CS 597A) (edoliberty.github.io)
3 months ago | jxmorris12 | github.io | newest
1
Model Merging – A Biased Overview (crisostomi.github.io)
3 months ago | jxmorris12 | github.io | newest
1
Adversarial Examples Are Not Bugs, They Are Superposition (livgorton.com)
3 months ago | jxmorris12 | livgorton.com | newest
1
Sequence Parallelism: Long Sequence Training from System Perspective (2021) (arxiv.org)
3 months ago | jxmorris12 | arxiv.org | newest
10
How many paths of length K are there between A and B? (2021) (horace.io)
3 months ago | jxmorris12 | horace.io | frontpage
2
How A Neuron Learns (rvns.moe)
3 months ago | jxmorris12 | rvns.moe | newest
1
GPT, Fast (pytorch.org)
3 months ago | jxmorris12 | pytorch.org | newest
1
GPT-Fast (github.com/meta-pytorch)
3 months ago | jxmorris12 | github.com | newest
10
Exploring EXIF (2023) (hturan.com)
3 months ago | jxmorris12 | hturan.com | frontpage
1
The Practitioner's Guide to the Maximal Update Parameterization (cerebras.ai)
3 months ago | jxmorris12 | cerebras.ai | newest
1
The scientific method and its application to the science of deep learning (james-simon.github.io)
3 months ago | jxmorris12 | github.io | newest
2
Solving Humanity's Last Exam Problems (youtube.com)
3 months ago | jxmorris12 | youtube.com | newest
1
Why We Think (lilianweng.github.io)
3 months ago | jxmorris12 | github.io | newest
5
Philosophical Thoughts on Kolmogorov-Arnold Networks (2024) (kindxiaoming.github.io)
3 months ago | jxmorris12 | github.io | frontpage
1
Matmul() using PyTorch's MPs back end is faster than Apple's MLX (kevinmartinjose.com)
3 months ago | jxmorris12 | kevinmartinjose.com | newest
2
The Making of Gemini Plays Pokémon (jcz.dev)
3 months ago | jxmorris12 | jcz.dev | newest
6
Facebook is not worth $33B (2010) (signalvnoise.com)
3 months ago | jxmorris12 | signalvnoise.com | frontpage
3
Comefrom (wikipedia.org)
3 months ago | jxmorris12 | wikipedia.org | newest
1
Diffusion Language Models Are Super Data Learners (jinjieni.notion.site)
3 months ago | jxmorris12 | notion.site | newest
2
How to build a router for MOE models (cerebras.ai)
3 months ago | jxmorris12 | cerebras.ai | newest
2
The Eponymous Principles of Management – Coase's Ceiling and Floor (amvaishnav.wordpress.com)
3 months ago | jxmorris12 | wordpress.com | newest
70
[flagged] No One Is Working (humaninvariant.com)
3 months ago | jxmorris12 | humaninvariant.com | frontpage
3
No One Is Working (humaninvariant.com)
3 months ago | jxmorris12 | humaninvariant.com | newest
1
SFT Is Bad RL (justinchiu.netlify.app)
3 months ago | jxmorris12 | netlify.app | newest
8
A Simple CPU on the Game of Life (2021) (carlini.com)
3 months ago | jxmorris12 | carlini.com | frontpage
2
Trends in LLM-Generated Citations on ArXiv (spylab.ai)
3 months ago | jxmorris12 | spylab.ai | newest
3
'AI' just means LLMs now (jxmo.io)
3 months ago | jxmorris12 | jxmo.io | newest
2
Ada Lovelace and the Analytical Engine (ox.ac.uk)
3 months ago | jxmorris12 | ox.ac.uk | newest
40
How long before superintelligence? (1997) (nickbostrom.com)
4 months ago | jxmorris12 | nickbostrom.com | frontpage
65
Attention is your scarcest resource (2020) (benkuhn.net)
4 months ago | jxmorris12 | benkuhn.net | best
1
DeltaNet Explained (sustcsonglin.github.io)
4 months ago | jxmorris12 | github.io | newest
84
All AI models might be the same (jxmo.io)
4 months ago | jxmorris12 | jxmo.io | best
1
Life Update – On Health (jiha-kim.github.io)
4 months ago | jxmorris12 | github.io | newest
1
Asymmetry of Verification and Verifier's Law (jasonwei.net)
4 months ago | jxmorris12 | jasonwei.net | frontpage
7
Soviet College Admission – My Dad's Story (1970) (ilyavolodarsky.com)
4 months ago | jxmorris12 | ilyavolodarsky.com | frontpage
2
H-Net – Inference (main-horse.github.io)
4 months ago | jxmorris12 | github.io | newest
8
How to scale RL to 10^26 FLOPs (jxmo.io)
4 months ago | jxmorris12 | jxmo.io | frontpage
1
Britain is cheap, and should learn to love it (economist.com)
4 months ago | jxmorris12 | economist.com | newest
2
Database Sharding (planetscale.com)
4 months ago | jxmorris12 | planetscale.com | newest
3
Microdosing Willpower: My Takeaways from Microdosing Ozempic (substack.com)
4 months ago | jxmorris12 | substack.com | newest
29
The upcoming GPT-3 moment for RL (mechanize.work)
4 months ago | jxmorris12 | mechanize.work | best
19
The Tradeoffs of SSMs and Transformers (goombalab.github.io)
4 months ago | jxmorris12 | github.io | frontpage
1
Things you can do –with uv (zaloog.github.io)
4 months ago | jxmorris12 | github.io | newest
24
The era of exploration (yidingjiang.github.io)
4 months ago | jxmorris12 | github.io | best
5
Just Ask for Generalization (2021) (evjang.com)
4 months ago | jxmorris12 | evjang.com | frontpage
3
Will Scaling Solve Robotics? (nishanthjkumar.com)
4 months ago | jxmorris12 | nishanthjkumar.com | frontpage
3
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (vllm.ai)
4 months ago | jxmorris12 | vllm.ai | frontpage
4
LLM Memory (grantslatton.com)
5 months ago | jxmorris12 | grantslatton.com | frontpage
126
What Problems to Solve (1966) (cat-v.org)
5 months ago | jxmorris12 | cat-v.org | best
1
Test-Time Training (yueatsprograms.github.io)
5 months ago | jxmorris12 | github.io | newest
129
Thnickels (thick-coins.net)
5 months ago | jxmorris12 | thick-coins.net | best
22
SFStreets: History of San Francisco place names (noahveltman.com)
5 months ago | jxmorris12 | noahveltman.com | frontpage
1
Muon Doesn't Clearly Grok Faster (essential.ai)
5 months ago | jxmorris12 | essential.ai | newest
1
René Girard and Mimetic Theory for Non-Philosophers (siboehm.com)
5 months ago | jxmorris12 | siboehm.com | newest
2
Becoming a Better Programmer by Tightening Feedback Loops (siboehm.com)
5 months ago | jxmorris12 | siboehm.com | newest
1
Approximating Language Model Training Data from Weights (arxiv.org)
5 months ago | jxmorris12 | arxiv.org | newest
Next >