All
5+
10+
25+
50+
100+
Next >
2
Learning to Model the World with Language (dynalang.github.io)
a day ago |
jxmorris12
| github.io
|
newest
2
A hitchhiker's guide to CUDA programming (seanzhang.me)
a week ago |
jxmorris12
| seanzhang.me
|
frontpage
46
Estimating the perceived 'claustrophobia' of New York City's streets (2024) (mfranchi.net)
a week ago |
jxmorris12
| mfranchi.net
|
best
166
Tinkering is a way to acquire good taste (seated.ro)
a week ago |
jxmorris12
| seated.ro
|
best
2
Modern LLM Training (A Summary) (lesswrong.com)
a week ago |
jxmorris12
| lesswrong.com
|
newest
2
Yes it's just doing compression. No it's not the diss you think it is (blog.wtf.sg)
a week ago |
jxmorris12
| wtf.sg
|
newest
2
Good developer relations is about being a celebrity for dorks (pfiffer.org)
2 weeks ago |
jxmorris12
| pfiffer.org
|
newest
2
Prompt Baking (arxiv.org)
2 weeks ago |
jxmorris12
| arxiv.org
|
frontpage
1
Offline "Studying" Shrinks the Cost of Contextually Aware AI (stanford.edu)
2 weeks ago |
jxmorris12
| stanford.edu
|
newest
5
The State of Machine Learning Frameworks in 2019 (thegradient.pub)
2 weeks ago |
jxmorris12
| thegradient.pub
|
frontpage
1
Many AI Safety Orgs Have Tried to Criminalize Open-Source AI (2024) (1a3orn.com)
3 weeks ago |
jxmorris12
| 1a3orn.com
|
newest
1
Neural Networks and Deep Learning (neuralnetworksanddeeplearning.com)
3 weeks ago |
jxmorris12
| neuralnetworksanddeeplearning.com
|
newest
105
America's future could hinge on whether AI slightly disappoints (noahpinion.blog)
3 weeks ago |
jxmorris12
| noahpinion.blog
|
best
3
Self-Respect (By Joan Didion) (1961) (gatech.edu)
3 weeks ago |
jxmorris12
| gatech.edu
|
newest
12
Read your way through Hà Nội (vietnamesetypography.com)
3 weeks ago |
jxmorris12
| vietnamesetypography.com
|
frontpage
59
How hard do you have to hit a chicken to cook it? (2020) (james-simon.github.io)
4 weeks ago |
jxmorris12
| github.io
|
best
2
Start a Blog (guzey.com)
4 weeks ago |
jxmorris12
| guzey.com
|
newest
1
Computable Babylonian Diaries Project (christopherwolfram.com)
4 weeks ago |
jxmorris12
| christopherwolfram.com
|
frontpage
1
Survival of the Best Fit (survivalofthebestfit.com)
a month ago |
jxmorris12
| survivalofthebestfit.com
|
newest
3
Breath of the Wild Decompilation (botw.link)
a month ago |
jxmorris12
| botw.link
|
newest
2
The Politics of Contagion (emilybynight.com)
a month ago |
jxmorris12
| emilybynight.com
|
newest
8
A PhD in Snapshots (rbharath.github.io)
a month ago |
jxmorris12
| github.io
|
frontpage
31
Memory access is O(N^[1/3]) (vitalik.eth.limo)
a month ago |
jxmorris12
| eth.limo
|
frontpage
1
Highrises (hythacg.com)
a month ago |
jxmorris12
| hythacg.com
|
newest
30
How does gradient descent work? (centralflows.github.io)
a month ago |
jxmorris12
| github.io
|
best
3
Small Products That Improved My Life (moultano.wordpress.com)
a month ago |
jxmorris12
| wordpress.com
|
newest
1
Whispers of A.I.'s Modular Future (2023) (newyorker.com)
a month ago |
jxmorris12
| newyorker.com
|
newest
1
An Age of AI Enlightenment (xiangfu.co)
a month ago |
jxmorris12
| xiangfu.co
|
newest
1
A vision researcher's guide to some RL stuff: PPO and GRPO (yugeten.github.io)
a month ago |
jxmorris12
| github.io
|
newest
3
LLMs are strangely-shaped tools (near.blog)
a month ago |
jxmorris12
| near.blog
|
newest
1
Learned Structures (nonint.com)
a month ago |
jxmorris12
| nonint.com
|
newest
1
LoRA-XS: Low-Rank Adaptation with Small Number of Parameters (arxiv.org)
a month ago |
jxmorris12
| arxiv.org
|
newest
8
Evals in 2025: going beyond simple benchmarks to build models people can use (github.com/huggingface)
a month ago |
jxmorris12
| github.com
|
frontpage
1
Dissecting Batching Effects in GPT Inference (qun.ch)
a month ago |
jxmorris12
| qun.ch
|
newest
2
My (speculative) master plan for immortality (maxwellnye.com)
a month ago |
jxmorris12
| maxwellnye.com
|
newest
3
Richard Feynman and the Connection Machine (1989) (longnow.org)
a month ago |
jxmorris12
| longnow.org
|
frontpage
98
Defeating Nondeterminism in LLM Inference (thinkingmachines.ai)
a month ago |
jxmorris12
| thinkingmachines.ai
|
best
12
Perceived Age (2024) (sdan.io)
a month ago |
jxmorris12
| sdan.io
|
frontpage
2
Don't Build an RL Environment Startup (benanderson.work)
2 months ago |
jxmorris12
| benanderson.work
|
newest
1
Shifting Bits in Company History (williamyeny.github.io)
2 months ago |
jxmorris12
| github.io
|
newest
1
ML Systems: Motivating Dense Models (jacobkahn.me)
2 months ago |
jxmorris12
| jacobkahn.me
|
newest
4
The "it" in AI models is the dataset (nonint.com)
2 months ago |
jxmorris12
| nonint.com
|
newest
1
The Paradigm (nonint.com)
2 months ago |
jxmorris12
| nonint.com
|
frontpage
1
Personalization, measuring with taste, and intrinsic interfaces (thesephist.com)
3 months ago |
jxmorris12
| thesephist.com
|
newest
3
Long Term Memory in AI (Princeton CS 597A) (edoliberty.github.io)
3 months ago |
jxmorris12
| github.io
|
newest
1
Model Merging – A Biased Overview (crisostomi.github.io)
3 months ago |
jxmorris12
| github.io
|
newest
1
Adversarial Examples Are Not Bugs, They Are Superposition (livgorton.com)
3 months ago |
jxmorris12
| livgorton.com
|
newest
1
Sequence Parallelism: Long Sequence Training from System Perspective (2021) (arxiv.org)
3 months ago |
jxmorris12
| arxiv.org
|
newest
10
How many paths of length K are there between A and B? (2021) (horace.io)
3 months ago |
jxmorris12
| horace.io
|
frontpage
2
How A Neuron Learns (rvns.moe)
3 months ago |
jxmorris12
| rvns.moe
|
newest
1
GPT, Fast (pytorch.org)
3 months ago |
jxmorris12
| pytorch.org
|
newest
1
GPT-Fast (github.com/meta-pytorch)
3 months ago |
jxmorris12
| github.com
|
newest
10
Exploring EXIF (2023) (hturan.com)
3 months ago |
jxmorris12
| hturan.com
|
frontpage
1
The Practitioner's Guide to the Maximal Update Parameterization (cerebras.ai)
3 months ago |
jxmorris12
| cerebras.ai
|
newest
1
The scientific method and its application to the science of deep learning (james-simon.github.io)
3 months ago |
jxmorris12
| github.io
|
newest
2
Solving Humanity's Last Exam Problems (youtube.com)
3 months ago |
jxmorris12
| youtube.com
|
newest
1
Why We Think (lilianweng.github.io)
3 months ago |
jxmorris12
| github.io
|
newest
5
Philosophical Thoughts on Kolmogorov-Arnold Networks (2024) (kindxiaoming.github.io)
3 months ago |
jxmorris12
| github.io
|
frontpage
1
Matmul() using PyTorch's MPs back end is faster than Apple's MLX (kevinmartinjose.com)
3 months ago |
jxmorris12
| kevinmartinjose.com
|
newest
2
The Making of Gemini Plays Pokémon (jcz.dev)
3 months ago |
jxmorris12
| jcz.dev
|
newest
6
Facebook is not worth $33B (2010) (signalvnoise.com)
3 months ago |
jxmorris12
| signalvnoise.com
|
frontpage
3
Comefrom (wikipedia.org)
3 months ago |
jxmorris12
| wikipedia.org
|
newest
1
Diffusion Language Models Are Super Data Learners (jinjieni.notion.site)
3 months ago |
jxmorris12
| notion.site
|
newest
2
How to build a router for MOE models (cerebras.ai)
3 months ago |
jxmorris12
| cerebras.ai
|
newest
2
The Eponymous Principles of Management – Coase's Ceiling and Floor (amvaishnav.wordpress.com)
3 months ago |
jxmorris12
| wordpress.com
|
newest
70
[flagged] No One Is Working (humaninvariant.com)
3 months ago |
jxmorris12
| humaninvariant.com
|
frontpage
3
No One Is Working (humaninvariant.com)
3 months ago |
jxmorris12
| humaninvariant.com
|
newest
1
SFT Is Bad RL (justinchiu.netlify.app)
3 months ago |
jxmorris12
| netlify.app
|
newest
8
A Simple CPU on the Game of Life (2021) (carlini.com)
3 months ago |
jxmorris12
| carlini.com
|
frontpage
2
Trends in LLM-Generated Citations on ArXiv (spylab.ai)
3 months ago |
jxmorris12
| spylab.ai
|
newest
3
'AI' just means LLMs now (jxmo.io)
3 months ago |
jxmorris12
| jxmo.io
|
newest
2
Ada Lovelace and the Analytical Engine (ox.ac.uk)
3 months ago |
jxmorris12
| ox.ac.uk
|
newest
40
How long before superintelligence? (1997) (nickbostrom.com)
4 months ago |
jxmorris12
| nickbostrom.com
|
frontpage
65
Attention is your scarcest resource (2020) (benkuhn.net)
4 months ago |
jxmorris12
| benkuhn.net
|
best
1
DeltaNet Explained (sustcsonglin.github.io)
4 months ago |
jxmorris12
| github.io
|
newest
84
All AI models might be the same (jxmo.io)
4 months ago |
jxmorris12
| jxmo.io
|
best
1
Life Update – On Health (jiha-kim.github.io)
4 months ago |
jxmorris12
| github.io
|
newest
1
Asymmetry of Verification and Verifier's Law (jasonwei.net)
4 months ago |
jxmorris12
| jasonwei.net
|
frontpage
7
Soviet College Admission – My Dad's Story (1970) (ilyavolodarsky.com)
4 months ago |
jxmorris12
| ilyavolodarsky.com
|
frontpage
2
H-Net – Inference (main-horse.github.io)
4 months ago |
jxmorris12
| github.io
|
newest
8
How to scale RL to 10^26 FLOPs (jxmo.io)
4 months ago |
jxmorris12
| jxmo.io
|
frontpage
1
Britain is cheap, and should learn to love it (economist.com)
4 months ago |
jxmorris12
| economist.com
|
newest
2
Database Sharding (planetscale.com)
4 months ago |
jxmorris12
| planetscale.com
|
newest
3
Microdosing Willpower: My Takeaways from Microdosing Ozempic (substack.com)
4 months ago |
jxmorris12
| substack.com
|
newest
29
The upcoming GPT-3 moment for RL (mechanize.work)
4 months ago |
jxmorris12
| mechanize.work
|
best
19
The Tradeoffs of SSMs and Transformers (goombalab.github.io)
4 months ago |
jxmorris12
| github.io
|
frontpage
1
Things you can do –with uv (zaloog.github.io)
4 months ago |
jxmorris12
| github.io
|
newest
24
The era of exploration (yidingjiang.github.io)
4 months ago |
jxmorris12
| github.io
|
best
5
Just Ask for Generalization (2021) (evjang.com)
4 months ago |
jxmorris12
| evjang.com
|
frontpage
3
Will Scaling Solve Robotics? (nishanthjkumar.com)
4 months ago |
jxmorris12
| nishanthjkumar.com
|
frontpage
3
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (vllm.ai)
4 months ago |
jxmorris12
| vllm.ai
|
frontpage
4
LLM Memory (grantslatton.com)
5 months ago |
jxmorris12
| grantslatton.com
|
frontpage
126
What Problems to Solve (1966) (cat-v.org)
5 months ago |
jxmorris12
| cat-v.org
|
best
1
Test-Time Training (yueatsprograms.github.io)
5 months ago |
jxmorris12
| github.io
|
newest
129
Thnickels (thick-coins.net)
5 months ago |
jxmorris12
| thick-coins.net
|
best
22
SFStreets: History of San Francisco place names (noahveltman.com)
5 months ago |
jxmorris12
| noahveltman.com
|
frontpage
1
Muon Doesn't Clearly Grok Faster (essential.ai)
5 months ago |
jxmorris12
| essential.ai
|
newest
1
René Girard and Mimetic Theory for Non-Philosophers (siboehm.com)
5 months ago |
jxmorris12
| siboehm.com
|
newest
2
Becoming a Better Programmer by Tightening Feedback Loops (siboehm.com)
5 months ago |
jxmorris12
| siboehm.com
|
newest
1
Approximating Language Model Training Data from Weights (arxiv.org)
5 months ago |
jxmorris12
| arxiv.org
|
newest
Next >