Hacker News headlines

2

Learning to Model the World with Language (dynalang.github.io)

a day ago | jxmorris12 | github.io | newest

2

A hitchhiker's guide to CUDA programming (seanzhang.me)

a week ago | jxmorris12 | seanzhang.me | frontpage

46

Estimating the perceived 'claustrophobia' of New York City's streets (2024) (mfranchi.net)

a week ago | jxmorris12 | mfranchi.net | best

166

Tinkering is a way to acquire good taste (seated.ro)

a week ago | jxmorris12 | seated.ro | best

2

Modern LLM Training (A Summary) (lesswrong.com)

a week ago | jxmorris12 | lesswrong.com | newest

2

Yes it's just doing compression. No it's not the diss you think it is (blog.wtf.sg)

a week ago | jxmorris12 | wtf.sg | newest

2

Good developer relations is about being a celebrity for dorks (pfiffer.org)

2 weeks ago | jxmorris12 | pfiffer.org | newest

2

Prompt Baking (arxiv.org)

2 weeks ago | jxmorris12 | arxiv.org | frontpage

1

Offline "Studying" Shrinks the Cost of Contextually Aware AI (stanford.edu)

2 weeks ago | jxmorris12 | stanford.edu | newest

5

The State of Machine Learning Frameworks in 2019 (thegradient.pub)

2 weeks ago | jxmorris12 | thegradient.pub | frontpage

1

Many AI Safety Orgs Have Tried to Criminalize Open-Source AI (2024) (1a3orn.com)

3 weeks ago | jxmorris12 | 1a3orn.com | newest

1

Neural Networks and Deep Learning (neuralnetworksanddeeplearning.com)

3 weeks ago | jxmorris12 | neuralnetworksanddeeplearning.com | newest

105

America's future could hinge on whether AI slightly disappoints (noahpinion.blog)

3 weeks ago | jxmorris12 | noahpinion.blog | best

3

Self-Respect (By Joan Didion) (1961) (gatech.edu)

3 weeks ago | jxmorris12 | gatech.edu | newest

12

Read your way through Hà Nội (vietnamesetypography.com)

3 weeks ago | jxmorris12 | vietnamesetypography.com | frontpage

59

How hard do you have to hit a chicken to cook it? (2020) (james-simon.github.io)

4 weeks ago | jxmorris12 | github.io | best

2

Start a Blog (guzey.com)

4 weeks ago | jxmorris12 | guzey.com | newest

1

Computable Babylonian Diaries Project (christopherwolfram.com)

4 weeks ago | jxmorris12 | christopherwolfram.com | frontpage

1

Survival of the Best Fit (survivalofthebestfit.com)

a month ago | jxmorris12 | survivalofthebestfit.com | newest

3

Breath of the Wild Decompilation (botw.link)

a month ago | jxmorris12 | botw.link | newest

2

The Politics of Contagion (emilybynight.com)

a month ago | jxmorris12 | emilybynight.com | newest

8

A PhD in Snapshots (rbharath.github.io)

a month ago | jxmorris12 | github.io | frontpage

31

Memory access is O(N^[1/3]) (vitalik.eth.limo)

a month ago | jxmorris12 | eth.limo | frontpage

1

Highrises (hythacg.com)

a month ago | jxmorris12 | hythacg.com | newest

30

How does gradient descent work? (centralflows.github.io)

a month ago | jxmorris12 | github.io | best

3

Small Products That Improved My Life (moultano.wordpress.com)

a month ago | jxmorris12 | wordpress.com | newest

1

Whispers of A.I.'s Modular Future (2023) (newyorker.com)

a month ago | jxmorris12 | newyorker.com | newest

1

An Age of AI Enlightenment (xiangfu.co)

a month ago | jxmorris12 | xiangfu.co | newest

1

A vision researcher's guide to some RL stuff: PPO and GRPO (yugeten.github.io)

a month ago | jxmorris12 | github.io | newest

3

LLMs are strangely-shaped tools (near.blog)

a month ago | jxmorris12 | near.blog | newest

1

Learned Structures (nonint.com)

a month ago | jxmorris12 | nonint.com | newest

1

LoRA-XS: Low-Rank Adaptation with Small Number of Parameters (arxiv.org)

a month ago | jxmorris12 | arxiv.org | newest

8

Evals in 2025: going beyond simple benchmarks to build models people can use (github.com/huggingface)

a month ago | jxmorris12 | github.com | frontpage

1

Dissecting Batching Effects in GPT Inference (qun.ch)

a month ago | jxmorris12 | qun.ch | newest

2

My (speculative) master plan for immortality (maxwellnye.com)

a month ago | jxmorris12 | maxwellnye.com | newest

3

Richard Feynman and the Connection Machine (1989) (longnow.org)

a month ago | jxmorris12 | longnow.org | frontpage

98

Defeating Nondeterminism in LLM Inference (thinkingmachines.ai)

a month ago | jxmorris12 | thinkingmachines.ai | best

12

Perceived Age (2024) (sdan.io)

a month ago | jxmorris12 | sdan.io | frontpage

2

Don't Build an RL Environment Startup (benanderson.work)

2 months ago | jxmorris12 | benanderson.work | newest

1

Shifting Bits in Company History (williamyeny.github.io)

2 months ago | jxmorris12 | github.io | newest

1

ML Systems: Motivating Dense Models (jacobkahn.me)

2 months ago | jxmorris12 | jacobkahn.me | newest

4

The "it" in AI models is the dataset (nonint.com)

2 months ago | jxmorris12 | nonint.com | newest

1

The Paradigm (nonint.com)

2 months ago | jxmorris12 | nonint.com | frontpage

1

Personalization, measuring with taste, and intrinsic interfaces (thesephist.com)

3 months ago | jxmorris12 | thesephist.com | newest

3

Long Term Memory in AI (Princeton CS 597A) (edoliberty.github.io)

3 months ago | jxmorris12 | github.io | newest

1

Model Merging – A Biased Overview (crisostomi.github.io)

3 months ago | jxmorris12 | github.io | newest

1

Adversarial Examples Are Not Bugs, They Are Superposition (livgorton.com)

3 months ago | jxmorris12 | livgorton.com | newest

1

Sequence Parallelism: Long Sequence Training from System Perspective (2021) (arxiv.org)

3 months ago | jxmorris12 | arxiv.org | newest

10

How many paths of length K are there between A and B? (2021) (horace.io)

3 months ago | jxmorris12 | horace.io | frontpage

2

How A Neuron Learns (rvns.moe)

3 months ago | jxmorris12 | rvns.moe | newest

1

GPT, Fast (pytorch.org)

3 months ago | jxmorris12 | pytorch.org | newest

1

GPT-Fast (github.com/meta-pytorch)

3 months ago | jxmorris12 | github.com | newest

10

Exploring EXIF (2023) (hturan.com)

3 months ago | jxmorris12 | hturan.com | frontpage

1

The Practitioner's Guide to the Maximal Update Parameterization (cerebras.ai)

3 months ago | jxmorris12 | cerebras.ai | newest

1

The scientific method and its application to the science of deep learning (james-simon.github.io)

3 months ago | jxmorris12 | github.io | newest

2

Solving Humanity's Last Exam Problems (youtube.com)

3 months ago | jxmorris12 | youtube.com | newest

1

Why We Think (lilianweng.github.io)

3 months ago | jxmorris12 | github.io | newest

5

Philosophical Thoughts on Kolmogorov-Arnold Networks (2024) (kindxiaoming.github.io)

3 months ago | jxmorris12 | github.io | frontpage

1

Matmul() using PyTorch's MPs back end is faster than Apple's MLX (kevinmartinjose.com)

3 months ago | jxmorris12 | kevinmartinjose.com | newest

2

The Making of Gemini Plays Pokémon (jcz.dev)

3 months ago | jxmorris12 | jcz.dev | newest

6

Facebook is not worth $33B (2010) (signalvnoise.com)

3 months ago | jxmorris12 | signalvnoise.com | frontpage

3

Comefrom (wikipedia.org)

3 months ago | jxmorris12 | wikipedia.org | newest

1

Diffusion Language Models Are Super Data Learners (jinjieni.notion.site)

3 months ago | jxmorris12 | notion.site | newest

2

How to build a router for MOE models (cerebras.ai)

3 months ago | jxmorris12 | cerebras.ai | newest

2

The Eponymous Principles of Management – Coase's Ceiling and Floor (amvaishnav.wordpress.com)

3 months ago | jxmorris12 | wordpress.com | newest

70

[flagged] No One Is Working (humaninvariant.com)

3 months ago | jxmorris12 | humaninvariant.com | frontpage

3

No One Is Working (humaninvariant.com)

3 months ago | jxmorris12 | humaninvariant.com | newest

1

SFT Is Bad RL (justinchiu.netlify.app)

3 months ago | jxmorris12 | netlify.app | newest

8

A Simple CPU on the Game of Life (2021) (carlini.com)

3 months ago | jxmorris12 | carlini.com | frontpage

2

Trends in LLM-Generated Citations on ArXiv (spylab.ai)

3 months ago | jxmorris12 | spylab.ai | newest

3

'AI' just means LLMs now (jxmo.io)

3 months ago | jxmorris12 | jxmo.io | newest

2

Ada Lovelace and the Analytical Engine (ox.ac.uk)

3 months ago | jxmorris12 | ox.ac.uk | newest

40

How long before superintelligence? (1997) (nickbostrom.com)

4 months ago | jxmorris12 | nickbostrom.com | frontpage

65

Attention is your scarcest resource (2020) (benkuhn.net)

4 months ago | jxmorris12 | benkuhn.net | best

1

DeltaNet Explained (sustcsonglin.github.io)

4 months ago | jxmorris12 | github.io | newest

84

All AI models might be the same (jxmo.io)

4 months ago | jxmorris12 | jxmo.io | best

1

Life Update – On Health (jiha-kim.github.io)

4 months ago | jxmorris12 | github.io | newest

1

Asymmetry of Verification and Verifier's Law (jasonwei.net)

4 months ago | jxmorris12 | jasonwei.net | frontpage

7

Soviet College Admission – My Dad's Story (1970) (ilyavolodarsky.com)

4 months ago | jxmorris12 | ilyavolodarsky.com | frontpage

2

H-Net – Inference (main-horse.github.io)

4 months ago | jxmorris12 | github.io | newest

8

How to scale RL to 10^26 FLOPs (jxmo.io)

4 months ago | jxmorris12 | jxmo.io | frontpage

1

Britain is cheap, and should learn to love it (economist.com)

4 months ago | jxmorris12 | economist.com | newest

2

Database Sharding (planetscale.com)

4 months ago | jxmorris12 | planetscale.com | newest

3

Microdosing Willpower: My Takeaways from Microdosing Ozempic (substack.com)

4 months ago | jxmorris12 | substack.com | newest

29

The upcoming GPT-3 moment for RL (mechanize.work)

4 months ago | jxmorris12 | mechanize.work | best

19

The Tradeoffs of SSMs and Transformers (goombalab.github.io)

4 months ago | jxmorris12 | github.io | frontpage

1

Things you can do –with uv (zaloog.github.io)

4 months ago | jxmorris12 | github.io | newest

24

The era of exploration (yidingjiang.github.io)

4 months ago | jxmorris12 | github.io | best

5

Just Ask for Generalization (2021) (evjang.com)

4 months ago | jxmorris12 | evjang.com | frontpage

3

Will Scaling Solve Robotics? (nishanthjkumar.com)

4 months ago | jxmorris12 | nishanthjkumar.com | frontpage

3

VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (vllm.ai)

4 months ago | jxmorris12 | vllm.ai | frontpage

4

LLM Memory (grantslatton.com)

5 months ago | jxmorris12 | grantslatton.com | frontpage

126

What Problems to Solve (1966) (cat-v.org)

5 months ago | jxmorris12 | cat-v.org | best

1

Test-Time Training (yueatsprograms.github.io)

5 months ago | jxmorris12 | github.io | newest

129

Thnickels (thick-coins.net)

5 months ago | jxmorris12 | thick-coins.net | best

22

SFStreets: History of San Francisco place names (noahveltman.com)

5 months ago | jxmorris12 | noahveltman.com | frontpage

1

Muon Doesn't Clearly Grok Faster (essential.ai)

5 months ago | jxmorris12 | essential.ai | newest

1

René Girard and Mimetic Theory for Non-Philosophers (siboehm.com)

5 months ago | jxmorris12 | siboehm.com | newest

2

Becoming a Better Programmer by Tightening Feedback Loops (siboehm.com)

5 months ago | jxmorris12 | siboehm.com | newest

1

Approximating Language Model Training Data from Weights (arxiv.org)

5 months ago | jxmorris12 | arxiv.org | newest