3
D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning (dllm-reasoning.github.io)
5 days ago | t55 | github.io | newest
1
Are LLMs more than autocomplete? AI Debate (rehearsal.so)
5 days ago | t55 | rehearsal.so | newest
20
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models (m-arriola.com)
5 days ago | t55 | m-arriola.com | frontpage
2
How to stay in flow while using Cursor or Windsurf (rehearsal.so)
5 days ago | t55 | rehearsal.so | newest
1
Generative Modelling in Latent Space (sander.ai)
5 days ago | t55 | sander.ai | newest
7
Show HN: Debate Uncle Bob – Is SQL Dead? (Voice RPG) (rehearsal.so)
a week ago | t55 | rehearsal.so | newest
1
OpenAI O3 and O4-Mini (openai.com)
3 weeks ago | t55 | openai.com | newest
5
Memory in ChatGPT (twitter.com/openai)
a month ago | t55 | twitter.com | frontpage
20
Superintelligence startup Reflection AI launches with $130M in funding (siliconangle.com)
2 months ago | t55 | siliconangle.com | frontpage
21
Intro to DeepSeek's open-source week and why it's a big deal (pyspur.dev)
2 months ago | t55 | pyspur.dev | frontpage
2
Anthropic Claude Code [video] (youtube.com)
3 months ago | t55 | youtube.com | newest
91
Introduction to CUDA programming for Python developers (pyspur.dev)
3 months ago | t55 | pyspur.dev | best
1
Novelty Left on the Table (ansatz.blog)
3 months ago | t55 | ansatz.blog | newest
3
Competitive Programming with Large Reasoning Models (arxiv.org)
3 months ago | t55 | arxiv.org | frontpage
2
The Differences Between Direct Alignment Algorithms Are a Blur (arxiv.org)
3 months ago | t55 | arxiv.org | frontpage
2
The Octalysis Framework for Gamification and Behavioral Design (yukaichou.com)
3 months ago | t55 | yukaichou.com | newest
11
S1: Simple Test-Time Scaling (github.com/simplescaling)
3 months ago | t55 | github.com | frontpage
1
A Malloc Tutorial [pdf] (github.com/zyfjeff)
3 months ago | t55 | github.com | newest
37
Reinforcement Learning: An Overview (arxiv.org)
3 months ago | t55 | arxiv.org | frontpage
1
What automated firms will look like (dwarkeshpatel.com)
3 months ago | t55 | dwarkeshpatel.com | newest
50
Large Language Models for Mathematicians (2023) (arxiv.org)
3 months ago | t55 | arxiv.org | frontpage
2
Mathematics for Machine Learning (mml-book.github.io)
3 months ago | t55 | github.io | newest
2
Propositional Interpretability in Artificial Intelligence (arxiv.org)
4 months ago | t55 | arxiv.org | newest
43
The Tensor Cookbook (2024) (tensorcookbook.com)
4 months ago | t55 | tensorcookbook.com | best
14
ArXiv LaTeX Cleaner: Clean the LaTeX code of your paper to submit to ArXiv (github.com/google-research)
3 months ago | t55 | github.com | frontpage
1
Tesla Unveils Autonomous Cleaning Robot for Robotaxi (twitter.com/tesla)
4 months ago | t55 | twitter.com | newest
1
O3-Mini vs. DeepSeek-R1: Which One Is Safer? (arxiv.org)
4 months ago | t55 | arxiv.org | newest
4
Qwen Chat – Another Chinese ChatGPT Rival (qwenlm.ai)
4 months ago | t55 | qwenlm.ai | frontpage
3
Systemic Existential Risks from Incremental AI Development (gradual-disempowerment.ai)
4 months ago | t55 | gradual-disempowerment.ai | newest
1
The risk from prompt injection attacks on AI systems (googleblog.com)
4 months ago | t55 | googleblog.com | newest
2
The Desire to Be Liked Is Rotting Your Brain (flocrivello.com)
4 months ago | t55 | flocrivello.com | newest
1
Thou Shalt Not Overfit (argmin.net)
4 months ago | t55 | argmin.net | newest
2
Obsidian's Web viewer lets you open external links within Obsidian (obsidian.md)
4 months ago | t55 | obsidian.md | newest
46
Quaternions and spherical trigonometry (terrytao.wordpress.com)
4 months ago | t55 | wordpress.com | frontpage
3
Qwen2.5-VL: State-of-the-art multimodal LLM (github.com/qwenlm)
4 months ago | t55 | github.com | newest
7
Goose – an open-source, extensible AI agent that goes beyond code suggestions (github.com/block)
4 months ago | t55 | github.com | frontpage
2
I don't believe DeepSeek crashed Nvidia's stock (understandingai.org)
4 months ago | t55 | understandingai.org | newest
2
Large Language Model Training Using FP4 Quantization (arxiv.org)
4 months ago | t55 | arxiv.org | newest
1
Supervised Fine-Tuning Memorizes, RL Generalizes (arxiv.org)
4 months ago | t55 | arxiv.org | newest
95
DeepSeek's multi-head latent attention and other KV cache tricks (pyspur.dev)
4 months ago | t55 | pyspur.dev | best
2
VideoRAG: Retrieval-Augmented Generation over Video Corpus (arxiv.org)
4 months ago | t55 | arxiv.org | frontpage
6
Cryptoscammers Impersonated and Hacked Us – Now What? (pyspur.dev)
4 months ago | t55 | pyspur.dev | newest
1
Comparing Llama 3.2 vs. Gemma 2 vs. Mistral on philosophical questions (reddit.com)
5 months ago | t55 | reddit.com | newest
3
Show HN: Graph-Based Editor for LLM Workflows (github.com/pyspur-dev)
5 months ago | t55 | github.com | newest
38
ChatGPT's Advanced Voice Mode adds Santa Mode, Live Video, Screensharing (openai.com)
5 months ago | t55 | openai.com | frontpage
1
Visual Autoregressive Modeling: Image Generation via Next-Scale Prediction (openreview.net)
5 months ago | t55 | openreview.net | newest
105
Canvas (openai.com)
5 months ago | t55 | openai.com | best
1
Luigi Mangione's Storyline (defenderofbasic.github.io)
5 months ago | t55 | github.io | newest
1
How to profile CUDA kernels in PyTorch [video] (youtube.com)
5 months ago | t55 | youtube.com | newest
2
Three senior Ex-DeepMind researchers about to open OpenAI's Zurich Office (twitter.com/xiaohuazhai)
5 months ago | t55 | twitter.com | newest
3
How to Tell Great Stories (julian.com)
5 months ago | t55 | julian.com | frontpage
3
'I'll Be Fine' in Prison: Pump.fun Attacker Pleads Guilty (decrypt.co)
5 months ago | t55 | decrypt.co | newest
4
P-99: Ninety-Nine Prolog Problems (ic.unicamp.br)
5 months ago | t55 | unicamp.br | frontpage
13
Efficient Track Anything (yformer.github.io)
5 months ago | t55 | github.io | frontpage
3
Conventional Commit Messages (gist.github.com)
5 months ago | t55 | github.com | frontpage
4
Evening use of eReaders negatively affects sleep, and next-morning alertness (nih.gov)
5 months ago | t55 | nih.gov | newest
1
Challenges and Applications of Large Language Models (arxiv.org)
5 months ago | t55 | arxiv.org | newest
1
Has Volkswagen Lost Its Way? [video] (youtube.com)
5 months ago | t55 | youtube.com | newest
2
Apple showcases AirPods Pro 2 hearing aid feature (9to5mac.com)
6 months ago | t55 | 9to5mac.com | newest