30
SPy: An interpreter and compiler for a fast statically typed variant of Python (antocuni.eu)
a day ago |
og_kalu
| antocuni.eu
|
frontpage
11
Emergent Introspective Awareness in Large Language Models (transformer-circuits.pub)
a week ago |
og_kalu
| transformer-circuits.pub
|
best
1
Quantifying the algorithmic improvement from reasoning models (epoch.ai)
3 months ago |
og_kalu
| epoch.ai
|
newest
1
Evidence of interrelated cognitive-like capabilities in large language models (sciencedirect.com)
5 months ago |
og_kalu
| sciencedirect.com
|
newest
9
Atlas: Learning to Optimally Memorize the Context at Test Time (arxiv.org)
6 months ago |
og_kalu
| arxiv.org
|
frontpage
15
Gemini Diffusion (deepmind.google)
6 months ago |
og_kalu
| deepmind.google
|
frontpage
3
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names (arxiv.org)
9 months ago |
og_kalu
| arxiv.org
|
newest
2
Over-Tokenized Transformer: Vocabulary Is Generally Worth Scaling (arxiv.org)
9 months ago |
og_kalu
| arxiv.org
|
newest
2
LLMs struggle with perception, not reasoning, in ARC-AGI (anokas.substack.com)
9 months ago |
og_kalu
| substack.com
|
newest
2
EvaByte: Efficient Byte-Level Language Models at Scale (hkunlp.github.io)
10 months ago |
og_kalu
| github.io
|
newest
1
Tell me about yourself: LLMs are aware of their learned behaviors (arxiv.org)
10 months ago |
og_kalu
| arxiv.org
|
newest
2
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (arxiv.org)
10 months ago |
og_kalu
| arxiv.org
|
newest
1
LLMs struggle with perception, not reasoning, in ARC-AGI (anokas.substack.com)
10 months ago |
og_kalu
| substack.com
|
newest
2
Byte Latent Transformer: Patches Scale Better Than Tokens (meta.com)
11 months ago |
og_kalu
| meta.com
|
newest
1
Mastering Board Games by External and Internal Planning with Language Models (deepmind.google)
11 months ago |
og_kalu
| deepmind.google
|
newest
2
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space (arxiv.org)
12 months ago |
og_kalu
| arxiv.org
|
newest
3
GameGen-X: Open-World Video Game Generation (gamegen-x.github.io)
a year ago |
og_kalu
| github.io
|
newest
45
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
best
4
Kurzgesagt: We Fell for the Oldest Lie on the Internet [video] (youtube.com)
a year ago |
og_kalu
| youtube.com
|
newest
1
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-Wise LoRA (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
2
Solving Global Lyapunov functions: open problem in mathematics with transformers (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
2
ChatGPT Topped 3B Visits in September (similarweb.com)
a year ago |
og_kalu
| similarweb.com
|
newest
1
Tx-LLM: Supporting therapeutic development with large language models (research.google)
a year ago |
og_kalu
| research.google
|
newest
1
Tx-LLM: Supporting therapeutic development with large language models (research.google)
a year ago |
og_kalu
| research.google
|
newest
2
Visual Autoregressive Modeling: Image Generation via Next-Resolution Prediction (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
4
xAI's Colossus (100k H100 cluster) has begun training (twitter.com/elonmusk)
a year ago |
og_kalu
| twitter.com
|
newest
1
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
1
GPT-4o's image generation capabilities (twitter.com/gdb)
a year ago |
og_kalu
| twitter.com
|
newest
1
LLMs for few-shot low level robot control by representing trajectories as tokens (twitter.com/ed__johns)
a year ago |
og_kalu
| twitter.com
|
newest
1
Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics (robot-learning.uk)
a year ago |
og_kalu
| robot-learning.uk
|
newest
2
You can now edit DALL·E images in ChatGPT (twitter.com/openai)
a year ago |
og_kalu
| twitter.com
|
newest
6
Microsoft and OpenAI Plot $100B AI Supercomputer Called "Stargate" (reuters.com)
a year ago |
og_kalu
| reuters.com
|
newest
1
Arrows of Time for Large Language Models (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
1
3D Vision-Language-Action Generative World Model (umass.edu)
a year ago |
og_kalu
| umass.edu
|
newest
2
Introducing RFM-1: Giving robots human-like reasoning capabilities (covariant.ai)
a year ago |
og_kalu
| covariant.ai
|
newest
1
LLMs and the Abstraction and Reasoning Corpus (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
2
Beyond Language Models: Byte Models Are Digital World Simulators (byte-gpt.github.io)
a year ago |
og_kalu
| github.io
|
newest
1
A Vision Check-Up for Language Models (csail.mit.edu)
a year ago |
og_kalu
| mit.edu
|
newest
1
The Impact of Reasoning Step Length on Large Language Models (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
36
Chain-of-Thought Reasoning Without Prompting (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
best
2
The Manga Whisperer: Automatically Generating Transcriptions for Comics (github.com/ragavsachdeva)
a year ago |
og_kalu
| github.com
|
newest
2
The Manga Whisperer: Automatically Generating Transcriptions for Comics (github.com/ragavsachdeva)
a year ago |
og_kalu
| github.com
|
newest
1
Show HN: Automatic Translation of Comics (Bande Dessinée, Manga, Webtoons, etc.) (github.com/ogkalu2)
a year ago |
og_kalu
| github.com
|
newest
1
Towards General World Models for Video Generation via Predicting Masked Tokens (world-dreamer.github.io)
a year ago |
og_kalu
| github.io
|
newest
1
Towards Conversational Diagnostic AI (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
43
OpenAI Announces $10M Superalignment Grants (openai.com)
a year ago |
og_kalu
| openai.com
|
newest
1
From Text to Motion: Grounding GPT-4 in a Humanoid Robot "Alter3" (tnoinkwms.github.io)
a year ago |
og_kalu
| github.io
|
newest
1
Using Large Language Models for Hyperparameter Optimization (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
2
Scaling Transformers for skillful and reliable medium-range weather forecasting (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
47
Sequential modeling enables scalable learning for large vision models (yutongbai.com)
a year ago |
og_kalu
| yutongbai.com
|
best
1
Subject/Style-driven image generation and Audio editing with a Multimodal LLM (codi-2.github.io)
a year ago |
og_kalu
| github.io
|
newest
2
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation (codi-2.github.io)
a year ago |
og_kalu
| github.io
|
newest
2
Max Tegmark on Computation, Substrate-Independence and Consciousness (edge.org)
a year ago |
og_kalu
| edge.org
|
newest
4
LLMs and the Abstraction and Reasoning Corpus (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
55
Misalignment and Deception by an autonomous stock trading LLM agent (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
best
5
LLMs and the Abstraction and Reasoning Corpus (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
2
Unprompted, LLM Agents can strategically deceive users when put under pressure (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
1
Brains, Planes, Blimps and Algorithms (reddit.com)
a year ago |
og_kalu
| reddit.com
|
newest
1
A conceptual precursor to today's language machines (hedgehogreview.com)
a year ago |
og_kalu
| hedgehogreview.com
|
newest
2
Large Language Models Can Strategically Deceive Their Users When Under Pressure (arxiv.org)
a year ago |
og_kalu
| arxiv.org
|
newest
7
Jarvis-1: Open-World Multi-Task Agents with Memory-Augmented Multimodal LLMs (craftjarvis-jarvis1.github.io)
a year ago |
og_kalu
| github.io
|
best
1
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics (robovqa.github.io)
a year ago |
og_kalu
| github.io
|
newest
2
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4Vision (som-gpt4v.github.io)
a year ago |
og_kalu
| github.io
|
newest
1
The Dark Side of Antarctica [video] (youtube.com)
a year ago |
og_kalu
| youtube.com
|
newest
1
Taken out of context: On measuring situational awareness in LLMs (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Interactive Robot Learning from Verbal Correction (ut-austin-rpl.github.io)
2 years ago |
og_kalu
| github.io
|
newest
1
Unleashing the Power of Pre-Trained LLMs for Offline Reinforcement Learning (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
4
CodeFusion: A Pre-Trained Diffusion Model for Code Generation (huggingface.co)
2 years ago |
og_kalu
| huggingface.co
|
newest
1
Learning in High Dimension Always Amounts to Extrapolation (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
2
Multi-Game Decision Transformers (sites.google.com)
2 years ago |
og_kalu
| google.com
|
newest
1
In-Context Learning Creates Task Vectors (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
8
LLMs playing chess are sensitive to how the position came to be (github.com/dpaleka)
2 years ago |
og_kalu
| github.com
|
frontpage
3
Comprehension of Sentences by Bottlenosed Dolphins (sciencedirect.com)
2 years ago |
og_kalu
| sciencedirect.com
|
newest
2
Is Computer Vision dead? (tenyks.ai)
2 years ago |
og_kalu
| tenyks.ai
|
newest
5
GPT-4 designs reward functions for robot dexterity at Super-human level (twitter.com/drjimfan)
2 years ago |
og_kalu
| twitter.com
|
frontpage
2
Eureka! Nvidia Research Breakthrough Puts New Spin on Robot Learning (nvidia.com)
2 years ago |
og_kalu
| nvidia.com
|
newest
2
Eureka: Human-Level Reward Design via Coding Large Language Models (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Revealing the structure of language model capabilities (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
10
85% of the variance in LLM performance is explained by a single factor, g (reddit.com)
2 years ago |
og_kalu
| reddit.com
|
frontpage
0
Character AI's Group Chat Feature (reddit.com)
2 years ago |
og_kalu
| reddit.com
|
newest
2
85% of the variance in LLM performance is explained by a single factor, g (reddit.com)
2 years ago |
og_kalu
| reddit.com
|
newest
1
Unveiling the General Intelligence Factor in Language Models (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
20
xVal: A continuous number encoding for large language models (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
best
1
Inductive reasoning in humans and large language models (sciencedirect.com)
2 years ago |
og_kalu
| sciencedirect.com
|
newest
2
Identifying depression and its determinants: ChatGPT vs. primary care physicians (bmj.com)
2 years ago |
og_kalu
| bmj.com
|
newest
1
Using Transformers for Multi-Agent Reinforcement Learning (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Multi-Agent Reinforcement Learning Is a Sequence Modeling Problem (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Large Language Models Can Learn Rules (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
2
The Geometry of Truth: Emergent Linear Structure in How LLMs Represent Truth (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Just Ask for Calibration: Eliciting Calibrated Confidence Scores from LLMs (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Can LLMs provide useful feedback on research papers? A broad empirical analysis (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
68
Training language models with pause tokens (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
best
1
Boolformer: Symbolic Regression of Logic Functions with Transformers (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
3
The Dawn of Large Multimodal Models: Preliminary Explorations with GPT-4V(ision) (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
1
Large Language Models as Superpositions of Cultural Perspectives (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
2
DALL-E 3 will finish rolling out to all Bing Image users by 8PM PST today (twitter.com/mparakhin)
2 years ago |
og_kalu
| twitter.com
|
newest
2
Large Language Models Understand and Can Be Enhanced by Emotional Stimuli (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
3
The Internal State of a Large Language Model Knows When It's Lying (arxiv.org)
2 years ago |
og_kalu
| arxiv.org
|
newest
2
Dalle-3 Results and Requests (reddit.com)
2 years ago |
og_kalu
| reddit.com
|
newest