Hacker News headlines

4

Rabbit R1 source code analysis by Retr0id (github.com/rabbitscam)

2 days ago | GaggiX | github.com | newest

0

MeshLRM: Large Reconstruction Model for High-Quality Meshes (sarahweiii.github.io)

6 days ago | GaggiX | github.io | newest

17

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior (animate-your-word.github.io)

5 days ago | GaggiX | github.io | best

2

Bringing generative AI to video editing workflows in Adobe Premiere Pro (adobe.com)

a week ago | GaggiX | adobe.com | newest

1

Open model Command R+ beats GPT-4 in the LMSYS Chatbot Arena (reddit.com)

2 weeks ago | GaggiX | reddit.com | newest

3

Mixture-of-Depths: Dynamically allocating compute in transformer language models (arxiv.org)

3 weeks ago | GaggiX | arxiv.org | newest

31

Qwen1.5-Moe: Matching 7B Model Performance with 1/3 Activated Parameters (qwenlm.github.io)

4 weeks ago | GaggiX | github.io | best

38

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild (jasonppy.github.io)

4 weeks ago | GaggiX | github.io | best

1

Claude 3 Haiku is ranked #6 on LLM arena (huggingface.co)

a month ago | GaggiX | huggingface.co | newest

0

A new way to search and connect. Only on Android (ai.android)

2 months ago | GaggiX | ai.android | newest

1

Possible Mistral Medium model leak? (twitter.com/qtnx_)

3 months ago | GaggiX | twitter.com | newest

1

Moe-LLaVA: Mixture of Experts for Large Vision-Language Models (github.com/pku-yuangroup)

3 months ago | GaggiX | github.com | newest

2

SupIR: Revolutionizing image restoration with cutting-edge large-scale AI (xpixel.group)

3 months ago | GaggiX | xpixel.group | newest

1

AutoRT: Foundation Models for Large Scale Orchestration of Robotic Agents (auto-rt.github.io)

3 months ago | GaggiX | github.io | newest

3

Midjourney V6 photorealistic images collection (reddit.com)

4 months ago | GaggiX | reddit.com | newest

2

Gemini vs GPT-4V: A Comparison and Combination of VLMs Through Qualitative Cases (arxiv.org)

4 months ago | GaggiX | arxiv.org | newest

2

CoSeR: Bridging Image and Language for Cognitive Super-Resolution (coser-main.github.io)

4 months ago | GaggiX | github.io | newest

1

VecFusion: Vector Font Generation with Diffusion (arxiv.org)

4 months ago | GaggiX | arxiv.org | newest

49

ReconFusion: 3D Reconstruction with Diffusion Priors (reconfusion.github.io)

4 months ago | GaggiX | github.io | best

24

Music ControlNet: Multiple Time-Varying Controls for Music Generation (musiccontrolnet.github.io)

5 months ago | GaggiX | github.io | best

2

Instant3D: Fast Text-to-3D with Sparse-View Generation (jiahao.ai)

5 months ago | GaggiX | jiahao.ai | newest

1

LRM: Large Reconstruction Model for Single Image to 3D (yiconghong.me)

5 months ago | GaggiX | yiconghong.me | newest

1

OpenAI Consistency Decoder (github.com/openai)

5 months ago | GaggiX | github.com | newest

50

Playing Pokemon Red with Reinforcement Learning (github.com/pwhiddy)

6 months ago | GaggiX | github.com | best

10

I ask DALLE-3 to generate a Pepe but each time I tell it to make it “more rare.” (twitter.com/willdepue)

7 months ago | GaggiX | twitter.com | newest

25

Q-Transformer: Scalable Reinforcement Learning via Autoregressive Q-Functions (q-transformer.github.io)

7 months ago | GaggiX | github.io | best

4

Stable Diffusion XL Inpainting model released (huggingface.co)

7 months ago | GaggiX | huggingface.co | frontpage

2

YouTube uses AI to summarize videos in latest test (theverge.com)

8 months ago | GaggiX | theverge.com | newest

2

AvatarVerse: High-Quality and Stable 3D Avatar Creation from Text and Pose (avatarverse3d.github.io)

8 months ago | GaggiX | github.io | newest

3

JEN-1: Text-Guided Music Generation with Omnidirectional Diffusion Models (futureverse.com)

8 months ago | GaggiX | futureverse.com | newest

55

Magic123: One Image to High-Quality 3D Object Generation (guochengqian.github.io)

8 months ago | GaggiX | github.io | best

1

[PDF] Scaling TransNormer to 175B Parameters (arxiv.org)

9 months ago | GaggiX | arxiv.org | newest

8

Announcing SDXL 1.0 (stability.ai)

9 months ago | GaggiX | stability.ai | frontpage

1

AUTOMATIC1111 webui updated to v1.5 (github.com/automatic1111)

9 months ago | GaggiX | github.com | newest

1

Brain2Music: Reconstructing Music from Human Brain Activity (google-research.github.io)

9 months ago | GaggiX | github.io | newest

2

Video2dataset: A simple tool for large video dataset curation (laion.ai)

9 months ago | GaggiX | laion.ai | newest

60

Stable Diffusion XL technical report [pdf] (github.com/stability-ai)

9 months ago | GaggiX | github.com | best

1

DragDiffusion: Diffusion Models for Interactive Point-Based Image Editing (arxiv.org)

10 months ago | GaggiX | arxiv.org | newest

3

Yuzu: Progress Report May 2023 (yuzu-emu.org)

10 months ago | GaggiX | yuzu-emu.org | newest

2

Rerender a Video: Zero-Shot Text-Guided Video-to-Video Translation (anonymous-31415926.github.io)

10 months ago | GaggiX | github.io | newest

1

Boot: Data-Free Distillation of Denoising Diffusion Models with Bootstrapping (jiataogu.me)

10 months ago | GaggiX | jiataogu.me | newest

2

VideoComposer: Compositional Video Synthesis with Motion Controllability (videocomposer.github.io)

10 months ago | GaggiX | github.io | newest

1

Video Adapter: Efficient Adaption of Text-to-Video Foundation Models (video-adapter.github.io)

10 months ago | GaggiX | github.io | newest

5

ControlNet for QR Code (reddit.com)

10 months ago | GaggiX | reddit.com | frontpage

1

No positional encoding outperforms all positional encoding variants in decoders (twitter.com/a_kazemnejad)

10 months ago | GaggiX | twitter.com | frontpage

1

Ghost in the Minecraft: Generally Capable Agents for Open-World Enviroments (arxiv.org)

11 months ago | GaggiX | arxiv.org | newest

2

Training Diffusion Models with Reinforcement Learning (rl-diffusion.github.io)

11 months ago | GaggiX | github.io | newest

1

Key-Locked Rank One Editing for Text-to-Image Personalization (nvidia.com)

11 months ago | GaggiX | nvidia.com | newest

3

In-Context Learning Unlocked for Diffusion Models (zhendong-wang.github.io)

11 months ago | GaggiX | github.io | newest

12

Text-to-Audio Generation Using Instruction Tuned LLM and Latent Diffusion Model (tango-web.github.io)

12 months ago | GaggiX | github.io | best

3

Training Stable Diffusion from Scratch for <$50k with MosaicML (mosaicml.com)

a year ago | GaggiX | mosaicml.com | newest

288

MiniGPT-4 (minigpt-4.github.io)

a year ago | GaggiX | github.io | best

2

A new Paella: simple and efficient text-to-image generation (laion.ai)

a year ago | GaggiX | laion.ai | newest

2

ControlNet v1.1 (github.com/lllyasviel)

a year ago | GaggiX | github.com | newest

1

GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models (nvlabs.github.io)

a year ago | GaggiX | github.io | newest

1

Taming Encoder for Zero Fine-Tuning Image Customization (arxiv.org)

a year ago | GaggiX | arxiv.org | newest

1

SuTI: Subject-Driven Text-to-Image Generation via Apprenticeship Learning (arxiv.org)

a year ago | GaggiX | arxiv.org | newest

2

Generative Artificial Intelligence Chatbots Have Risen to Human-Level Creativity (researchgate.net)

a year ago | GaggiX | researchgate.net | newest

2

Token Merging for Fast Stable Diffusion (github.com/dbolya)

a year ago | GaggiX | github.com | newest

2

LLaMA-Adapter: Efficient Fine-Tuning of LLaMA (github.com/zrrskywalker)

a year ago | GaggiX | github.com | newest

2

Donald Trump Shares Fake AI-Created Image of Himself on Truth Social (forbes.com/sites/mattnovak)

a year ago | GaggiX | forbes.com | newest

155

Zero-1-to-3: Zero-shot One Image to 3D Object (columbia.edu)

a year ago | GaggiX | columbia.edu | best

3

Privacy vulnerability in the Google Pixel's inbuilt screenshot editing tool (twitter.com/itssimontime)

a year ago | GaggiX | twitter.com | newest

128

Midjourney v5 can do hands (twitter.com/tristwolff)

a year ago | GaggiX | twitter.com | best

7

GigaGAN: Large-Scale GAN for Text-to-Image Synthesis (mingukkang.github.io)

a year ago | GaggiX | github.io | best

1

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models (jerryxu.net)

a year ago | GaggiX | jerryxu.net | newest

1

Using Stable Diffusion to solve IQ test (reddit.com)

a year ago | GaggiX | reddit.com | newest

41

Word-as-image for semantic typography (wordasimage.github.io)

a year ago | GaggiX | github.io | best

8

Rock, Paper, Scissors – Stable Diffusion Anime by Corridor [video] (youtube.com)

a year ago | GaggiX | youtube.com | frontpage

2

World’s first on-device demonstration of Stable Diffusion on an Android phone (qualcomm.com)

a year ago | GaggiX | qualcomm.com | newest

4

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision (google-research.github.io)

a year ago | GaggiX | github.io | newest

2

PADL: Language-Directed Physics-Based Character Control (nv-tlabs.github.io)

a year ago | GaggiX | github.io | newest

1

Blip-2 (arxiv.org)

a year ago | GaggiX | arxiv.org | newest

1

SingSong: Generating Musical Accompaniments from Singing (storage.googleapis.com)

a year ago | GaggiX | googleapis.com | newest

2

On the Importance of Noise Scheduling for Diffusion Models (arxiv.org)

a year ago | GaggiX | arxiv.org | newest

3

Scalable Adaptive Computation for Iterative Generation (arxiv.org)

a year ago | GaggiX | arxiv.org | newest

2

DeepMind: Human-Timescale Adaptation in an Open-Ended Task Space (sites.google.com)

a year ago | GaggiX | google.com | newest

3

Small Stable Diffusion (huggingface.co)

a year ago | GaggiX | huggingface.co | newest

5

ChatGPT knows Elon Musk is Twitter’s CEO, despite its 2021 learning cutoff (semafor.com)

a year ago | GaggiX | semafor.com | frontpage

3

Karlo – Dalle 2 model by KakaoBrain (github.com/kakaobrain)

a year ago | GaggiX | github.com | newest

2

CodeGeeX: A Multilingual Code Generative Model (huggingface.co)

a year ago | GaggiX | huggingface.co | newest

1

Stable Diffusion, custom in/outpainting model (github.com/jack000)

a year ago | GaggiX | github.com | newest

2

Stable Diffusion Image Variations

a year ago | GaggiX | github.com | newest

2

COYO-700M: Large-Scale Image-Text Pair Dataset

a year ago | GaggiX | github.com | newest

3

Stable Diffusion Finetuned on Pokemon

a year ago | GaggiX | twitter.com | frontpage