4
Rabbit R1 source code analysis by Retr0id (github.com/rabbitscam)
2 days ago | GaggiX | github.com | newest
0
MeshLRM: Large Reconstruction Model for High-Quality Meshes (sarahweiii.github.io)
6 days ago | GaggiX | github.io | newest
17
Dynamic Typography: Bringing Text to Life via Video Diffusion Prior (animate-your-word.github.io)
5 days ago | GaggiX | github.io | best
2
Bringing generative AI to video editing workflows in Adobe Premiere Pro (adobe.com)
a week ago | GaggiX | adobe.com | newest
1
Open model Command R+ beats GPT-4 in the LMSYS Chatbot Arena (reddit.com)
2 weeks ago | GaggiX | reddit.com | newest
3
Mixture-of-Depths: Dynamically allocating compute in transformer language models (arxiv.org)
3 weeks ago | GaggiX | arxiv.org | newest
31
Qwen1.5-Moe: Matching 7B Model Performance with 1/3 Activated Parameters (qwenlm.github.io)
4 weeks ago | GaggiX | github.io | best
38
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild (jasonppy.github.io)
4 weeks ago | GaggiX | github.io | best
1
Claude 3 Haiku is ranked #6 on LLM arena (huggingface.co)
a month ago | GaggiX | huggingface.co | newest
0
A new way to search and connect. Only on Android (ai.android)
2 months ago | GaggiX | ai.android | newest
1
Possible Mistral Medium model leak? (twitter.com/qtnx_)
3 months ago | GaggiX | twitter.com | newest
1
Moe-LLaVA: Mixture of Experts for Large Vision-Language Models (github.com/pku-yuangroup)
3 months ago | GaggiX | github.com | newest
2
SupIR: Revolutionizing image restoration with cutting-edge large-scale AI (xpixel.group)
3 months ago | GaggiX | xpixel.group | newest
1
AutoRT: Foundation Models for Large Scale Orchestration of Robotic Agents (auto-rt.github.io)
3 months ago | GaggiX | github.io | newest
3
Midjourney V6 photorealistic images collection (reddit.com)
4 months ago | GaggiX | reddit.com | newest
2
Gemini vs GPT-4V: A Comparison and Combination of VLMs Through Qualitative Cases (arxiv.org)
4 months ago | GaggiX | arxiv.org | newest
2
CoSeR: Bridging Image and Language for Cognitive Super-Resolution (coser-main.github.io)
4 months ago | GaggiX | github.io | newest
1
VecFusion: Vector Font Generation with Diffusion (arxiv.org)
4 months ago | GaggiX | arxiv.org | newest
49
ReconFusion: 3D Reconstruction with Diffusion Priors (reconfusion.github.io)
4 months ago | GaggiX | github.io | best
24
Music ControlNet: Multiple Time-Varying Controls for Music Generation (musiccontrolnet.github.io)
5 months ago | GaggiX | github.io | best
2
Instant3D: Fast Text-to-3D with Sparse-View Generation (jiahao.ai)
5 months ago | GaggiX | jiahao.ai | newest
1
LRM: Large Reconstruction Model for Single Image to 3D (yiconghong.me)
5 months ago | GaggiX | yiconghong.me | newest
1
OpenAI Consistency Decoder (github.com/openai)
5 months ago | GaggiX | github.com | newest
50
Playing Pokemon Red with Reinforcement Learning (github.com/pwhiddy)
6 months ago | GaggiX | github.com | best
10
I ask DALLE-3 to generate a Pepe but each time I tell it to make it “more rare.” (twitter.com/willdepue)
7 months ago | GaggiX | twitter.com | newest
25
Q-Transformer: Scalable Reinforcement Learning via Autoregressive Q-Functions (q-transformer.github.io)
7 months ago | GaggiX | github.io | best
4
Stable Diffusion XL Inpainting model released (huggingface.co)
7 months ago | GaggiX | huggingface.co | frontpage
2
YouTube uses AI to summarize videos in latest test (theverge.com)
8 months ago | GaggiX | theverge.com | newest
2
AvatarVerse: High-Quality and Stable 3D Avatar Creation from Text and Pose (avatarverse3d.github.io)
8 months ago | GaggiX | github.io | newest
3
JEN-1: Text-Guided Music Generation with Omnidirectional Diffusion Models (futureverse.com)
8 months ago | GaggiX | futureverse.com | newest
55
Magic123: One Image to High-Quality 3D Object Generation (guochengqian.github.io)
8 months ago | GaggiX | github.io | best
1
[PDF] Scaling TransNormer to 175B Parameters (arxiv.org)
9 months ago | GaggiX | arxiv.org | newest
8
Announcing SDXL 1.0 (stability.ai)
9 months ago | GaggiX | stability.ai | frontpage
1
AUTOMATIC1111 webui updated to v1.5 (github.com/automatic1111)
9 months ago | GaggiX | github.com | newest
1
Brain2Music: Reconstructing Music from Human Brain Activity (google-research.github.io)
9 months ago | GaggiX | github.io | newest
2
Video2dataset: A simple tool for large video dataset curation (laion.ai)
9 months ago | GaggiX | laion.ai | newest
60
Stable Diffusion XL technical report [pdf] (github.com/stability-ai)
9 months ago | GaggiX | github.com | best
1
DragDiffusion: Diffusion Models for Interactive Point-Based Image Editing (arxiv.org)
10 months ago | GaggiX | arxiv.org | newest
3
Yuzu: Progress Report May 2023 (yuzu-emu.org)
10 months ago | GaggiX | yuzu-emu.org | newest
2
Rerender a Video: Zero-Shot Text-Guided Video-to-Video Translation (anonymous-31415926.github.io)
10 months ago | GaggiX | github.io | newest
1
Boot: Data-Free Distillation of Denoising Diffusion Models with Bootstrapping (jiataogu.me)
10 months ago | GaggiX | jiataogu.me | newest
2
VideoComposer: Compositional Video Synthesis with Motion Controllability (videocomposer.github.io)
10 months ago | GaggiX | github.io | newest
1
Video Adapter: Efficient Adaption of Text-to-Video Foundation Models (video-adapter.github.io)
10 months ago | GaggiX | github.io | newest
5
ControlNet for QR Code (reddit.com)
10 months ago | GaggiX | reddit.com | frontpage
1
No positional encoding outperforms all positional encoding variants in decoders (twitter.com/a_kazemnejad)
10 months ago | GaggiX | twitter.com | frontpage
1
Ghost in the Minecraft: Generally Capable Agents for Open-World Enviroments (arxiv.org)
11 months ago | GaggiX | arxiv.org | newest
2
Training Diffusion Models with Reinforcement Learning (rl-diffusion.github.io)
11 months ago | GaggiX | github.io | newest
1
Key-Locked Rank One Editing for Text-to-Image Personalization (nvidia.com)
11 months ago | GaggiX | nvidia.com | newest
3
In-Context Learning Unlocked for Diffusion Models (zhendong-wang.github.io)
11 months ago | GaggiX | github.io | newest
12
Text-to-Audio Generation Using Instruction Tuned LLM and Latent Diffusion Model (tango-web.github.io)
12 months ago | GaggiX | github.io | best
3
Training Stable Diffusion from Scratch for <$50k with MosaicML (mosaicml.com)
a year ago | GaggiX | mosaicml.com | newest
288
MiniGPT-4 (minigpt-4.github.io)
a year ago | GaggiX | github.io | best
2
A new Paella: simple and efficient text-to-image generation (laion.ai)
a year ago | GaggiX | laion.ai | newest
2
ControlNet v1.1 (github.com/lllyasviel)
a year ago | GaggiX | github.com | newest
1
GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models (nvlabs.github.io)
a year ago | GaggiX | github.io | newest
1
Taming Encoder for Zero Fine-Tuning Image Customization (arxiv.org)
a year ago | GaggiX | arxiv.org | newest
1
SuTI: Subject-Driven Text-to-Image Generation via Apprenticeship Learning (arxiv.org)
a year ago | GaggiX | arxiv.org | newest
2
Generative Artificial Intelligence Chatbots Have Risen to Human-Level Creativity (researchgate.net)
a year ago | GaggiX | researchgate.net | newest
2
Token Merging for Fast Stable Diffusion (github.com/dbolya)
a year ago | GaggiX | github.com | newest
2
LLaMA-Adapter: Efficient Fine-Tuning of LLaMA (github.com/zrrskywalker)
a year ago | GaggiX | github.com | newest
2
Donald Trump Shares Fake AI-Created Image of Himself on Truth Social (forbes.com/sites/mattnovak)
a year ago | GaggiX | forbes.com | newest
155
Zero-1-to-3: Zero-shot One Image to 3D Object (columbia.edu)
a year ago | GaggiX | columbia.edu | best
3
Privacy vulnerability in the Google Pixel's inbuilt screenshot editing tool (twitter.com/itssimontime)
a year ago | GaggiX | twitter.com | newest
128
Midjourney v5 can do hands (twitter.com/tristwolff)
a year ago | GaggiX | twitter.com | best
7
GigaGAN: Large-Scale GAN for Text-to-Image Synthesis (mingukkang.github.io)
a year ago | GaggiX | github.io | best
1
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models (jerryxu.net)
a year ago | GaggiX | jerryxu.net | newest
1
Using Stable Diffusion to solve IQ test (reddit.com)
a year ago | GaggiX | reddit.com | newest
41
Word-as-image for semantic typography (wordasimage.github.io)
a year ago | GaggiX | github.io | best
8
Rock, Paper, Scissors – Stable Diffusion Anime by Corridor [video] (youtube.com)
a year ago | GaggiX | youtube.com | frontpage
2
World’s first on-device demonstration of Stable Diffusion on an Android phone (qualcomm.com)
a year ago | GaggiX | qualcomm.com | newest
4
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision (google-research.github.io)
a year ago | GaggiX | github.io | newest
2
PADL: Language-Directed Physics-Based Character Control (nv-tlabs.github.io)
a year ago | GaggiX | github.io | newest
1
Blip-2 (arxiv.org)
a year ago | GaggiX | arxiv.org | newest
1
SingSong: Generating Musical Accompaniments from Singing (storage.googleapis.com)
a year ago | GaggiX | googleapis.com | newest
2
On the Importance of Noise Scheduling for Diffusion Models (arxiv.org)
a year ago | GaggiX | arxiv.org | newest
3
Scalable Adaptive Computation for Iterative Generation (arxiv.org)
a year ago | GaggiX | arxiv.org | newest
2
DeepMind: Human-Timescale Adaptation in an Open-Ended Task Space (sites.google.com)
a year ago | GaggiX | google.com | newest
3
Small Stable Diffusion (huggingface.co)
a year ago | GaggiX | huggingface.co | newest
5
ChatGPT knows Elon Musk is Twitter’s CEO, despite its 2021 learning cutoff (semafor.com)
a year ago | GaggiX | semafor.com | frontpage
3
Karlo – Dalle 2 model by KakaoBrain (github.com/kakaobrain)
a year ago | GaggiX | github.com | newest
2
CodeGeeX: A Multilingual Code Generative Model (huggingface.co)
a year ago | GaggiX | huggingface.co | newest
1
Stable Diffusion, custom in/outpainting model (github.com/jack000)
a year ago | GaggiX | github.com | newest
2
Stable Diffusion Image Variations
a year ago | GaggiX | github.com | newest
2
COYO-700M: Large-Scale Image-Text Pair Dataset
a year ago | GaggiX | github.com | newest
3
Stable Diffusion Finetuned on Pokemon
a year ago | GaggiX | twitter.com | frontpage