AI with Papers - Artificial Intelligence & Deep Learning

🪛 AgileAvatar: HQ stylized 3D #Avatar 🪛

👉#ByteDance unveils a novel self-supervised framework for #3D avatars

😎Review https://bit.ly/3kaOAw6
😎Project ssangx.github.io/projects/agileavatar
😎Paper ssangx.github.io/pubs/2022-SIGGRAPHAsia-AgileAvatar.pdf

👍9❤3

4.48K views18:55

🍥 PanoHead: 3D Full-Head Synthesis 🍥

👉#ByteDance (+UW-M) unveils PanoHead: 360◦ view-consistent portraits from a single-view image

😎Review https://t.ly/MrLNR
😎Paper arxiv.org/pdf/2303.13071.pdf
😎Project sizhean.github.io/panohead
😎Code github.com/sizhean/panohead

🔥7❤4🤯3😱1

5.9K viewsedited 07:12

🐤 MagicVideo-V2 announced! 🐤

👉#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description

👉Review https://t.ly/zIq4v
👉Project https://lnkd.in/dKUrJPJd
👉Paper https://lnkd.in/dixnN-kU

🔥7❤1👍1🥰1💩1

6.38K viewsedited 07:48

🆔 Magic-Me: ID-Specific Video 🆔

👉#ByteDance VCD: with just a few images of a specific identity it can generate temporal consistent videos aligned with the given prompt

👉Review https://t.ly/qjJ2O
👉Paper arxiv.org/pdf/2402.09368.pdf
👉Project magic-me-webpage.github.io
👉Code github.com/Zhen-Dong/Magic-Me

❤6🥰1🤯1🤣1

8.07K viewsedited 15:27

⛽ VoRA: Vision as LoRA ⛽

👉#ByteDance unveils Vision as LoRA (VoRA), a novel paradigm converting LLMs into Multimodal Large Language Models (MLLMs) by integrating vision-specific LoRA layers. All training data, codes, and model weights available💙

👉Review https://t.ly/guNVN
👉Paper arxiv.org/pdf/2503.20680
👉Repo github.com/Hon-Wong/VoRA
👉Project georgeluimmortal.github.io/vora-homepage.github.io/

👍15❤7🤯4👏1

7.84K viewsedited 06:59