AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🔥 SOTA ALERT: new Text-to-Video #AI 🔥

👉#META unveils a novel Text-to-Video (T2V) generation #AI

😎Review https://bit.ly/3E1ZDzG
😎Project https://makeavideo.studio/
😎Paper makeavideo.studio/Make-A-Video.pdf
🤯9 👍6 😱1 💩1
🔥DreamFusion: Text-to-3D via Diffusion🔥

👉DeepDream-like procedure to create #3D assets from just a text prompt

😎Review https://bit.ly/3BYY5nu
😎Paper arxiv.org/pdf/2209.14988.pdf
😎Project dreamfusion3d.github.io/gallery.html
🤯12 👍5 💩1
🧪 Light Field Neural Rendering 🧪

👉Two-stage transformer capable of rendering non-Lambertian effects (reflection, refraction, translucency)

😎Review https://bit.ly/3CpIFdm
😎Paper arxiv.org/pdf/2112.09687.pdf
😎Project light-field-neural-rendering.github.io
😎Code github.com/google-research/google-research/tree/master/light_field_neural_rendering
🤯14 👍1
🦩Phenaki: Text-to(LOOONG)Video generation🦩

👉Phenaki is an #AI capable of realistic long video synthesis, given a sequence of open-domain text prompts

😎Review https://bit.ly/3RwUvXx
😎Project phenaki.video/index.h
😎Paper openreview.net/pdf?id=vOEXS39nOF
🔥7 ❤3 👍1
🔥 VToonify: Neural Portrait Style Transfer 🔥

👉VToonify for portrait style transfer. Powered by a DualStyleGAN backbone, now with #stablediffusion!

😎Review https://bit.ly/3M9wgNP
😎Demo https://t.co/8gXzF3IrpB
😎Paper arxiv.org/pdf/2209.11224.pdf
😎Project mmlab-ntu.com/project/vtoonify
😎Code github.com/williamyang1991/VToonify
👍22 ❤3 🤯2 🔥1 👏1 💩1
🐢 Stable Diffusion for #Pokemon 🐢

👉Fine-tuning Stable Diffusion to create a text-to-Pokemon generation model

😎Review https://bit.ly/3C9qBTw
😎Tutorial https://lambdalabs.com/blog/how-to-fine-tune-stable-diffusion-how-we-made-the-text-to-pokemon-model-at-lambda/
❤8 👍4
🔥 Imagen Video by #Google. SICK! 🔥

👉Novel text-conditional video generation via a cascade of video diffusion models 🤯

😎Review https://bit.ly/3SH2TVH
😎Project imagen.research.google/video/
😎Paper imagen.research.google/video/paper.pdf
🤯20 🔥7 👍1
🔥 Human MDM: source code is out! 🔥

👉A classifier-free diffusion-based generative model for the human motion domain

😎Review https://bit.ly/3rFhR2G
😎Project guytevet.github.io/mdm-page
😎Paper arxiv.org/pdf/2209.14916.pdf
😎Code github.com/GuyTevet/motion-diffusion-model
🔥6 👍1
⚛️SOTA ALERT! Particle Tracking ⚛️

👉The new SOTA in video particle tracking. "Old school" taste, with neural flavor 🧡

😎Review https://bit.ly/3CaU5Ai
😎Project particle-video-revisited.github.io/
😎Paper arxiv.org/pdf/2204.04153.pdf
😎Code github.com/aharley/pips
👍7 🥰4 🔥1
🔥 #AIwithPapers: we are 4,500+! 🔥

💙💛 Someone put the smiling 💩 under a few recent posts. But I still love you! 💙💛

😈 Invite your friends -> https://t.me/AI_DeepLearning
❤18 💩7 🔥5 👍3 🥰1
Long Video via Transformers

👉TECO is a vector-quantized latent dynamics prediction model for long video

😎Review https://bit.ly/3Ch0tWD
😎Project wilson1yan.github.io/teco/
😎Paper arxiv.org/pdf/2210.02396.pdf
😎Code github.com/wilson1yan/teco
👍7
🔥SIMPLI: light novel-view synthesis🔥

👉Lightweight novel-view synthesis by #Samsung for arbitrary forward-facing scenes

😎Review https://bit.ly/3CivSYZ
😎Project samsunglabs.github.io/MLI
😎Code github.com/SamsungLabs/MLI
😎Paper samsunglabs.github.io/MLI/paper/paper.pdf
👍8
EVA3D: new SOTA in #3D humans

👉EVA3D: new SOTA for unconditional NeRF-human generation from 2D images only

😎Review https://bit.ly/3Th9qX7
😎Code github.com/hongfz16/EVA3D
😎Paper arxiv.org/pdf/2210.04888.pdf
😎Project hongfz16.github.io/projects/EVA3D.html
🔥14 👍2
🍏 f-DM: Diffusion Models by Apple 🍏

👉Spectacular work by #Apple on DMs: HQ generation with better efficiency and semantics

😎Review https://bit.ly/3Tils2u
😎Project https://jiataogu.me/fdm/
😎Paper arxiv.org/pdf/2210.04955.pdf
❤10 😱2 👍1
GENIE by #Nvidia -> Faster Generation

👉Higher-Order Denoising Diffusion Solvers for faster and better synthesis

😎Review https://bit.ly/3CRjtwr
😎Project nv-tlabs.github.io/GENIE/
😎Paper arxiv.org/pdf/2210.05475.pdf
😎Code github.com/nv-tlabs/GENIE
🔥10 👍4
🥬 "Perception Test" by #DeepMind 🥬

👉Huge dataset with object & point tracks, temporal sound segments, multiple-choice & grounded video QA

😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
👍15 🔥4 😱3
🔥 Matterport 3D Semantics Dataset 🔥

👉#Meta opens HM3DSEM, the largest #3D real-world dataset with dense semantic annotations

😎Review https://bit.ly/3yF4W4G
😎Paper arxiv.org/pdf/2210.05633.pdf
😎Project aihabitat.org/datasets/hm3d-semantics
😎Data github.com/matterport/habitat-matterport-3dresearch
👍13
🦑 Instant Map-free Relocalization 🦑

👉#Niantic unveils a novel instant, metric-scaled relocalization from a single photo

😎Review https://bit.ly/3S1Gdyh
😎Paper arxiv.org/pdf/2210.05494.pdf
😎Project research.nianticlabs.com/mapfree-reloc-benchmark
😎Data research.nianticlabs.com/mapfree-reloc-benchmark/dataset
🔥13 👍2
🧮 Novel DM for 3D Shapes by #Nvidia 🧮

👉Hierarchical Latent Point Diffusion Model (LION) for 3D shape generation

😎Review https://bit.ly/3yDhZ6I
😎Paper arxiv.org/pdf/2210.06978.pdf
😎Project https://nv-tlabs.github.io/LION/
😎Code(soon) github.com/nv-tlabs/LION
❤11 😱2 🔥1
🪲#6D estimation fully in the wild🪲

👉First ever self-supervised 6D pose estimation training in the wild

😎Review https://bit.ly/3yHdHuS
😎Paper arxiv.org/pdf/2210.07199.pdf
😎Project kywind.github.io/self-pose
😎Code (soon)
👍15 🤯8 😱4