AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ#Google just announced "TensorStore"๐Ÿ”ฅ

๐Ÿ‘‰Novel open-source C++ / #Python library for storage/manipulation of high-dim data

๐Ÿ˜ŽReview https://bit.ly/3DLwbha
๐Ÿ˜ŽProject https://bit.ly/3C4T2TR
๐Ÿ˜ŽCode github.com/google/tensorstore
๐Ÿ”ฅ14๐Ÿ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ  Motion Transformer for #selfdriving ๐Ÿฆ 

๐Ÿ‘‰The 1st place solution for 2022 #waymo "motion prediction" challenge

๐Ÿ˜ŽReview https://bit.ly/3f8G4LD
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.10033.pdf
๐Ÿ˜ŽCode github.com/sshaoshuai/MTR
๐Ÿ”ฅ17๐Ÿ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’น Image Synthesis @160+ FPS! ๐Ÿ’น

๐Ÿ‘‰Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

๐Ÿ˜ŽReview https://bit.ly/3r3ZNij
๐Ÿ˜ŽPaper arxiv.org/pdf/2206.07695.pdf
๐Ÿ˜ŽProject katjaschwarz.github.io/voxgraf
๐Ÿ‘3๐Ÿคฏ2๐Ÿ”ฅ1๐Ÿ’ฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘› #Nvidia GET3D: #3D generative #AI ๐Ÿ‘›

๐Ÿ‘‰AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

๐Ÿ˜ŽReview https://bit.ly/3SgnT5h
๐Ÿ˜ŽCode github.com/nv-tlabs/GET3D
๐Ÿ˜ŽProject nv-tlabs.github.io/GET3D/
๐Ÿ˜ŽPaper nv-tlabs.github.io/GET3D/assets/paper.pdf
โคโ€๐Ÿ”ฅ7๐Ÿ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ๐Ÿ”ฅ IDE-3D: source code is out! ๐Ÿ”ฅ๐Ÿ”ฅ

๐Ÿ‘‰Novel, photorealistic, 3D-aware facial generator: source code just released!

๐Ÿ˜ŽReview https://bit.ly/3BNrO2C
๐Ÿ˜ŽProject mrtornado24.github.io/IDE-3D/
๐Ÿ˜ŽCode github.com/MrTornado24/IDE-3D
๐Ÿ˜ŽPaper arxiv.org/pdf/2205.15517.pdf
๐Ÿคฏ8๐Ÿ‘5๐Ÿ”ฅ3๐Ÿคฉ3
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅDiffusion Model of Neural Checkpoints๐Ÿ”ฅ

๐Ÿ‘‰Conditional diffusion model on Millions of checkpoints of a given task/architecture ๐Ÿคฏ

๐Ÿ˜ŽReview https://bit.ly/3SBR4Qb
๐Ÿ˜ŽProject www.wpeebles.com/Gpt
๐Ÿ˜ŽCode github.com/wpeebles/G.pt
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.12892.pdf
๐Ÿคฏ5โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Semantic VISOR dataset is out! ๐Ÿ”ฅ

๐Ÿ‘‰Segmenting hands / active objects in egocentric video (millions masks)

๐Ÿ˜ŽReview https://bit.ly/3LOBLBv
๐Ÿ˜ŽProject epic-kitchens.github.io/VISOR/
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.13064.pdf
๐Ÿคฏ8๐Ÿ”ฅ4๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅ‡๐Ÿฅ‡ Olympic Games in 2028? ๐Ÿฅ‡๐Ÿฅ‡

๐Ÿ‘‰ In a few years, the fastest runner on earth will not be a human ๐Ÿฅถ

๐Ÿ˜ŽReview https://bit.ly/3Rme3O3
๐Ÿ˜ฑ8๐Ÿ‘3๐Ÿ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ SOTA ALERT: new Text-to-Video #AI ๐Ÿ”ฅ

๐Ÿ‘‰#META unveils a novel Text-to-Video (T2V) generation #AI

๐Ÿ˜ŽReview https://bit.ly/3E1ZDzG
๐Ÿ˜ŽProject https://makeavideo.studio/
๐Ÿ˜ŽPaper makeavideo.studio/Make-A-Video.pdf
๐Ÿคฏ9๐Ÿ‘6๐Ÿ˜ฑ1๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅDreamFusion: Text-to-3D via Diffusion๐Ÿ”ฅ

๐Ÿ‘‰DeepDream-like procedure to create #3D assets just from a given text

๐Ÿ˜ŽReview https://bit.ly/3BYY5nu
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.14988.pdf
๐Ÿ˜ŽProject dreamfusion3d.github.io/gallery.html
๐Ÿคฏ12๐Ÿ‘5๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงช Light Field Neural Rendering ๐Ÿงช

๐Ÿ‘‰Two-stage transformer capable of non-Lambertian effects (reflection, refraction, translucency)

๐Ÿ˜ŽReview https://bit.ly/3CpIFdm
๐Ÿ˜ŽPaper arxiv.org/pdf/2112.09687.pdf
๐Ÿ˜ŽProject light-field-neural-rendering.github.io
๐Ÿ˜ŽCode github.com/google-research/google-research/tree/master/light_field_neural_rendering
๐Ÿคฏ14๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฆฉPhenaki: Text-to(LOOONG)Video generation๐Ÿฆฉ

๐Ÿ‘‰Phenaki is an #AI capable of realistic long video synthesis, given a sequence of textual open prompts

๐Ÿ˜ŽReview https://bit.ly/3RwUvXx
๐Ÿ˜ŽProject phenaki.video/index.h
๐Ÿ˜ŽPaper openreview.net/pdf?id=vOEXS39nOF
๐Ÿ”ฅ7โค3๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ VToonify: Neural Portrait Style Transfer ๐Ÿ”ฅ

๐Ÿ‘‰VToonify for portrait style transfer. Powered by DualStyleGAN backbone, now with #stablediffusion!

๐Ÿ˜ŽReview https://bit.ly/3M9wgNP
๐Ÿ˜ŽDemo https://t.co/8gXzF3IrpB
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.11224.pdf
๐Ÿ˜ŽProject mmlab-ntu.com/project/vtoonify
๐Ÿ˜ŽCode github.com/williamyang1991/VToonify
๐Ÿ‘22โค3๐Ÿคฏ2๐Ÿ”ฅ1๐Ÿ‘1๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿข Stable Diffusion for #Pokemon ๐Ÿข

๐Ÿ‘‰Fine-tuning the stable diffusion to create a text-to-pokemon generation model

๐Ÿ˜ŽReview https://bit.ly/3C9qBTw
๐Ÿ˜ŽTutorial https://lambdalabs.com/blog/how-to-fine-tune-stable-diffusion-how-we-made-the-text-to-pokemon-model-at-lambda/
โค8๐Ÿ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Imagen Video by #Google. SICK! ๐Ÿ”ฅ

๐Ÿ‘‰Novel text-conditional video generation via cascade of video diffusion models ๐Ÿคฏ

๐Ÿ˜ŽReview https://bit.ly/3SH2TVH
๐Ÿ˜ŽProject imagen.research.google/video/
๐Ÿ˜ŽPaper imagen.research.google/video/paper.pdf
๐Ÿคฏ20๐Ÿ”ฅ7๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Human MDM: source code is out! ๐Ÿ”ฅ

๐Ÿ‘‰A classifier-free diffusion-based generative model for human motion domain

๐Ÿ˜ŽReview https://bit.ly/3rFhR2G
๐Ÿ˜ŽProject guytevet.github.io/mdm-page
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.14916.pdf
๐Ÿ˜ŽCode github.com/GuyTevet/motion-diffusion-model
๐Ÿ”ฅ6๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
โš›๏ธSOTA ALERT! Particles Tracking โš›๏ธ

๐Ÿ‘‰The new SOTA in video particles tracking. "Old school" taste, with neural flavor ๐Ÿงก

๐Ÿ˜ŽReview https://bit.ly/3CaU5Ai
๐Ÿ˜ŽProject particle-video-revisited.github.io/
๐Ÿ˜ŽPaper arxiv.org/pdf/2204.04153.pdf
๐Ÿ˜ŽCode github.com/aharley/pips
๐Ÿ‘7๐Ÿฅฐ4๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ #AIwithPapers: we are 4,500+! ๐Ÿ”ฅ

๐Ÿ’™๐Ÿ’› Someone put the smiling ๐Ÿ’ฉ under a few recent posts. But I still love you! ๐Ÿ’™๐Ÿ’›

๐Ÿ˜ˆ Invite your friends -> https://t.me/AI_DeepLearning
โค18๐Ÿ’ฉ7๐Ÿ”ฅ5๐Ÿ‘3๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‹ Long Video via Transformers ๐Ÿ‹

๐Ÿ‘‰TECO is a vector-quantized latent dynamics prediction for long video

๐Ÿ˜ŽReview https://bit.ly/3Ch0tWD
๐Ÿ˜ŽProject wilson1yan.github.io/teco/
๐Ÿ˜ŽPaper arxiv.org/pdf/2210.02396.pdf
๐Ÿ˜ŽCode github.com/wilson1yan/teco
๐Ÿ‘7
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅSIMPLI: ligh novel-view synthesis๐Ÿ”ฅ

๐Ÿ‘‰Lightweight novel-view synthesis by #Samsung for arbitrary forward-facing scenes

๐Ÿ˜ŽReview https://bit.ly/3CivSYZ
๐Ÿ˜ŽProject samsunglabs.github.io/MLI
๐Ÿ˜ŽCode github.com/SamsungLabs/MLI
๐Ÿ˜ŽPaper samsunglabs.github.io/MLI/paper/paper.pdf
๐Ÿ‘8