IamPython
291 subscribers
148 photos
13 videos
8 files
195 links
This is Python based telegram group for web developers, Artificial intelligence, webscraping, Datascience, Data analysis, Ethical Hacking and more. You will learn lot insights and useful information
Download Telegram
Meet LAVIS - a one-stop library for language-and-vision research and applications!

πŸ”₯Github: https://github.com/salesforce/LAVIS

πŸ“œTech Report: arxiv.org/abs/2209.09019

LAVIS features
- Unified and modular interface to access 10+ tasks, 20+ datasets, 30+ pre-trained models!
πŸ¦‹πŸ¦‹All tools Data Engineers need! Categorized into cloud native (only available on that platform) and cloud agnostic (use anywhere) platforms & tools on the top. On the left you find categories and subcategories for the tools.

πŸ€πŸ€The goal for every engineer is to at least have knowledge of one tool in every category (row).

🐚🐚As example:

- If you are on Azure then learn when and how to use for at least one of the tools in every row of Azure
- Or go fully cloud agnostic and open source. It's your choice.
- You can also combine cloud agnostic with cloud platforms together by replacing the cloud native tools of one row with a cloud agnostic one.

πŸ€·β€β™‚οΈ that’s it man πŸ‘¨!!
πŸ”…πŸ”…πŸŒšMachine Learning types and algorithms which you must know based on their classification in supervised, unsupervised and reinforcement learning.
πŸ¦‹πŸ¦‹βœοΈβœοΈ

OpenAI trained and are open-sourcing a neural net called Whisper that approaches human level robustness and accuracy on English speech recognition.

Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web.

https://openai.com/blog/whisper/

Check out above link for paper, code and more details

πŸ‘½πŸ‘½
✍️✍️What is π‚πšπ₯π’π›π«πšπ­π’π¨π§ 𝐒𝐧 𝐌𝐚𝐜𝐑𝐒𝐧𝐞 π‹πžπšπ«π§π’π§π ?

πŸ€Calibration is the property that tells us how well the estimated probabilities of a model match the actual probabilities, a.k.a the observed frequency of occurrences.

πŸ€Calibration can be represented using the Brier score. The Brier score is nothing more than the MSE between the actual and the estimated probabilities.

πŸ€The two most common methods to address poor calibration is:
πŸ”‘platt scaling and
πŸ”‘isotonic regression
=======================================
✍️✍️Visual explanations of core machine learning concepts.
=======================================
πŸ€Nothing can beat the use of infographics and interactivity when explaining some concept,

πŸ€For Linear Regression
https://mlu-explain.github.io/linear-regression/

πŸ€For all,
https://mlu-explain.github.io/
How to select which types of statistical test on a given data ?

Source : MLCommunity Ln
🧬 The data structure for unstructured multimodal data · Neural Search · Vector Search · Document Store


For doc
https://docarray.jina.ai/

For GitHub
https://github.com/jina-ai/docarray
Transformers in Time Series: A Survey

A curated list of awesome resources (papers, code, data) on Transformers in Time Series categorized by tasks, including:

β€’ Forecasting
β€’ Anomaly detection
β€’ Classification

Transformers capture long-range dependencies and interactions.


abs: https://arxiv.org/abs/2202.07125
pdf: https://arxiv.org/pdf/2202.07125.pdf

Awesome list repo: https://github.com/qingsongedu/time-series-transformers-review
googlefinance

Python module to get stock data from Google Finance API. This module provides no delay, real time stock data in NYSE & NASDAQ.

$pip install googlefinance

https://github.com/hongtaocai/googlefinance
πŸ“£πŸ“£Django Rest Framework latest version was out. No more compatible with Django 2.2
Pandas 1.5 Released bit.ly/3DQdRn7
Monday, 26 September 2022
Latest AI Curated Track

✍️✍️A third of scientists working on AI say it could cause global disaster. A survey covering the opinions of 327 researchers who had recently co-authored papers on AI research in natural language processing.
✍️✍️Tesla’s AI Day 2022 is scheduled for September 30 in Palo Alto, California.
✍️✍️AI will help phone photos surpass the DSLR, says Qualcomm
✍️✍️Over the past few weeks, researchers at Google have demoed an AI system, PaLI, that can perform many tasks in over 100 languages
✍️✍️A Berlin-based group launched a project called Source+ that's designed as a way of allowing artists, including visual artists, musicians and writers, to opt into and out of allowing their work being used as training data for AI.
✍️✍️According to International Data Corp, China will invest US$26.7 billion in artificial intelligence in 2026. With regards A total of 45,000 AI-related patent applications were filed in Shanghai
✍️✍️Saudi Arabia focuses on AI-driven economy, considers data the new oil: SDAIA
✍️✍️An artist based in New York City has been granted the first known registered copyright for artwork made using latent diffusion AI.
✍️✍️Salesforce AI Open-Sources β€˜LAVIS,’ A Deep Learning Library For Language-Vision Research/Applications.
✍️✍️Harvard celebrated the launch of the Kempner Institute for the Study of Natural and Artificial Intelligence on Thursday
✍️✍️Cleareye AI Announces Strategic Alliance with J.P. Morgan
✍️✍️AI proves to be more accurate in diagnosing cardiac function than sonographers
✍️✍️Amazon SageMaker Provides New Built-in TensorFlow Image Classification Algorithms available in tensor flow hub.
✍️✍️ Google today released TensorFlow Graph Neural Networks (TF-GNN) in alpha, a library designed to make it easier to work with graph structured data using TensorFlow, its machine learning framework.

Inscribed by,
Raja
AWS Question: A customer has a workload that will run for total of 6 months and can withstand interruptions. What would be the most cost-efficient Amazon EC2 instance purchasing option?
Anonymous Quiz
56%
On-Demand Instance
22%
Spot instance
0%
Dedicated Instance
22%
Reserved Instance
πŸ“£ Meta AI:

πŸ«₯ Make-a-video: The text-to-video generation without text-video data

πŸ«₯ Paper: https://makeavideo.studio/Make-A-Video.pdf

πŸ«₯ Project page: makeavideo.studio

An effective method that extends a diffusion-based T2I model to T2V through a spatiotemporally factorized diffusion model.
πŸ“£πŸ“£An AI used medical notes to teach itself to spot disease on chest x-rays


πŸ«₯πŸ«₯A team of researchers from Harvard Medical School trained the CheXzero model on a publicly available data set of more than 377,000 chest x-rays and more than 227,000 corresponding clinical reports. This taught it to associate certain types of images with their existing notes, rather than learning from structured data that had been manually labeled for the task. 

Paper:
https://www.nature.com/articles/s41551-022-00936-9

News link:

https://www-technologyreview-com.cdn.ampproject.org/c/s/www.technologyreview.com/2022/09/15/1059541/ai-medical-notes-teach-itself-spot-disease-chest-x-rays/amp/
Channel name was changed to Β«IamPythonΒ»
πŸ“£CLIP: The Most Influential AI Model From OpenAI β€” And How To Use It

πŸ€ CLIP
stands for Constastive Language-Image Pretraining

πŸ€ CLIP is an open source, multi-modal, zero-shot model.

πŸ€ Given an image and text descriptions, the model can predict the most relevant text description for that image, without optimizing for a particular task.


πŸ€CLIP is trained using a staggering amount of 400 million image-text pairs. For comparison, the ImageNet dataset contains 1.2 million images.

πŸ€The final tuned CLIP model was trained on 256 V100 GPUs for two weeks. For an on-demand training on AWS Sagemaker, this would cost at least 200k dollars!

πŸ€ The model uses a minibatch of 32,768 images for training.