Spark in me – Telegram

Spark in me

2.2K subscribers

831 photos

48 videos

116 files

2.68K links

Lost like tears in rain. DS, ML, a bit of philosophy and math. No bs or ads.


Digest 2022-02

# Code

Creating Beautiful Tracebacks with Python's Exception Hooks - https://martinheinz.dev/blog/66
Finding leaked secrets in your Docker image with a scanner - https://pythonspeed.com/articles/docker-secret-scanner/
Include diagrams in your Markdown files with Mermaid - https://github.blog/2022-02-14-include-diagrams-markdown-files-mermaid/
Why I started using Python type annotations – and why you should too - https://florimond.dev/en/posts/2018/07/why-i-started-using-python-type-annotations-and-why-you-should-too/

#digest

martinheinz.dev

Creating Beautiful Tracebacks with Python's Exception Hooks

We all spend a good chunk of our time debugging, sifting through logs or reading tracebacks. Each of these can be difficult and time-consuming and in t...

1.3K views · Alexander, 16:46

Digest 2022-03

# ML

The Gradient Update features our article - https://thegradientpub.substack.com/p/gradient-update-19-woes-of-the-irs?s=r
Constrained Reweighting for Training Deep Neural Nets with Noisy Labels - https://ai.googleblog.com/2022/02/constrained-reweighting-for-training.html
4D-Net: Learning Multi-Modal Alignment for 3D and Image Inputs in Time - https://ai.googleblog.com/2022/02/4d-net-learning-multi-modal-alignment.html
ProteInfer: deep networks for protein functional inference - https://google-research.github.io/proteinfer/
Co-training Transformer with Videos and Images Improves Action Recognition - https://ai.googleblog.com/2022/03/co-training-transformer-with-videos-and.html
Machine Learning's Most Useful Multitool: Embeddings - https://daleonai.com/embeddings-explained
Microsoft Translator enhanced with Z-code Mixture of Experts models - https://www.microsoft.com/en-us/research/blog/microsoft-translator-enhanced-with-z-code-mixture-of-experts-models/
Accelerating Ukraine Intelligence Analysis with Computer Vision on Synthetic Aperture Radar Imagery - https://bair.berkeley.edu/blog/2022/03/21/ukraine-sar-maers/
Auto-generated Summaries in Google Docs - https://ai.googleblog.com/2022/03/auto-generated-summaries-in-google-docs.html
NVIDIA Research Turns 2D Photos Into 3D Scenes in the Blink of an AI - https://blogs.nvidia.com/blog/2022/03/25/instant-nerf-research-3d-ai/?ncid=so-twit-802781-vt37#cid=gtcs22_so-twit_en-us
Instant Neural Graphics Primitives - https://nvlabs.github.io/instant-ngp/

#digest

Gradient Update #19: Woes of the IRS, Deep RL in Nuclear Fusion

In which we cover non-facial recognition options for the IRS to verify identity and DeepMind's recent application of deep reinforcement learning to nuclear fusion.

1.1K views · Alexander, 07:10

Digest 2022-03

# Hardware

Infocast - https://www.youtube.com/watch?v=FlewsuI2IRY
VICARIOUS MWC – THE POWER OF A SINGLE HYPHEN - https://digitstodollars.com/2022/03/02/vicarious-mwc-the-power-of-a-single-hyphen/
Using 176-Layer NAND for High-Capacity Data Center SSDs - https://thessdguy.com/using-176-layer-nand-for-high-capacity-data-center-ssds/
THE SHRINKING OF ARM - https://digitstodollars.com/2022/03/16/the-shrinking-of-arm/
WAVE OF COMPUTATION - https://digitstodollars.com/2022/03/24/wave-of-computation/
https://s22.q4cdn.com/364334381/files/doc_presentations/2022/NVIDIA-Investor-Day-2022-Presentation.pdf

#digest

InfoCAST #053 | Hardware availability and other news for March

The main IT news for March 2022.
0:00 Intro
0:26 Supply problems and sanctions
0:43 What is actually not being shipped, and what is still arriving in Russia?
1:20 Duty-free import limit raised to 1,000 euros
1:42 On purchases outside…

987 views · Alexander, 07:12

Digest 2022-03

# Blogs

It's now your fault they don't know about it - https://rachelbythebay.com/w/2022/03/02/wrong/
Illegible Medicis And Hunting For Outliers - https://www.strangeloopcanon.com/p/illegible-medicis-and-hunting-for?s=r
A sysadmin's rant about feed readers and crawlers - https://rachelbythebay.com/w/2022/03/07/get/
Questions about construction trucks on train tracks - https://rachelbythebay.com/w/2022/03/10/caltrain/
Push and pull: when and why to update your dependencies - https://pythonspeed.com/articles/when-update-dependencies/
A Study Of Fame - https://www.strangeloopcanon.com/p/a-study-of-fame?s=r
The Sunny Side of Firing Someone - https://madned.substack.com/p/the-sunny-side-of-firing-someone?s=r
Russia in Ukraine: Let Loose the Dogs of War! - https://aswathdamodaran.blogspot.com/2022/03/russia-and-ukraine-let-loose-dogs-of-war.html

#digest

Strange Loop Canon

Illegible Medicis And Hunting For Outliers

On black swan farming, VCs vs grant-making, the spectrum of illegibility and part III on our new institutional order

1.1K views · Alexander, 07:13

Digest 2022-03

# Code

Optimizing Memory Usage in Python Applications - https://martinheinz.dev/blog/68
Processing large JSON files in Python without running out of memory - https://pythonspeed.com/articles/json-memory-streaming/
Image rebase and improved remote cache support in new BuildKit - https://www.docker.com/blog/image-rebase-and-improved-remote-cache-support-in-new-buildkit/
Cron best practices - https://blog.sanctum.geek.nz/cron-best-practices/
Please stop writing shell scripts - https://pythonspeed.com/articles/shell-scripts/
The best Docker base image for your Python application (August 2021) - https://pythonspeed.com/articles/base-image-python-docker-images/

#digest

Processing large JSON files in Python without running out of memory

Loading complete JSON files into Python can use too much memory, leading to slowness or crashes. The solution: process JSON data one chunk at a time.
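
A minimal sketch of the chunk-at-a-time idea from the preview above, assuming the data is stored as newline-delimited JSON (one record per line). That layout is an assumption made for illustration, and the linked article may use a different technique. The function names and the sample data are hypothetical.

```python
import io
import json
from typing import Iterator, TextIO


def iter_json_lines(stream: TextIO) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of the stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def total_field(stream: TextIO, field: str) -> float:
    """Sum a numeric field across all records without building a list."""
    return sum(record[field] for record in iter_json_lines(stream))


# Hypothetical in-memory data standing in for a large file on disk.
sample = io.StringIO('{"id": 1, "size": 10}\n{"id": 2, "size": 32}\n')
print(total_field(sample, "size"))
```

Because the generator hands records to `sum` one at a time, only the current line and its parsed record are held in memory, rather than the whole file.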

1.5K views · Alexander, 07:13

Digest 2022-04

📌 ML

Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance - https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html

Detecting Signs of Disease from External Images of the Eye - https://ai.googleblog.com/2022/03/detecting-signs-of-disease-from.html

Reproducibility in Deep Learning and Smooth Activations - https://ai.googleblog.com/2022/04/reproducibility-in-deep-learning-and.html

VDTTS: Visually-Driven Text-To-Speech - https://ai.googleblog.com/2022/04/vdtts-visually-driven-text-to-speech.html

Discovering the systematic errors made by machine learning models - https://ai.stanford.edu/blog/domino/

Understanding BLEU Scores in Customized Machine Translation - https://blog.taus.net/understanding-bleu-scores-in-customized-machine-translation

Locked-Image Tuning: Adding Language Understanding to Image Models - https://ai.googleblog.com/2022/04/locked-image-tuning-adding-language.html

FormNet: Beyond Sequential Modeling for Form-Based Document Understanding - https://ai.googleblog.com/2022/04/formnet-beyond-sequential-modeling-for.html

Compact word vectors with Bloom embeddings - https://explosion.ai/blog/bloom-embeddings

Nobody wants your fancy algorithm - https://joemorrison.substack.com/p/nobody-wants-your-fancy-algorithm?s=r

Why Dark and Light is Complicated in Photographs - https://aaronhertzmann.com/2022/03/10/photographic-tone.html

#digest

research.google

Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrou

Posted by Sharan Narang and Aakanksha Chowdhery, Software Engineers, Google Research In recent years, large neural networks trained for language un...

1.1K views · Alexander, 05:02

Digest 2022-04

📌 Blogs

Horrible edge cases to consider when dealing with music - https://dustri.org/b/horrible-edge-cases-to-consider-when-dealing-with-music.html

Old bittorrent alternatives - https://habr.com/ru/post/318400/

How to lie with statistics - https://habr.com/ru/post/660269/

How we hacked a scooter-sharing service - https://habr.com/ru/post/660575/

TV, merchant media and the unbundling of advertising - https://www.ben-evans.com/benedictevans/2022/3/18/unbundling-advertising

Goodbye, Google Analytics - Why and How You Should Leave The Platform - https://martinheinz.dev/blog/71

How we lost 54k GitHub stars - https://httpie.io/blog/stardust

Python performance speedups in 3.11 - https://habr.com/ru/post/662087/

The Problem With Experts - https://www.strangeloopcanon.com/p/the-problem-with-experts

Netflix is not a tech company - https://www.ben-evans.com/benedictevans/2019/7/31/Netflix

Content isn't king - https://www.ben-evans.com/benedictevans/2017/7/13/content-isnt-king

#digest

Horrible edge cases to consider when dealing with music

Personal blog of Julien (jvoisin) Voisin

1.0K views · Alexander, 05:03

Digest 2022-04

📌 Hardware

Replacing Tape with Flash - https://thessdguy.com/replacing-tape-with-flash/

HARD OR SOFT? - https://digitstodollars.com/2022/04/01/hard-or-soft/

HARD VS. SOFT – WITH MATH - https://digitstodollars.com/2022/04/07/hard-vs-soft-with-math/

China's import substitution successes: gaming graphics cards and more developed from scratch in the PRC - https://habr.com/ru/company/selectel/blog/653807/

Connecting computers over a VPN and a personal cloud on a VPS server - https://pc-01.tech/vpn-oblako/

Is Google Spying on your Conversations? - https://petewarden.com/2022/04/11/is-google-spying-on-your-conversations/

Foreign hosting providers that accept payment from Russia - https://habr.com/ru/post/657639/

WHAT IS GOING ON IN THE SEMIS SUPPLY CHAIN? - https://digitstodollars.com/2022/04/14/what-is-going-on-in-the-semis-supply-chain/

WHO SHOULD ROLL THEIR OWN CHIP? - https://digitstodollars.com/2022/04/15/who-should-roll-their-own-chip/

Porting a neural network from PyTorch to Google Coral - https://habr.com/ru/company/kryptonite/blog/660505/

MAKING ALL THE CHIPS - https://digitstodollars.com/2022/04/19/making-all-the-chips/

BENCHMARKING ARM IN THE DATA CENTER - https://digitstodollars.com/2022/04/21/benchmarking-arm-in-the-data-center/

Why GPUs lie about their load and how to deal with it - https://habr.com/ru/company/yandex/blog/661989/

KEEPING UP WITH GOOGLE SEMICONDUCTOR - https://digitstodollars.com/2022/04/26/keeping-up-with-google-semiconductor/

#digest

Digits to Dollars

The risk profiles for venture investing in hardware and software are of course very different, but the market is shifting, making hardware investing much more appealing.

1.3K views · Alexander, edited 05:04

Digest 2022-04

📌 Code

String algorithms in practice. Part 1: the Knuth-Morris-Pratt algorithm - https://habr.com/ru/post/658779/

Speeding up software with faster hardware: tradeoffs and alternatives - https://pythonspeed.com/articles/fixing-performance-with-hardware/

Python f-strings Are More Powerful Than You Might Think - https://martinheinz.dev/blog/70

Yandex has open-sourced YDB - https://habr.com/ru/company/yandex/blog/660271/

When Python can’t thread: a deep-dive into the GIL’s impact - https://pythonspeed.com/articles/python-gil/

A paginated iterator in Python - https://antonz.ru/python-plus-one/

#digest

String algorithms in practice. Part 1: the Knuth-Morris-Pratt algorithm

A few days ago I started reading a book on string processing and, literally from the first pages, sipping my tea, I was amazed that in five years of working as a programmer I had looked at strings only as...

1.6K views · Alexander, 05:05

Digest 2022-05

📌 ML

Deep Learning in Neuroimaging - https://thegradient.pub/the-role-of-deep-learning-in-understanding-neuroimaging-data/

Alpa: Automated Model-Parallel Deep Learning - https://ai.googleblog.com/2022/05/alpa-automated-model-parallel-deep.html

Rethinking Human-in-the-Loop for Artificial Augmented Intelligence - https://bair.berkeley.edu/blog/2022/05/03/human-in-the-loop/

How Should you Protect your Machine Learning Models and IP? - https://petewarden.com/2022/05/08/how-should-you-protect-your-machine-learning-models-and-ip/

Hiding a photo inside another photo - https://www.avestura.dev/blog/hide-a-photo-inside-another-photo

Unlocking Zero-Resource Machine Translation to Support New Languages in Google Translate - https://ai.googleblog.com/2022/05/24-new-languages-google-translate.html

Baidu and Pony.ai become first robotaxi services to operate without safety drivers in Beijing - https://www.theverge.com/2022/4/30/23050493/baidu-pony-ai-first-robotaxi-services-operate-without-safety-drivers-beijing-china

Tackling multiple tasks with a single visual language model - https://www.deepmind.com/blog/tackling-multiple-tasks-with-a-single-visual-language-model

Lessons From Deploying Deep Learning To Production (it's all about feedback loops) - https://thegradient.pub/lessons-from-deploying-deep-learning-to-production/

OPT: Open Pre-trained Transformer Language Models - http://arxiv.org/abs/2205.01068
- Talk about gatekeeping: access will be granted to academic researchers; those affiliated with organizations in government, civil society, and academia; and those in industry research laboratories
- OPT-175B was trained on 992 80GB A100 GPUs (1/7th the carbon footprint of GPT-3)

WHO WILL END UP HOLDING THE SEMIS BAG? - https://digitstodollars.com/2022/05/18/who-will-end-up-holding-the-semis-bag/

Image-Text Pre-training with Contrastive Captioners - https://ai.googleblog.com/2022/05/image-text-pre-training-with.html

The Future of Interactive Media — Pipelining StyleGAN3 for Production - https://medium.com/codex/the-future-of-interactive-media-pipelining-stylegan3-for-production-636c080db2f4

(De)ToxiGen: Leveraging large language models to build more robust hate speech detection tools - https://www.microsoft.com/en-us/research/blog/detoxigen-leveraging-large-language-models-to-build-more-robust-hate-speech-detection-tools/

Partnering people with large language models to find and fix bugs in NLP systems - https://www.microsoft.com/en-us/research/blog/partnering-people-with-large-language-models-to-find-and-fix-bugs-in-nlp-systems/

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion - https://starganv2-vc.github.io/

#digest

Deep Learning in Neuroimaging

An introduction to unique aspects of neuroimaging data and how we can leverage these aspects with deep learning algorithms.

833 views · Alexander, edited 10:21