TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
https://github.com/NVIDIA/TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
https://github.com/NVIDIA/TensorRT-Model-Optimizer
GitHub
GitHub - NVIDIA/TensorRT-Model-Optimizer: A unified library of state-of-the-art model optimization techniques like quantization…
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment...
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
https://github.com/datalab-to/marker
Convert PDF to markdown + JSON quickly with high accuracy
https://github.com/datalab-to/marker
GitHub
GitHub - datalab-to/marker: Convert PDF to markdown + JSON quickly with high accuracy
Convert PDF to markdown + JSON quickly with high accuracy - datalab-to/marker
Jaxformer Scaling Modern Transformers
This is a zero-to-one guide on scaling modern transformers with n-dimensional parallelism. Transformers have driven much of the deep learning revolution, yet no practical guide reflects SOTA architectures and the complexities of large-scale language modelling. While excellent resources such as DeepMind’s How to Scale Your Model and HuggingFace’s Ultra Scale Playbook exist, a gap remains ...
https://jaxformer.com/
This is a zero-to-one guide on scaling modern transformers with n-dimensional parallelism. Transformers have driven much of the deep learning revolution, yet no practical guide reflects SOTA architectures and the complexities of large-scale language modelling. While excellent resources such as DeepMind’s How to Scale Your Model and HuggingFace’s Ultra Scale Playbook exist, a gap remains ...
https://jaxformer.com/
Jaxformer
JAXformer: Scaling Modern Transformers
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
Post-training 101
A hitchhiker's guide into LLM post-training.
https://tokens-for-thoughts.notion.site/post-training-101
A hitchhiker's guide into LLM post-training.
https://tokens-for-thoughts.notion.site/post-training-101
tokens-for-thoughts on Notion
Post-training 101 | Tokens for Thoughts
A hitchhiker's guide into LLM post-training, by Han Fang and Karthik A Sankararaman
Nvmath-Python: Nvidia Math Libraries for the Python Ecosystem
https://github.com/NVIDIA/nvmath-python
https://github.com/NVIDIA/nvmath-python
GitHub
GitHub - NVIDIA/nvmath-python: NVIDIA Math Libraries for the Python Ecosystem
NVIDIA Math Libraries for the Python Ecosystem. Contribute to NVIDIA/nvmath-python development by creating an account on GitHub.
Python 3.13 is 10% slower than 3.12 for my file parser
https://www.reddit.com/r/Python/comments/1nmuy7t/python_313_is_10_slower_than_312_for_my_file/
https://www.reddit.com/r/Python/comments/1nmuy7t/python_313_is_10_slower_than_312_for_my_file/
Reddit
From the Python community on Reddit: Python 3.13 is 10% slower than 3.12 for my file parser
Explore this post and more from the Python community
List of 87 Programming Ideas for Beginners (with Python implementations)
https://www.reddit.com/r/Python/comments/1nitzoz/list_of_87_programming_ideas_for_beginners_with/
https://www.reddit.com/r/Python/comments/1nitzoz/list_of_87_programming_ideas_for_beginners_with/
Reddit
From the Python community on Reddit
Explore this post and more from the Python community
Mini-o3
Scaling Up Reasoning Patterns and Interaction Turns for Visual Search.
https://mini-o3.github.io/
Scaling Up Reasoning Patterns and Interaction Turns for Visual Search.
https://mini-o3.github.io/
Sphinx Docs Instantly in Your Browser (MyST Markdown + reStructuredText)
Edit and preview reStructuredText or MyST Markdown instantly in a Sphinx running in a browser. Runs entirely in Python using WebAssembly, so it’s private, fast, and ideal for learning markup.
https://snippets.documatt.com
Edit and preview reStructuredText or MyST Markdown instantly in a Sphinx running in a browser. Runs entirely in Python using WebAssembly, so it’s private, fast, and ideal for learning markup.
https://snippets.documatt.com
Documatt
Sphinx reStucturedText and Markdown online preview and editor
Preview and edit reStructuredText or Markdown (MyST) documents online with Sphinx and Docutils without installing it.
Just for fun: animating a mosaic of 90s GIFs
The post describes an experiment in animating a mosaic of vintage 90s GIFs collected from the GeoCities archive, using HTML Canvas for random, lively playback. It celebrates the playful aesthetics of early web graphics and highlights the technical and nostalgic joy of reintroducing these classic GIFs into a modern browser setting.
https://alexplescan.com/posts/2025/09/15/gifs/
The post describes an experiment in animating a mosaic of vintage 90s GIFs collected from the GeoCities archive, using HTML Canvas for random, lively playback. It celebrates the playful aesthetics of early web graphics and highlights the technical and nostalgic joy of reintroducing these classic GIFs into a modern browser setting.
https://alexplescan.com/posts/2025/09/15/gifs/
Alex Plescan
Just for fun: animating a mosaic of 90s GIFs
How I built a scrolling GIF mosaic for Battle of the Tech Bands: p5.js/WebGL, CRT shader, perceptual hashing, and NSFW filtering on GeoCities classics
JiraTUI
A Textual User Interface for interacting with Atlassian Jira from your shell.
https://github.com/whyisdifficult/jiratui
A Textual User Interface for interacting with Atlassian Jira from your shell.
https://github.com/whyisdifficult/jiratui
GitHub
GitHub - whyisdifficult/jiratui: A Textual User Interface for interacting with Atlassian Jira from your shell
A Textual User Interface for interacting with Atlassian Jira from your shell - whyisdifficult/jiratui
Tiny LLM - LLM Serving in a Week
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
https://skyzh.github.io/tiny-llm/
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
https://skyzh.github.io/tiny-llm/
ApeRAG
Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s deployment
https://github.com/apecloud/ApeRAG
Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s deployment
https://github.com/apecloud/ApeRAG
GitHub
GitHub - apecloud/ApeRAG: ApeRAG: Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s…
ApeRAG: Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s deployment - apecloud/ApeRAG
Nallely – A Python signals/MIDI processing system inspired by Smalltalk
https://dr-schlange.github.io/nallely-midi/
https://dr-schlange.github.io/nallely-midi/
Nallely MIDI
Nallely MIDI · Nallely MIDI
Nallely is an experimental organic system for advanced MIDI patching, live coding, generative music, and multimodal art, built for hacker/musicians, developed in Python, inspired by Smalltalk and Systems as Living Things
Context Engineering - Short-Term Memory Management with Sessions from OpenAI Agents SDK
The guide demonstrates how to use the OpenAI Agents SDK’s Session object to manage short-term memory in AI agents, enabling context trimming and compression for efficient, coherent, and cost-effective multi-turn conversations. Effective session memory ensures agents maintain relevant history across turns while reducing noise, latency, and error risk in longer interactions.
https://cookbook.openai.com/examples/agents_sdk/session_memory
The guide demonstrates how to use the OpenAI Agents SDK’s Session object to manage short-term memory in AI agents, enabling context trimming and compression for efficient, coherent, and cost-effective multi-turn conversations. Effective session memory ensures agents maintain relevant history across turns while reducing noise, latency, and error risk in longer interactions.
https://cookbook.openai.com/examples/agents_sdk/session_memory
Openai
Context Engineering - Short-Term Memory Management with Sessions from OpenAI Agents SDK | OpenAI Cookbook
AI agents often operate in long-running, multi-turn interactions, where keeping the right balance of context is critical. If too much is...
Semlib
Build data processing and data analysis pipelines that leverage the power of LLMs.
https://github.com/anishathalye/semlib
Build data processing and data analysis pipelines that leverage the power of LLMs.
https://github.com/anishathalye/semlib
GitHub
GitHub - anishathalye/semlib: Build data processing and data analysis pipelines that leverage the power of LLMs 🧠
Build data processing and data analysis pipelines that leverage the power of LLMs 🧠 - anishathalye/semlib
Python Tutorial: Build an AI-assisted Reddit Scraping Pipeline
The video provides an in-depth, hands-on tutorial for building a resilient, AI-assisted Reddit scraping pipeline in Python, covering everything from Jupyter prototyping and LangChain agents to a Django-based background worker architecture. It teaches viewers to automate web scraping, integrate Google’s Gemini LLM for query refinement, and store structured results in PostgreSQL, suitable ...
https://www.youtube.com/watch?v=XI-iP-qk_Vk
The video provides an in-depth, hands-on tutorial for building a resilient, AI-assisted Reddit scraping pipeline in Python, covering everything from Jupyter prototyping and LangChain agents to a Django-based background worker architecture. It teaches viewers to automate web scraping, integrate Google’s Gemini LLM for query refinement, and store structured results in PostgreSQL, suitable ...
https://www.youtube.com/watch?v=XI-iP-qk_Vk
YouTube
Python Tutorial: Build an AI-assisted Reddit Scraping Pipeline
🚀 Sign up for Bright Data right now: https://brdta.com/cfe
Automatically find and track topics you care about across Reddit posts. From camping to the latest in AI news, this course will show you how to build a powerful and resilient system in Python.
…
Automatically find and track topics you care about across Reddit posts. From camping to the latest in AI news, this course will show you how to build a powerful and resilient system in Python.
…