Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer
https://developer.nvidia.com/blog/model-quantization-post-training-quantization-using-nvidia-model-optimizer/
https://developer.nvidia.com/blog/model-quantization-post-training-quantization-using-nvidia-model-optimizer/
NVIDIA Technical Blog
Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer
Model quantization is an effective method to reduce VRAM usage and improve inference performance on consumer devices such as NVIDIA GeForce RTX GPUs. By lowering computational and memory requirements…
👍1
Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling
https://developer.nvidia.com/blog/achieving-peak-system-and-workload-efficiency-on-nvidia-gb200-nvl72-with-slurm-block-scheduling/
https://developer.nvidia.com/blog/achieving-peak-system-and-workload-efficiency-on-nvidia-gb200-nvl72-with-slurm-block-scheduling/
NVIDIA Technical Blog
Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling
NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. This design enables exascale performance, but it also changes…
👍1
Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo
https://developer.nvidia.com/blog/streaming-tokens-and-tools-multi-turn-agentic-harness-support-in-nvidia-dynamo/
https://developer.nvidia.com/blog/streaming-tokens-and-tools-multi-turn-agentic-harness-support-in-nvidia-dynamo/
NVIDIA Technical Blog
Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo
An agentic exchange must preserve a structured interaction: assistant turns interleave reasoning with one or more tool calls, and subsequent user turns return the corresponding tool results to the…
👍2
Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding
https://developer.nvidia.com/blog/improving-bash-generation-in-small-language-models-with-grammar-constrained-decoding/
https://developer.nvidia.com/blog/improving-bash-generation-in-small-language-models-with-grammar-constrained-decoding/
NVIDIA Technical Blog
Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding
Bash is one of the most flexible and powerful interfaces exposed to AI agents. In the right system, a model that emits , , , or a shell pipeline is producing an executable action that can read files…
👍1
‘Your Career Starts at the Beginning of the AI Revolution,’ NVIDIA CEO Tells Graduates
https://blogs.nvidia.com/blog/nvidia-ceo-carnegie-mellon-commencement-address/
https://blogs.nvidia.com/blog/nvidia-ceo-carnegie-mellon-commencement-address/
NVIDIA Blog
‘Your Career Starts at the Beginning of the AI Revolution,’ NVIDIA CEO Tells Graduates
Delivering the commencement address to Carnegie Mellon University’s Class of 2026, NVIDIA founder and CEO Jensen Huang said, ‘I cannot imagine a more exciting time to begin your life’s work.’
👍4
Introducing NVIDIA Fleet Intelligence for Real-Time GPU Fleet Visibility and Optimization
https://developer.nvidia.com/blog/introducing-nvidia-fleet-intelligence-for-real-time-gpu-fleet-visibility-and-optimization/
https://developer.nvidia.com/blog/introducing-nvidia-fleet-intelligence-for-real-time-gpu-fleet-visibility-and-optimization/
NVIDIA Technical Blog
Introducing NVIDIA Fleet Intelligence for Real-Time GPU Fleet Visibility and Optimization
The compute capability of large GPU fleets presents unprecedented opportunities to innovate and provide value to customers in record time. Yet these advancements come with a variety of challenges.
👍2
NVIDIA and SAP Bring Trust to Specialized Agents
https://blogs.nvidia.com/blog/sap-specialized-agents/
https://blogs.nvidia.com/blog/sap-specialized-agents/
NVIDIA Blog
NVIDIA and SAP Bring Trust to Specialized Agents
Announced today at SAP Sapphire — where NVIDIA founder and CEO Jensen Huang joined SAP CEO Christian Klein’s keynote by video — SAP and NVIDIA’s expanded collaboration helps enterprises run specialized agents with security and governance controls.
👍2
How to Eliminate Pipeline Friction in AI Model Serving
https://developer.nvidia.com/blog/how-to-eliminate-pipeline-friction-in-ai-model-serving/
https://developer.nvidia.com/blog/how-to-eliminate-pipeline-friction-in-ai-model-serving/
NVIDIA Technical Blog
How to Eliminate Pipeline Friction in AI Model Serving
The path from a trained AI model to production should be smooth, but rarely is. Many teams invest weeks fine-tuning models, only to discover that exporting to a deployment format breaks layers…
Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark
https://blogs.nvidia.com/blog/rtx-ai-garage-hermes-agent-dgx-spark/
https://blogs.nvidia.com/blog/rtx-ai-garage-hermes-agent-dgx-spark/
NVIDIA Blog
Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark
Reliable, self-evolving and powered by the newest agentic large language models, Hermes brings a new class of agents to NVIDIA RTX PCs and workstations.
👍2
NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure
https://blogs.nvidia.com/blog/ineffable-intelligence-reinforcement-learning-infrastructure/
https://blogs.nvidia.com/blog/ineffable-intelligence-reinforcement-learning-infrastructure/
NVIDIA Blog
NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure
Together, NVIDIA and Ineffable Intelligence are building the reinforcement learning infrastructure that unlocks new levels of intelligence.
👍2
Transform Video Into Instantly Searchable, Actionable Intelligence with AI Agents and Skills
https://developer.nvidia.com/blog/transform-video-into-instantly-searchable-actionable-intelligence-with-ai-agents-and-skills/
https://developer.nvidia.com/blog/transform-video-into-instantly-searchable-actionable-intelligence-with-ai-agents-and-skills/
👍1
Accelerated X-Ray Analysis for Nanoscale Imaging (XANI) of Novel Materials
https://developer.nvidia.com/blog/accelerated-x-ray-analysis-for-nanoscale-imaging-xani-of-novel-materials/
https://developer.nvidia.com/blog/accelerated-x-ray-analysis-for-nanoscale-imaging-xani-of-novel-materials/
NVIDIA Technical Blog
Accelerated X-Ray Analysis for Nanoscale Imaging (XANI) of Novel Materials
A massive-scale X-ray free-electron laser (XFEL) enables tracking structural and electron dynamics in novel systems, including fusion materials, semiconductors, batteries, and catalysis.
Sea You in the Cloud: ‘Subnautica 2’ Early Access Dives Onto GeForce NOW
https://blogs.nvidia.com/blog/geforce-now-thursday-subnautica-2/
https://blogs.nvidia.com/blog/geforce-now-thursday-subnautica-2/
NVIDIA Blog
Sea You in the Cloud: ‘Subnautica 2’ Early Access Dives Onto GeForce NOW
Plunge into 'Subnautica 2' on GeForce NOW, leading 11 new games; plus catch a new reward for members and early access for 'Forza Horizon 6'.
How the NVIDIA Vera Rubin Platform is Solving Agentic AI’s Scale-Up Problem
https://developer.nvidia.com/blog/how-the-nvidia-vera-rubin-platform-is-solving-agentic-ais-scale-up-problem/
https://developer.nvidia.com/blog/how-the-nvidia-vera-rubin-platform-is-solving-agentic-ais-scale-up-problem/
NVIDIA Technical Blog
How the NVIDIA Vera Rubin Platform is Solving Agentic AI’s Scale-Up Problem
Agentic inference has fundamentally changed the runtime dynamics of inference workloads by introducing non-deterministic trajectories—actions, observations, and decisions that an AI agent produces…
👍6
Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs
https://blogs.nvidia.com/blog/vera-cpu-delivery/
https://blogs.nvidia.com/blog/vera-cpu-delivery/
NVIDIA Blog
Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs
Ian Buck hand-delivers the first NVIDIA Vera CPU systems to Anthropic, OpenAI, Oracle Cloud Infrastructure and SpaceXAI — marking the moment agentic CPUs move from announcement to production.
👍1
NVIDIA CEO Jensen Huang at Dell Technologies World: “Demand Is Going Parabolic, Utterly Parabolic”
https://blogs.nvidia.com/blog/dell-technologies-agent-enterprise-ai/
https://blogs.nvidia.com/blog/dell-technologies-agent-enterprise-ai/
NVIDIA Blog
NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’
NVIDIA CEO Jensen Huang joined Dell CEO Michael Dell on stage Monday to unveil the latest updates to the Dell AI Factory with NVIDIA — delivering a full-stack platform for autonomous agents, from deskside workstations to data center racks.
👍2
NVIDIA and Google Cloud Empower the Next Wave of AI Builders
https://blogs.nvidia.com/blog/google-cloud-developer-community-ai-builders/
https://blogs.nvidia.com/blog/google-cloud-developer-community-ai-builders/
NVIDIA Blog
NVIDIA and Google Cloud Empower the Next Wave of AI Builders
Over 100,000 developers have joined the companies’ joint developer community, tapping into NVIDIA and Google Cloud technologies, learning paths and hands-on labs to build what’s next in AI.
Mastering Agentic Techniques: AI Agent Evaluation
https://developer.nvidia.com/blog/mastering-agentic-techniques-ai-agent-evaluation/
https://developer.nvidia.com/blog/mastering-agentic-techniques-ai-agent-evaluation/
NVIDIA Technical Blog
Mastering Agentic Techniques: AI Agent Evaluation
Evaluating an AI model and evaluating an AI agent are related—but they answer fundamentally different questions. A model benchmark tests the capability of a foundation model (how well it understands…
👍2
NVIDIA-Verified Agent Skills Provide Capability Governance for AI Agents
https://developer.nvidia.com/blog/nvidia-verified-agent-skills-provide-capability-governance-for-ai-agents/
https://developer.nvidia.com/blog/nvidia-verified-agent-skills-provide-capability-governance-for-ai-agents/
NVIDIA Technical Blog
NVIDIA-Verified Agent Skills Provide Capability Governance for AI Agents
Autonomous AI agents are becoming more capable. Open models, Model Context Protocol (MCP)-connected tools, and portable skills are also making agents easier to extend. But scaling agent use with…
👍4
Add a Specialized Deep Research Skill to Agent Harnesses
https://developer.nvidia.com/blog/add-a-specialized-deep-research-skill-to-agent-harnesses/
https://developer.nvidia.com/blog/add-a-specialized-deep-research-skill-to-agent-harnesses/
NVIDIA Technical Blog
Add a Specialized Deep Research Skill to Agent Harnesses
Agent harnesses like Claude Code, Codex, and LangChain Deep Agents are excellent orchestrators. They manage sessions, chain tools, execute code, and respond to developer intent.
👍5
Mastering Agentic Techniques: AI Agent Customization
https://developer.nvidia.com/blog/mastering-agentic-techniques-ai-agent-customization/
https://developer.nvidia.com/blog/mastering-agentic-techniques-ai-agent-customization/
NVIDIA Technical Blog
Mastering Agentic Techniques: AI Agent Customization
Autonomous AI agents are taking on all types of work for businesses: routing logistics fleets, triaging support tickets, generating code, and orchestrating multistep workflows. How do you take a…
👍4