Home DEVELOPER
  • Home
  • Blog
  • Forums
  • Docs
  • Downloads
  • Training
  • Join
Computer Vision / Video Analytics

AI Vision Helps Green Recycling Plants

Read now
AI Vision Helps Green Recycling Plants
Generative AI

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

Read now
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Generative AI

Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage

Read now
Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
Robotics

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

Read now
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
Data Center / Cloud

Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization

Read now
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
  • Computer Vision / Video Analytics
    AI Vision Helps Green Recycling Plants
  • Generative AI
    NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
  • Generative AI
    Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
  • Robotics
    NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
  • Data Center / Cloud
    Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization

Recent

See all
Dec 20, 2024

Accelerating GPU Analytics Using RAPIDS and Ray

RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
4 MIN READ
Accelerating GPU Analytics Using RAPIDS and Ray
A surgeon using a medical device in an operating room.
Dec 20, 2024

Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices

Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Dec 20, 2024

NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows

Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...
8 MIN READ
NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows
Dec 20, 2024

Just Released: GPU Zen 3: Advanced Rendering Techniques

Grab your copy of GPU Zen 3 to lean about the latest in real-time rendering, including NVIDIA contributions to Cyberpunk 2077.
1 MIN READ
Just Released: GPU Zen 3: Advanced Rendering Techniques
Picture of the NVIDIA H200 NVL GPU on a black background.
Dec 20, 2024

Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU

Computational fluid dynamics (CFD) is used in industry and academia to address a wide range of use cases, including external aerodynamics, internal flows, heat...
5 MIN READ
Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU
Dec 19, 2024

New Whitepaper: NVIDIA AI Enterprise Security

This white paper details our commitment to securing the NVIDIA AI Enterprise software stack. It outlines the processes and measures NVIDIA takes to ensure...
1 MIN READ
New Whitepaper: NVIDIA AI Enterprise Security
Dec 19, 2024

Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models

Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
11 MIN READ
Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models
Dec 19, 2024

RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs

RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...
8 MIN READ
RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs
Images on a conveyor belt identifed with computer vision.
Dec 19, 2024

AI Vision Helps Green Recycling Plants

Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
AI Vision Helps Green Recycling Plants
Post-visualization still from Mad Max: Furiosa. A close-up view of a desert chase scene after a disaster. The scene has modified vehicles, including a big tanker truck, a crane-like vehicle, motorbikes, and a pickup truck driving fast across a dusty, reddish-brown road under a dramatic, cloudy sky.
Dec 19, 2024

Accelerating Film Production with Dell AI Factory and NVIDIA

Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Accelerating Film Production with Dell AI Factory and NVIDIA
Dec 19, 2024

Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS

Risk and uncertainty inherent in energy exploration include unknown geological parameters, variations in fluid and rock properties, boundary conditions, and...
8 MIN READ
Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS
Dec 18, 2024

Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost

XGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...
10 MIN READ
Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost

Inference Performance

See all
Dec 18, 2024

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

Recurrent drafting (referred as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)...
6 MIN READ
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Dec 17, 2024

Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Dec 05, 2024

Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack

The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 19, 2024

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Nov 15, 2024

Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill

In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
NVIDIA H100.
Nov 08, 2024

5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse

In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
Image of an HGX H200
Nov 01, 2024

3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot

Deploying generative AI workloads in production environments where user numbers can fluctuate from hundreds to hundreds of thousands – and where input...
5 MIN READ
3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot
Oct 28, 2024

NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models

Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models
Oct 09, 2024

NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency

NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
Oct 09, 2024

Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch

The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of...
8 MIN READ
Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch

Generative AI

See all
A surgeon using a medical device in an operating room.
Dec 20, 2024

Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices

Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Dec 19, 2024

Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models

Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
11 MIN READ
Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models
Post-visualization still from Mad Max: Furiosa. A close-up view of a desert chase scene after a disaster. The scene has modified vehicles, including a big tanker truck, a crane-like vehicle, motorbikes, and a pickup truck driving fast across a dusty, reddish-brown road under a dramatic, cloudy sky.
Dec 19, 2024

Accelerating Film Production with Dell AI Factory and NVIDIA

Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Accelerating Film Production with Dell AI Factory and NVIDIA
Dec 18, 2024

A Guide to Retrieval-Augmented Generation for AEC

Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation,...
12 MIN READ
A Guide to Retrieval-Augmented Generation for AEC
Dec 18, 2024

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

Recurrent drafting (referred as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)...
6 MIN READ
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Icon image of a chart and search symbol, on a purple background.
Dec 17, 2024

Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner

Knowledge distillation is an approach for transferring the knowledge of a much larger teacher model to a smaller student model, ideally yielding a compact,...
5 MIN READ
Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner
Image of a photorealistic digital human looking at the camera.
Dec 17, 2024

Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models

NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their...
4 MIN READ
Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models
Dec 17, 2024

Fine-Tuning Small Language Models to Optimize Code Review Accuracy

Generative AI is transforming enterprises by driving innovation and boosting efficiency across numerous applications. However, adopting large foundational...
15 MIN READ
Fine-Tuning Small Language Models to Optimize Code Review Accuracy
Dec 17, 2024

Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Dec 17, 2024

Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...
8 MIN READ
Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
Dec 17, 2024

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
Dec 16, 2024

Sandboxing Agentic AI Workflows with WebAssembly

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...
7 MIN READ
Sandboxing Agentic AI Workflows with WebAssembly

Data Science

See all
Dec 20, 2024

Accelerating GPU Analytics Using RAPIDS and Ray

RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
4 MIN READ
Accelerating GPU Analytics Using RAPIDS and Ray
Dec 20, 2024

NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows

Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...
8 MIN READ
NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows
Dec 19, 2024

Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models

Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
11 MIN READ
Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models
Dec 19, 2024

RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs

RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...
8 MIN READ
RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs
Dec 18, 2024

Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost

XGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...
10 MIN READ
Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost
Dec 16, 2024

Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization

2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
NVIDIA Data Science Teaching Kit
Dec 12, 2024

NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators

As data grows in volume, velocity, and complexity, the data science field is booming.  There’s an ever-increasing demand for talent and skill sets to...
3 MIN READ
NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators
Dec 12, 2024

Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML

Modern classification workflows often require classifying individual records and data points into multiple categories instead of just assigning a single label....
4 MIN READ
Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML
Dec 05, 2024

Unified Virtual Memory Supercharges pandas with RAPIDS cuDF

cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements—up to 50x...
5 MIN READ
Unified Virtual Memory Supercharges pandas with RAPIDS cuDF
Image shows a 3D molecular structure of a protein, most likely an antibody, visualized using a ribbon diagram, with the classic Y-shaped configuration characteristic of antibodies.
Dec 03, 2024

In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics

Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...
6 MIN READ
In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics
Nov 28, 2024

Supercharging Deduplication in pandas Using RAPIDS cuDF

A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ
Supercharging Deduplication in pandas Using RAPIDS cuDF
Nov 21, 2024

Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask

As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth—multi-gpu training and analysis...
5 MIN READ
Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask

Robotics

See all
Dec 17, 2024

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
Dec 14, 2024

Introducing Tile-Based Programming in Warp 1.5.0

With the latest release of Warp 1.5.0, developers now have access to new tile-based programming primitives in Python. Leveraging cuBLASDx and cuFFTDx, these new...
14 MIN READ
Introducing Tile-Based Programming in Warp 1.5.0
Robot fingers tying a knot.
Dec 10, 2024

New AI Research Foreshadows Autonomous Robotic Surgery

A robot commonly used and manually manipulated by surgeons for routine operations can now autonomously perform key surgical tasks as precisely as humans....
4 MIN READ
New AI Research Foreshadows Autonomous Robotic Surgery
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
Dec 02, 2024

Unified Whole-Body Control for Physically Simulated Humanoids

Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Unified Whole-Body Control for Physically Simulated Humanoids
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
Nov 06, 2024

Advancing Humanoid Robot Sight and Skill Development with NVIDIA Project GR00T

Humanoid robots present a multifaceted challenge at the intersection of mechatronics, control theory, and AI. The dynamics and control of humanoid robots are...
10 MIN READ
Advancing Humanoid Robot Sight and Skill Development with NVIDIA Project GR00T
Nov 06, 2024

Spotlight: Galbot Builds a Large-Scale Dexterous Hand Dataset for Humanoid Robots Using NVIDIA Isaac Sim

Robotic dexterous grasping is a critical area of research and development, aimed at enabling robots to interact with and manipulate objects as flexibly as...
5 MIN READ
Spotlight: Galbot Builds a Large-Scale Dexterous Hand Dataset for Humanoid Robots Using NVIDIA Isaac Sim
Nov 06, 2024

Spotlight: Fourier Trains Humanoid Robots for Real-World Roles Using NVIDIA Isaac Gym

This post was written in partnership with the Fourier research team. Training humanoid robots to operate in fields that demand high levels of interaction and...
4 MIN READ
Spotlight: Fourier Trains Humanoid Robots for Real-World Roles Using NVIDIA Isaac Gym
Decorative image of icons and a molecular structure in green.
Nov 04, 2024

Build a Video Search and Summarization Agent with NVIDIA AI Blueprint

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
A robot making toast.
Oct 30, 2024

Teaching Robots to Tackle Household Chores

Robotics could make everyday life easier by taking on repetitive or time-consuming tasks. At NVIDIA GTC 2024, researchers from Stanford University unveiled...
2 MIN READ
Teaching Robots to Tackle Household Chores

Simulation / Modeling / Design

See all
Picture of the NVIDIA H200 NVL GPU on a black background.
Dec 20, 2024

Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU

Computational fluid dynamics (CFD) is used in industry and academia to address a wide range of use cases, including external aerodynamics, internal flows, heat...
5 MIN READ
Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU
Dec 19, 2024

Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS

Risk and uncertainty inherent in energy exploration include unknown geological parameters, variations in fluid and rock properties, boundary conditions, and...
8 MIN READ
Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS
Dec 18, 2024

Five Takeaways from NVIDIA 6G Developer Day 2024

NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Five Takeaways from NVIDIA 6G Developer Day 2024
Dec 14, 2024

Introducing Tile-Based Programming in Warp 1.5.0

With the latest release of Warp 1.5.0, developers now have access to new tile-based programming primitives in Python. Leveraging cuBLASDx and cuFFTDx, these new...
14 MIN READ
Introducing Tile-Based Programming in Warp 1.5.0
Dec 13, 2024

High-Fidelity 3D Mesh Generation at Scale with Meshtron

Meshes are one of the most important and widely used representations of 3D assets. They are the default standard in the film, design, and gaming industries and...
7 MIN READ
High-Fidelity 3D Mesh Generation at Scale with Meshtron
Dec 12, 2024

Advancing Solar Irradiance Prediction with NVIDIA Earth-2

As global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce...
9 MIN READ
Advancing Solar Irradiance Prediction with NVIDIA Earth-2
Dec 12, 2024

Time-Lapse AI Model Enhances IVF Embryo Selection

Researchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...
3 MIN READ
Time-Lapse AI Model Enhances IVF Embryo Selection
Robot fingers tying a knot.
Dec 10, 2024

New AI Research Foreshadows Autonomous Robotic Surgery

A robot commonly used and manually manipulated by surgeons for routine operations can now autonomously perform key surgical tasks as precisely as humans....
4 MIN READ
New AI Research Foreshadows Autonomous Robotic Surgery
Dec 10, 2024

NVIDIA CUDA-Q Runs Breakthrough Logical Qubit Application on Infleqtion QPU

Infleqtion, a world leader in neutral atom quantum computing, used the NVIDIA CUDA-Q platform to first simulate, and then orchestrate the first-ever...
6 MIN READ
NVIDIA CUDA-Q Runs Breakthrough Logical Qubit Application on Infleqtion QPU
Dec 05, 2024

Just Released: NVIDIA Modulus v24.12

The new release includes new network architectures for external aerodynamics application as well as for climate and weather prediction.
1 MIN READ
Just Released: NVIDIA Modulus v24.12
An image of Earth from space.
Dec 04, 2024

How AI is Making Climate Modeling Faster, Greener, and More Accurate

Christopher Bretherton, Senior Director of Climate Modeling at the Allen Institute for AI (AI2), highlights how AI is revolutionizing climate science. In this...
2 MIN READ
How AI is Making Climate Modeling Faster, Greener, and More Accurate
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data

Computer Vision / Video Analytics

See all
Images on a conveyor belt identifed with computer vision.
Dec 19, 2024

AI Vision Helps Green Recycling Plants

Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
AI Vision Helps Green Recycling Plants
Dec 12, 2024

Time-Lapse AI Model Enhances IVF Embryo Selection

Researchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...
3 MIN READ
Time-Lapse AI Model Enhances IVF Embryo Selection
Dec 09, 2024

Just Released: NVIDIA VILA VLM

Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ
Just Released: NVIDIA VILA VLM
Simulated 2D and 3D CT scans.
Dec 05, 2024

Celebrating Open Science and Enterprise AI Innovation on MONAI’s 5th Anniversary

As MONAI celebrates its fifth anniversary, we're witnessing the convergence of our vision for open medical AI with production-ready enterprise solutions. ...
7 MIN READ
Celebrating Open Science and Enterprise AI Innovation on MONAI’s 5th Anniversary
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
An avatar sitting at a computer, which is linked to multiple action icons through the NVIDIA NIM icon.
Dec 03, 2024

Build an Agentic Video Workflow with Video Search and Summarization

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Build an Agentic Video Workflow with Video Search and Summarization
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
A closeup of an eye.
Nov 21, 2024

AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans

Your eyes could hold the key to unlocking early detection of Alzheimer’s and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...
3 MIN READ
AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
Decorative image of icons and a molecular structure in green.
Nov 04, 2024

Build a Video Search and Summarization Agent with NVIDIA AI Blueprint

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
A slide of breast cancer cells.
Oct 31, 2024

Deep Learning AI Model Identifies Breast Cancer Spread without Surgery

A new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes—also known as...
4 MIN READ
Deep Learning AI Model Identifies Breast Cancer Spread without Surgery
Close-up shot of a wolf howling. Courtesy of Pexels/patrice schoefolt.
Oct 29, 2024

AI-Powered Devices Track Howls to Save Wolves

A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
AI-Powered Devices Track Howls to Save Wolves

Content Creation / Rendering

See all
Dec 20, 2024

Just Released: GPU Zen 3: Advanced Rendering Techniques

Grab your copy of GPU Zen 3 to lean about the latest in real-time rendering, including NVIDIA contributions to Cyberpunk 2077.
1 MIN READ
Just Released: GPU Zen 3: Advanced Rendering Techniques
Post-visualization still from Mad Max: Furiosa. A close-up view of a desert chase scene after a disaster. The scene has modified vehicles, including a big tanker truck, a crane-like vehicle, motorbikes, and a pickup truck driving fast across a dusty, reddish-brown road under a dramatic, cloudy sky.
Dec 19, 2024

Accelerating Film Production with Dell AI Factory and NVIDIA

Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Accelerating Film Production with Dell AI Factory and NVIDIA
Dec 17, 2024

Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization

NVIDIA OptiX is the API for GPU-accelerated ray tracing with CUDA, and is often used to render scenes containing a wide variety of objects and materials. During...
11 MIN READ
Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization
Image of a photorealistic digital human looking at the camera.
Dec 17, 2024

Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models

NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their...
4 MIN READ
Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models
Dec 13, 2024

High-Fidelity 3D Mesh Generation at Scale with Meshtron

Meshes are one of the most important and widely used representations of 3D assets. They are the default standard in the film, design, and gaming industries and...
7 MIN READ
High-Fidelity 3D Mesh Generation at Scale with Meshtron
Dec 05, 2024

Optimize GPU Workloads for Graphics Applications with NVIDIA Nsight Graphics

One of the great pastimes of graphics developers and enthusiasts is comparing specifications of GPUs and marveling at the ever-increasing counts of shader...
11 MIN READ
Optimize GPU Workloads for Graphics Applications with NVIDIA Nsight Graphics
A person looking at a computer monitor.
Nov 21, 2024

Powering AI-Augmented Workloads with NVIDIA and Windows 365

We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Powering AI-Augmented Workloads with NVIDIA and Windows 365
Collage of 12 different car and background images.
Oct 07, 2024

Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline

Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Oct 02, 2024

Accelerating LLMs with llama.cpp on NVIDIA RTX Systems

The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
5 MIN READ
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
Decorative image of GDN logo floating in a green cloud above a world map that has other gaming logos floating lower down.
Oct 01, 2024

Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN

Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
7 MIN READ
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Oct 01, 2024

Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5

At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
4 MIN READ
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
Sep 23, 2024

Just Released: Free OpenUSD Training Courses

Accelerate your OpenUSD workflows with this free curriculum for developers and 3D practitioners.
1 MIN READ
Just Released: Free OpenUSD Training Courses

Conversational AI

See all
A surgeon using a medical device in an operating room.
Dec 20, 2024

Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices

Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Dec 16, 2024

Sandboxing Agentic AI Workflows with WebAssembly

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...
7 MIN READ
Sandboxing Agentic AI Workflows with WebAssembly
Dec 11, 2024

Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint

In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Chatbot avatar in front of a stylized chat screen on a purple background.
Nov 19, 2024

Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...
9 MIN READ
Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain
GIF shows chat app in use.
Oct 28, 2024

Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA

The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
Oct 22, 2024

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...
16 MIN READ
Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes
Oct 21, 2024

IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient

Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
NCNS logo on a black background.
Oct 16, 2024

Simplify AI Application Development with NVIDIA Cloud Native Stack

In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...
5 MIN READ
Simplify AI Application Development with NVIDIA Cloud Native Stack
Avatars of a patient in a bed with a doctor sitting at a desk in another location, looking at a computer screen.
Oct 01, 2024

Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas

In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
11 MIN READ
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
Sep 26, 2024

Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance

Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Sep 25, 2024

Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint

Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to...
5 MIN READ
Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint

Edge Computing

See all
Images on a conveyor belt identifed with computer vision.
Dec 19, 2024

AI Vision Helps Green Recycling Plants

Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
AI Vision Helps Green Recycling Plants
Dec 18, 2024

Five Takeaways from NVIDIA 6G Developer Day 2024

NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Five Takeaways from NVIDIA 6G Developer Day 2024
Dec 17, 2024

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
Nov 14, 2024

NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features

NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
Close-up shot of a wolf howling. Courtesy of Pexels/patrice schoefolt.
Oct 29, 2024

AI-Powered Devices Track Howls to Save Wolves

A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
AI-Powered Devices Track Howls to Save Wolves
Oct 24, 2024

Powering the Next Wave of AI Robotics with Three Computers 

NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Powering the Next Wave of AI Robotics with Three Computers 
A GIF of a hurricane forecast.
Oct 21, 2024

AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead

New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
3 MIN READ
AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead
Oct 16, 2024

Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs

As the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
2 MIN READ
Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs
Decorative image of a person looking at a monitor, which has multiple brain scans displayed.
Oct 16, 2024

Treating Brain Disease with Brain-Machine Interactive Neuromodulation and NVIDIA Jetson

Neuromodulation is a technique that enhances or restores brain function by directly intervening in neural activity. It is commonly used to treat conditions like...
4 MIN READ
Treating Brain Disease with Brain-Machine Interactive Neuromodulation and NVIDIA Jetson

Data Center / Cloud

See all
Dec 19, 2024

New Whitepaper: NVIDIA AI Enterprise Security

This white paper details our commitment to securing the NVIDIA AI Enterprise software stack. It outlines the processes and measures NVIDIA takes to ensure...
1 MIN READ
New Whitepaper: NVIDIA AI Enterprise Security
Dec 19, 2024

Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS

Risk and uncertainty inherent in energy exploration include unknown geological parameters, variations in fluid and rock properties, boundary conditions, and...
8 MIN READ
Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS
Dec 18, 2024

Five Takeaways from NVIDIA 6G Developer Day 2024

NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Five Takeaways from NVIDIA 6G Developer Day 2024
Dec 16, 2024

Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization

2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
Black and white topology of connected nodes in NVIDIA Air.
Dec 12, 2024

An Introduction to NVIDIA Air

The advent of AI has introduced a new type of data center, the AI factory, purpose-built from the ground up to handle AI workloads. AI workloads can...
6 MIN READ
An Introduction to NVIDIA Air
Dec 12, 2024

Advancing Solar Irradiance Prediction with NVIDIA Earth-2

As global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce...
9 MIN READ
Advancing Solar Irradiance Prediction with NVIDIA Earth-2
Dec 12, 2024

Integration of NVIDIA BlueField DPUs with WEKA Client Boosts AI Workload Efficiency

WEKA, a pioneer in scalable software-defined data platforms, and NVIDIA are collaborating to unite WEKA's state-of-the-art data platform solutions with powerful...
5 MIN READ
Integration of NVIDIA BlueField DPUs with WEKA Client Boosts AI Workload Efficiency
Dec 11, 2024

Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture

Last month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for...
8 MIN READ
Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture
Dec 05, 2024

Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack

The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 21, 2024

Deploying Fine-Tuned AI Models with NVIDIA NIM

For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...
6 MIN READ
Deploying Fine-Tuned AI Models with NVIDIA NIM