developer.nvidia.com
152.199.20.126
Public Scan
Submitted URL: http://developer.nvidia.com/
Effective URL: https://developer.nvidia.com/
Submission: On April 04 via api from US — Scanned from DE
Form analysis
2 forms found in the DOM
https://developer.nvidia.com/search
<form action="https://developer.nvidia.com/search" class="svelte-df2sxi" __bizdiag="1803202813" __biza="WJ__"><input type="text" name="term" placeholder="Search NVIDIA" class="svelte-df2sxi"> <input type="hidden" name="facet.subcollection[]"
value="Developer Zone" class="svelte-df2sxi"> <input type="hidden" name="facet.subcollection[]" value="Developer Forums" class="svelte-df2sxi"> <input type="hidden" name="facet.subcollection[]" value="Technical Blog" class="svelte-df2sxi">
<button id="btn-search" type="submit" class="svelte-df2sxi"><svg xmlns="http://www.w3.org/2000/svg" width="16" height="16" fill="#fff" class="bi bi-search svelte-df2sxi" viewBox="0 0 16 16">
<path d="M11.742 10.344a6.5 6.5 0 1 0-1.397 1.398h-.001c.03.04.062.078.098.115l3.85 3.85a1 1 0 0 0 1.415-1.414l-3.85-3.85a1.007 1.007 0 0 0-.115-.1zM12 6.5a5.5 5.5 0 1 1-11 0 5.5 5.5 0 0 1 11 0z"></path>
</svg></button></form>
/search
<form action="/search" __bizdiag="3556460" __biza="WJ__">
<input class="p--medium" type="search" id="search" name="term" placeholder="Search">
</form>
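The first form above carries no `method` attribute, so a browser would normally submit it as a GET request with the field values in the query string. The following is a minimal sketch of the URL such a submission would produce, using only the action URL, field names, and hidden values visible in the captured markup. The helper name `build_search_url` and the sample search term are hypothetical, and the sketch assumes the live page does not intercept the submit event in JavaScript.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

# Action URL and hidden field values copied from the first captured form.
SEARCH_ACTION = "https://developer.nvidia.com/search"
SUBCOLLECTIONS = ["Developer Zone", "Developer Forums", "Technical Blog"]


def build_search_url(term: str) -> str:
    """Hypothetical helper: mimic a default (GET) submission of the form."""
    # The visible text input is named "term".
    pairs = [("term", term)]
    # Each hidden input repeats the same field name, so the key appears
    # once per value in the query string.
    pairs += [("facet.subcollection[]", value) for value in SUBCOLLECTIONS]
    return f"{SEARCH_ACTION}?{urlencode(pairs)}"


if __name__ == "__main__":
    url = build_search_url("cuda graphs")  # sample term, not from the scan
    print(url)
    # Round-trip the query string to show the repeated key is preserved.
    print(parse_qs(urlsplit(url).query))
```

The pairs are passed as a list of tuples rather than a dict because the form repeats the `facet.subcollection[]` field three times, and a dict would keep only one value per key.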
Text Content
BUILD APPLICATIONS WITH GENERATIVE AI
Experience, prototype, and deploy AI with production-ready APIs that run anywhere.

TUTORIALS
* OPTIMIZING MEMORY AND RETRIEVAL FOR GRAPH NEURAL NETWORKS WITH WHOLEGRAPH, PART 2 (April 03, 2024)
* NEW LAB: GENERATIVE AI INFERENCE WITH NVIDIA NIM (April 03, 2024)
* TUNE AND DEPLOY LORA LLMS WITH NVIDIA TENSORRT-LLM (April 02, 2024)

LATEST RELEASES
* CUDA Toolkit 12.4
* CUTLASS 3.4.1
* DLSS 3.5
* HPC SDK 24.3
* Modulus 24.01
* Nsight Systems 2024.2
* RAPIDS 24.02
* Sionna 0.16.2
* Triton Inference Server 2.42

NEWS
* GENERATIVE AI FOR DIGITAL HUMANS AND NEW AI-POWERED NVIDIA RTX LIGHTING (March 19, 2024)
* NVIDIA SPEECH AND TRANSLATION AI MODELS SET RECORDS FOR SPEED AND ACCURACY (March 19, 2024)
* BOOST MULTI-OMICS ANALYSIS WITH GPU-ACCELERATION AND GENERATIVE AI (March 19, 2024)
* BREAKING BARRIERS IN HEALTHCARE WITH NEW MODELS FOR GENERATIVE AI AND CELLULAR IMAGING (March 19, 2024)

TRAINING
* FUNDAMENTALS OF DEEP LEARNING (Instructor-Led, Certificate Available)
* BUILDING CONVERSATIONAL AI APPLICATIONS (Instructor-Led, Certificate Available)
* MODEL PARALLELISM: BUILDING AND DEPLOYING LARGE NEURAL NETWORKS (Instructor-Led, Certificate Available)

ADDITIONAL RESOURCES
* Instructor-Led Workshops
* Self-Paced Courses
* Full Course Catalog
* Learning Paths
* Enterprise Training Solutions
* Free Courses
* Training Videos On Demand

CHECK OUT RECOMMENDED GTC 2024 SESSIONS

TRANSFORMING AI
The Transforming AI Panel features the authors of "Attention Is All You Need," the groundbreaking paper that introduced the transformer neural network architecture. Transformers have since dominated all areas of AI and revolutionized the industry. Join Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin, hosted by NVIDIA founder and CEO Jensen Huang.

NAVIGATING THE LARGE LANGUAGE MODELS FRONTIER: PRACTICAL STRATEGIES FOR BUILDING ENTERPRISE APPLICATIONS POWERED BY LLMS
Our panel of experts will talk about the best practices for building robust large language model (LLM)-based enterprise applications that deliver value and efficiency. Products such as ChatGPT have demonstrated the unprecedented power of LLMs in processing information and generating content. But harnessing LLMs for building enterprise applications introduces a spectrum of intricate challenges. They include, but aren't limited to, managing the behavior of LLMs (e.g., avoiding hallucination), adapting LLMs to domain-specific tasks while pre-trained on very general domain corpora, interacting with agents to execute some specific tasks, latency, security, and so on. We'll explore how enterprises can address these challenges and exploit the full potential of LLMs for their applications.

CUDA: NEW FEATURES AND BEYOND
The CUDA platform is the foundation of the GPU computing ecosystem. Every application and framework that uses the GPU does so through CUDA's libraries, compilers, runtimes and language, which means CUDA is growing as fast as its ecosystem is evolving. At this engineering-focused talk, you'll learn from one of the architects of CUDA about all that's new and what's coming next, for both CUDA and GPU computing as a whole.

EARLY SCIENCE WITH GRACE HOPPER AT SCALE ON ALPS
We'll introduce early science results and HPC performance of the ETH EXCLAIM project running the ICON climate model on the Alps system infrastructure. We'll focus especially on system and software configuration requirements that optimize the balance of performance between Grace and Hopper under given power capping constraints. Initial performance investigations will begin with climate science, with the intent to inform and inspire other domains and applications.
The target will be performance considerations for the ICON coupling of atmosphere and ocean, but early science results may be limited to an aqua planet model configuration.

DEPLOYING, OPTIMIZING, AND BENCHMARKING LARGE LANGUAGE MODELS WITH TRITON INFERENCE SERVER
Learn how to serve large language models (LLMs) efficiently using Triton Inference Server with step-by-step instructions. NVIDIA Triton Inference Server is an open-source inference serving solution that simplifies the production deployment of AI models at scale. With a uniform interface and standard set of metrics, developers can easily deploy deep learning and machine learning models across many different frameworks (TensorRT, TensorRT-LLM, vLLM, TensorFlow, PyTorch, OpenVINO, and more) on multiple types of hardware (CPU and GPU). We’ll review the challenges of serving LLMs and demonstrate how Triton Inference Server’s latest features help overcome them. We’ll cover how to easily deploy an LLM across multiple backends and compare their performance, as well as how to fine-tune deployment configurations for optimal performance. We'll provide step-by-step instructions for anyone to follow using publicly available collateral and answer questions along the way.

A DEEP DIVE INTO THE LATEST HPC SOFTWARE
Take a deep dive into the latest developments in NVIDIA software for high performance computing applications, including a comprehensive look at what’s new in programming models, compilers, libraries, and tools. We'll cover topics of interest to HPC developers, targeting traditional HPC modeling and simulation, quantum computing, HPC+AI, scientific visualization, and high-performance data analytics.

ROBOTICS AND THE ROLE OF AI: PAST, PRESENT, AND FUTURE
Advances in artificial intelligence have enabled breakthroughs in several fields, including computer vision and natural language processing in both academia and industry.
In this fireside conversation, NVIDIA’s senior director of robotics research, Dieter Fox, will be joined by Marc Raibert, the executive director of The AI Institute, to discuss how artificial intelligence has impacted robotics, from the traditional controls era to today.

DIGITALIZING THE WORLD'S LARGEST INDUSTRIES WITH OPENUSD AND GENERATIVE AI
The world’s largest industries are racing to become software-defined, but digitalization of such processes is complex. Hear from this panel of distinguished luminaries on their industrial digitalization projects that infuse generative AI, new data platforms, 3D interoperability, and advanced visualization throughout their organizations. Want to learn more about OpenUSD for industrial digital twins? Attend this deep learning institute training course.

ROBOTICS IN THE AGE OF GENERATIVE AI
Generative AI is taking automated common-sense reasoning, task planning, and perception to a new level. It is also revolutionizing synthetic data generation, human-computer interaction, and multimodal understanding. Collectively, these are some of the key capabilities required for robots to understand our world and provide humanity with accessible, versatile physical assistance for day-to-day tasks. The key missing ingredient is for generative AI to also understand physical interaction. I'll sketch a future in which embodied AI is a natural extension of the revolution that large multimodal models are ushering, and its implications for the future of collaborative robotics and human-centered AI at large.

INSIGHTS FROM NVIDIA RESEARCH
We'll share some insights from NVIDIA Research for the past year. These will include a power-efficient “always-on” AI accelerator, a diffusion model that improves the resolution of weather predictions, a large language model-powered embodied agent, and a foundation model for autonomous vehicle scene reconstruction.

ACCELERATING AUTOMOTIVE WORKFLOWS WITH LARGE LANGUAGE MODELS
Large language models (LLMs) are revolutionizing the way we interact with information, making it easy to pinpoint information from a large source of data, such as vehicle owner’s manuals or manufacturing machinery manuals. However, ensuring these agents operate accurately, or without hallucinating, presents a variety of challenges. To date, the best solutions use LLMs through the retrieval augmented generation (RAG) architecture, with solutions benefiting from fine-tuning LLMs for performance, scalability, and domain knowledge. This session will demonstrate and discuss such LLM solutions for vehicle engineering, connected vehicle analytics, manufacturing, legal, vehicle service and repair, customer support, and employee support.

INTELLECTUAL PROPERTY CHALLENGES IN THE AGE OF GENERATIVE AI
Generative AI is challenging traditional concepts of intellectual property rights for content and emerging technology, including copyright, trade secrets, and patents. This fireside chat will offer U.S. perspectives on intellectual property in light of new technological advances. This content is produced by the USPTO.

DRIVING ENTERPRISE TRANSFORMATION: CIO INSIGHTS ON HARNESSING GENERATIVE AI'S POTENTIAL
Generative AI is leading a transformative era for enterprises, with vast potential to enhance employee experiences, improve productivity, strengthen security, and drive operational efficiencies. Our esteemed panel of chief information officers will explore how they harness Generative AI in their organizations. AI's promises are accompanied by organizational and technical challenges. CIOs grapple with structuring AI and transformation programs, acquiring essential skills, and establishing guardrails for data governance, security, hallucinations, and toxicity in Generative AI deployments.
They also consider build-versus-buy options and analyze cost-benefit and total-cost-of-ownership dynamics of Generative AI solutions. Moderated by NVIDIA's vice president of enterprise AI and automation, this session provides practical insights and best practices from CIOs at the forefront of enterprise transformation through Generative AI. Join us to explore the future of AI in the enterprise.

REGULATING AI: GLOBAL PERSPECTIVES
Governments around the world are grappling with how to regulate the development, deployment, and use of AI. This panel explores different policy and regulatory approaches being considered and how companies should interpret this rapidly shifting regulatory landscape.

LESSONS ON VIDEO GENERATION MODELS FROM RESEARCH TO PRODUCTION
We'll describe the journey of bringing Runway's Gen-1 and Gen-2 video generation models to production, starting from the research efforts to develop and train those models and going all the way to deploying them to Runway's suite of creative tools, used by millions.

MACHINE LEARNING HAS TAKEN WEATHER FORECASTING BY STORM. HOW ABOUT CLIMATE MODELING?
For 50 years, weather forecast and climate models have been written in Fortran using numerical analysis, physical knowledge, and expert judgment. They have slowly become more skillful with the help of better observations and finer grids. In the past two years, data-driven ML has surpassed the skill of the best forecast models. What can the new kid in town learn to do next? Climate change is a defining issue of the 21st century. Can ML help us model climate, even though the future will not be like the past? Can it help us plan for coming new extremes of heat, flood, drought, and rising sea levels? Indeed, ML may soon become a backbone of climate modeling, saving time and money and making reliable, customized, local climate information much more broadly accessible. You’ll see promising early steps toward that vision.

ON-DEMAND VIDEOS
Latest Technical Overviews and Tutorials (11 sessions)
* 49:36 The Fast Path to Developing with LLMs (David Taubenheim, NVIDIA)
* Bringing Zero-Code Change Acceleration to …
* 01:26:46 Unlocking AI Model Performance: Exploring … (Dmitry Mironov, NVIDIA)
* 57:19 Tailoring LLMs to Your Use Case (Christopher Pang, NVIDIA)
* Part 1 Overview and installation of Video …
* 11:50 Fullbody ARKit Workflow with Audio2Face and … (Edy Susanto, NVIDIA)
* 31:33 Exploring Efficient Tools for Autonomous Vehicle … (Navyaa Sanan, NVIDIA)
* 57:36 Vector Search: Exploring Applications, Techniques,…
* 01:13:07 Power Networks for AI Clouds (Jeff Tantsura, NVIDIA)
* 53:44 Reinventing the Complete Cybersecurity Stack … (Bartley Richardson, NVIDIA)
* 31:28 Taming LLMs with the Latest Customization … (Adi Renduchintala, NVIDIA)

ACCESS THE LATEST NVIDIA DEVELOPER TOOLS, TECHNOLOGY, AND TRAINING.

FIND YOUR SDKS OR SOLUTIONS

BROWSE BY SOLUTION AREAS
* Artificial Intelligence & Deep Learning
* Autonomous Machines
* Graphics & Simulation
* HPC
* Networking

BROWSE BY INDUSTRY
* Healthcare
* Robotics
* Game Development
* Financial Services
* Telecommunication

POPULAR SDKS
* Aerial SDK
* CUDA Toolkit/SDK
* cuDNN
* DALI
* DeepStream
* DLSS
* DOCA SDK
* HPC SDK
* Isaac SDK
* Jetpack
* MDL SDK
* NCCL
* Optical Flow SDK
* OptiX SDK
* RAPIDS
* TensorRT
* Texture Tools Exporter
* TAO Toolkit
* Video Codec SDK

Copyright © 2024 NVIDIA Corporation