ROCm for Windows Training Course

ROCm is an open-source platform designed for GPU programming that supports AMD graphics cards, while also maintaining compatibility with CUDA and OpenCL. This platform grants developers direct access to hardware specifics, offering complete control over the parallelization process. However, this level of control demands a solid grasp of device architecture, memory models, execution frameworks, and optimization strategies.

The recent introduction of ROCm for Windows enables users to install and utilize ROCm on the Windows operating system, which remains a dominant choice for both personal and professional use. This adaptation allows users to harness the computational power of AMD GPUs for diverse applications, including artificial intelligence, gaming, graphics processing, and scientific computing.

This instructor-led live training session, available either online or in-person, is tailored for beginner to intermediate-level developers who aim to install and utilize ROCm on Windows to program AMD GPUs and fully exploit their parallel capabilities.

Upon completing this training, participants will be equipped to:

Establish a development environment comprising the ROCm Platform, an AMD GPU, and Visual Studio Code on Windows.
Develop a fundamental ROCm application that executes vector addition on the GPU and retrieves results from GPU memory.
Utilize the ROCm API to query device details, manage device memory allocation and deallocation, transfer data between host and device, launch kernels, and synchronize threads.
Employ the HIP language to write kernels that run on the GPU and manipulate data.
Leverage HIP built-in functions, variables, and libraries to carry out common tasks and operations.
Optimize data transfers and memory access by utilizing ROCm and HIP memory spaces, including global, shared, constant, and local regions.
Control threads, blocks, and grids that define parallelism through ROCm and HIP execution models.
Debug and test ROCm and HIP applications using tools like ROCm Debugger and ROCm Profiler.
Enhance ROCm and HIP programs using optimization techniques such as coalescing, caching, prefetching, and profiling.

Course Format

Interactive lectures and discussions.
Extensive exercises and practical sessions.
Hands-on implementation within a live-lab environment.

Course Customization Options

To request customized training for this course, please contact us to make arrangements.

21 hours

UPGen Le Meridien

304,860,354 VND (Online)

304,860,354 VND (Classroom)

ROCm for Windows Training Course

Course Outline

Requirements

Provisional Upcoming Courses (Require 5+ participants)

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

ROCm for Windows Training Course

Course Outline

Requirements

Provisional Upcoming Courses (Require 5+ participants)

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites