Packt
Train Large Language Models Faster - Parallelism Deep Dive

Gain next-level skills with Coursera Plus for $199 (regularly $399). Save now.

Packt

Train Large Language Models Faster - Parallelism Deep Dive

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Learn to apply parallelism strategies to accelerate LLM training.

  • Understand the differences and use cases of data, model, and hybrid parallelism.

  • Gain hands-on experience with PyTorch and DeepSpeed for LLM training optimization.

  • Master fault tolerance and checkpointing strategies to ensure training reliability.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

January 2026

Assessments

16 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 16 modules in this course

In this module, we will introduce the course, explain the key objectives, and provide a roadmap of how parallelism techniques will accelerate large language model training. You will gain an overview of what to expect and get familiar with the course structure.

What's included

3 videos1 reading

In this module, we will explore the different parallelism strategies for LLM training, including single GPU vs. parallel strategies. You'll understand how parallelism improves efficiency and learn its key advantages in real-world applications.

What's included

4 videos1 assignment

In this module, we will establish a foundational understanding of IT concepts crucial for training LLMs. Topics like cloud computing, storage solutions, and computer architecture will provide the context for optimizing LLM workflows.

What's included

10 videos1 assignment

In this module, we will explore GPU architecture and its role in LLM training. You'll learn how GPUs are designed to handle the massive computations required by large models, ensuring faster and more efficient training.

What's included

2 videos1 assignment

In this module, we will cover the fundamentals of machine learning and deep learning. We’ll explore neural networks, training processes, and key differences between ML and DL to lay the groundwork for LLM training.

What's included

11 videos1 assignment

In this module, we will dive into the fundamentals of LLMs, starting with the Transformer architecture. You'll learn about key components such as self-attention and how the Transformer library powers modern AI applications.

What's included

5 videos1 assignment

In this module, we will introduce parallel computing concepts and their relevance to LLM training. You’ll gain a deeper understanding of how parallelism reduces bottlenecks and accelerates model development.

What's included

2 videos1 assignment

In this module, we will explore data, model, and hybrid parallelism in detail. You’ll learn how each strategy optimizes training workflows and where to apply them for maximum efficiency in LLM training.

What's included

11 videos1 assignment

In this module, we will delve into pipeline and tensor parallelism, explaining their key concepts and how they work together to enhance training efficiency. You’ll also explore real-world strategies for implementing these techniques.

What's included

11 videos1 assignment

In this module, we will dive deep into tensor parallelism, focusing on partitioning strategies, communication patterns, and device synchronization. You'll gain a clear understanding of how this technique accelerates LLM training.

What's included

8 videos1 assignment

In this module, we will shift to hands-on learning, applying data parallelism techniques in PyTorch. You'll train a small model on the MNIST dataset, testing different parallelism strategies and observing their effects on performance.

What's included

11 videos1 assignment

In this module, we will apply data parallelism to the WikiText-2 dataset and use DeepSpeed to optimize memory usage. You'll gain hands-on experience with advanced techniques to improve LLM training efficiency.

What's included

3 videos1 assignment

In this module, we will guide you through setting up Runpod.io for multi-GPU parallelism. You’ll gain practical experience running parallelism experiments on a distributed environment and working with large-scale models.

What's included

5 videos1 assignment

In this module, we will dive into fault tolerance and checkpointing strategies. You'll learn how to ensure scalable, resilient LLM training workflows that can recover from failures and continue without interruptions.

What's included

10 videos1 assignment

In this module, we will explore cutting-edge advancements in parallel computing and LLM training. You'll gain insight into the latest trends and technologies that are revolutionizing AI and the future of machine learning.

What's included

1 video1 assignment

In this module, we will wrap up the course by summarizing everything you've learned about parallelism and LLM training. You'll also receive guidance on how to proceed with your AI journey and apply these skills in future projects.

What's included

1 video2 assignments

Instructor

Packt - Course Instructors
Packt
1,299 Courses334,545 learners

Offered by

Packt

Explore more from Cloud Computing

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Coursera Plus

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions