AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers interested in quickly deploying AI inference services on Cloud Run. It is useful for those familiar with cloud-based serverless application deployment solutions, but who may not have experience with running AI inference using Google Cloud serverless products.

Déployer et adapter des modèles d'IA avec Cloud Run
7 days left! Grow your skills with Coursera Plus for $239/year (usually $399). Save now.

What you'll learn
Utiliser les GPU Cloud Run pour l'inférence de l'IA
Déployer des modèles de langage légers sur Cloud Run pour l'inférence de l'IA
Optimiser les déploiements de modèles sur Cloud Run pour améliorer les performances et la rentabilité
Intégrer les services d'inférence de l'IA Cloud Run avec des services de bases de données sur Google Cloud
Skills you'll gain
Tools you'll learn
Details to know

Add to your LinkedIn profile
February 2026
2 assignments
See how employees at top companies are mastering in-demand skills

There are 3 modules in this course
Instructor

Offered by
Explore more from Cloud Computing

Google Cloud

Google Cloud
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.

Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy



