LLMs and Agents in DevOps Workflows Training Course
Combined with Large Language Models (LLMs), autonomous agent frameworks such as AutoGen and CrewAI are changing how DevOps teams automate operations, including change tracking, test creation, and alert triage, by emulating human-like collaboration and decision-making.
This instructor-led live training, available either online or onsite, is designed for advanced engineers who want to create and deploy automation workflows in DevOps that are driven by Large Language Models and multi-agent systems.
Upon completion of this training, participants will be capable of:
- Incorporating LLM-driven agents into CI/CD pipelines to enable intelligent automation.
- Leveraging agents to automate the generation of tests, analysis of commits, and summarization of changes.
- Orchestrating multiple agents to handle alert triage, formulate responses, and offer DevOps recommendations.
- Constructing secure and easily maintainable workflows powered by agents using open-source frameworks.
Course Format
- Interactive lectures accompanied by discussions.
- Extensive exercises and practical application.
- Practical implementation within a live laboratory environment.
Customization Options for the Course
- To arrange a tailored training session for this course, please reach out to us.
Course Outline
Introduction to Large Language Models and Agent Frameworks
- Overview of large language models applied in infrastructure automation.
- Fundamental concepts within multi-agent workflows.
- AutoGen, CrewAI, and LangChain: Use cases in DevOps.
Configuring LLM Agents for DevOps Tasks
- Installing AutoGen and setting up agent profiles.
- Utilizing the OpenAI API and other LLM service providers.
- Establishing workspaces and environments compatible with CI/CD.
Automating Test and Code Quality Processes
- Prompting Large Language Models to create unit and integration tests.
- Using agents to enforce linting standards, commit rules, and code review guidelines.
- Automated tagging and summarization of pull requests.
Utilizing LLM Agents for Alert Management and Change Detection
- Designing responder agents for pipeline failure notifications.
- Analyzing logs and traces using language models.
- Proactively identifying high-risk changes or misconfigurations.
Multi-Agent Coordination in DevOps Environments
- Role-based agent orchestration (including planner, executor, and reviewer roles).
- Managing agent messaging loops and memory systems.
- Implementing human-in-the-loop designs for critical systems.
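As a rough illustration of the role-based orchestration and human-in-the-loop topics in this module, the following minimal Python sketch wires a planner, a reviewer, and an executor together using plain functions. It is not AutoGen or CrewAI code: the names `Step`, `RunLog`, `planner`, `reviewer`, and `run` are hypothetical, and the planner is a hard-coded stand-in for what would normally be an LLM call.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Step:
    description: str
    critical: bool = False  # critical steps require explicit human approval


@dataclass
class RunLog:
    executed: List[str] = field(default_factory=list)
    skipped: List[str] = field(default_factory=list)


def planner(goal: str) -> List[Step]:
    # Hypothetical stand-in for an LLM call that breaks a goal into steps.
    return [
        Step(f"collect logs for: {goal}"),
        Step(f"restart service for: {goal}", critical=True),
    ]


def reviewer(step: Step, approve: Callable[[Step], bool]) -> bool:
    # Non-critical steps pass automatically; critical ones defer to a human.
    return approve(step) if step.critical else True


def run(goal: str, approve: Callable[[Step], bool]) -> RunLog:
    log = RunLog()
    for step in planner(goal):
        if reviewer(step, approve):
            # Executor role: a real agent would act here; this sketch only records.
            log.executed.append(step.description)
        else:
            log.skipped.append(step.description)
    return log


if __name__ == "__main__":
    # The approval callback models the human in the loop; here it denies everything critical.
    print(run("checkout-api 5xx spike", approve=lambda step: False))
```

Passing the approval decision in as a callback keeps the gate easy to swap: in this sketch it is a lambda, but it could equally be a chat prompt or a ticket approval.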
Security, Governance, and Observability
- Managing data exposure and ensuring LLM safety within infrastructure.
- Auditing agent actions and restricting their operational scope.
- Monitoring pipeline behavior and collecting model feedback.
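To make the auditing and scope-restriction topics in this module more concrete, here is a minimal sketch of an allowlisted tool wrapper that records every call an agent attempts. The class `ScopedToolbox` and the tools `read_logs` and `delete_branch` are hypothetical names chosen for illustration; they do not come from any particular agent framework.

```python
import time
from typing import Any, Callable, Dict, List, Set


class ScopedToolbox:
    """Wraps tool functions with an allowlist and an audit trail (hypothetical helper)."""

    def __init__(self, tools: Dict[str, Callable[..., str]], allowed: Set[str]) -> None:
        self._tools = tools
        self._allowed = set(allowed)
        self.audit_log: List[Dict[str, Any]] = []

    def call(self, agent: str, tool: str, **kwargs: Any) -> str:
        permitted = tool in self._allowed and tool in self._tools
        # Record every attempt, including denied ones, before acting.
        self.audit_log.append(
            {"ts": time.time(), "agent": agent, "tool": tool,
             "args": kwargs, "permitted": permitted}
        )
        if not permitted:
            raise PermissionError(f"{agent} may not call {tool}")
        return self._tools[tool](**kwargs)


def read_logs(service: str) -> str:
    # Placeholder for a read-only operation.
    return f"last 100 log lines for {service}"


def delete_branch(name: str) -> str:
    # Placeholder for a destructive operation that should stay out of scope.
    return f"deleted {name}"


toolbox = ScopedToolbox(
    tools={"read_logs": read_logs, "delete_branch": delete_branch},
    allowed={"read_logs"},
)

if __name__ == "__main__":
    print(toolbox.call("triage-agent", "read_logs", service="checkout-api"))
    try:
        toolbox.call("triage-agent", "delete_branch", name="main")
    except PermissionError as exc:
        print(f"denied: {exc}")
    print(f"{len(toolbox.audit_log)} audited calls")
```

Logging the attempt before the permission check means denied calls also appear in the audit trail, which is usually what a reviewer wants to see.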
Real-World Use Cases and Custom Scenarios
- Designing agent workflows for incident response.
- Integrating agents with GitHub Actions, Slack, or Jira.
- Best practices for scaling LLM integration in DevOps environments.
Summary and Next Steps
Requirements
- Experience with DevOps tools and pipeline automation.
- Practical knowledge of Python and Git-based workflows.
- Familiarity with Large Language Models or exposure to prompt engineering.
Target Audience
- Innovation engineers and AI-integrated platform leads.
- Developers specializing in LLMs within DevOps or automation contexts.
- DevOps professionals investigating intelligent agent frameworks.
Open Training Courses require 5+ participants.
Related Courses
Agentic Development with Gemini 3 and Google Antigravity
21 Hours
Google Antigravity serves as an agentic development environment, enabling the creation of autonomous agents that can plan, reason, code, and execute actions through the multimodal capabilities of Gemini 3.
This instructor-led live training, available online or onsite, is designed for advanced technical professionals who want to design, build, and deploy autonomous agents using Gemini 3 within the Antigravity environment.
Upon completing this training, participants will be equipped to:
- Create autonomous workflows that leverage Gemini 3 for reasoning, planning, and execution.
- Develop agents in Antigravity capable of analyzing tasks, writing code, and interacting with various tools.
- Integrate agents powered by Gemini into enterprise systems and APIs.
- Enhance agent behavior, safety, and reliability in complex operational environments.
Course Format
- Expert demonstrations paired with interactive discussions.
- Hands-on experimentation focused on autonomous agent development.
- Practical implementation utilizing Antigravity, Gemini 3, and supporting cloud tools.
Customization Options
- For teams requiring domain-specific agent behaviors or custom integrations, please contact us to tailor the program.
Advanced Antigravity: Feedback Loops, Learning & Long-Term Agent Memory
14 Hours
Google Antigravity serves as a sophisticated framework for experimenting with persistent agents and emergent interactive behaviors.
This instructor-led live training (available online or onsite) is designed for advanced professionals aiming to design, analyze, and optimize agents that retain memories, improve via feedback, and evolve over extended operational periods.
After completing this course, participants will acquire the skills to:
- Construct memory architectures for agent persistence.
- Deploy effective feedback loops to influence agent behavior.
- Assess learning trajectories and monitor model drift.
- Integrate memory mechanisms into complex multi-agent environments.
Course Format
- Expert-led discussions combined with technical demonstrations.
- Practical exploration through structured design challenges.
- Application of concepts within simulated agent environments.
Course Customization Options
- For organizations requiring tailored content or specific case examples, please reach out to customize this training.
Advanced Mastra Integrations: APIs, Tools, Enterprise Data & External Systems
21 Hours
Mastra is a framework designed to facilitate deep integration between AI agents, APIs, enterprise applications, and external data systems.
This instructor-led live training (available online or onsite) targets intermediate-level engineers looking to build reliable, secure, and scalable integrations between Mastra agents and the wider enterprise ecosystem.
Upon completing this training, participants will be equipped to:
- Implement API-driven integrations connecting Mastra agents with external services.
- Link enterprise data systems and tools to automated agent workflows.
- Apply best practices for secure data exchange and authentication.
- Design integration layers that are scalable, maintainable, and ready for production use.
Course Format
- Interactive lectures and discussions.
- Hands-on engineering exercises involving integrations and APIs.
- Live laboratory implementation using real-world enterprise scenarios.
Course Customization Options
- Custom API scenarios, enterprise system mappings, or data-integration workshops can be arranged upon request.
AIOps in Action: Incident Prediction and Root Cause Automation
14 Hours
AIOps (Artificial Intelligence for IT Operations) is increasingly being adopted to anticipate incidents before they happen and automate root cause analysis (RCA), thereby reducing downtime and speeding up resolution times.
This instructor-led live training, available online or on-site, targets advanced IT professionals looking to implement predictive analytics, automate remediation procedures, and design intelligent RCA workflows using AIOps tools and machine learning models.
Upon completion of this training, participants will be capable of:
- Building and training ML models to identify patterns associated with system failures.
- Automating RCA workflows by correlating data from multiple logs and metrics sources.
- Integrating alerting and remediation processes into existing platforms.
- Deploying and scaling intelligent AIOps pipelines within production environments.
Course Format
- Interactive lectures and discussions.
- Numerous exercises and practical activities.
- Hands-on implementation within a live-lab environment.
Customization Options for the Course
- To request customized training for this course, please contact us.
AIOps Fundamentals: Monitoring, Correlation, and Intelligent Alerting
14 Hours
AIOps (Artificial Intelligence for IT Operations) is a methodology that leverages machine learning and analytics to automate and enhance IT operations, specifically focusing on monitoring, incident detection, and response.
This instructor-led live training (available online or onsite) targets intermediate-level IT operations professionals looking to apply AIOps techniques to correlate metrics and logs, minimize alert noise, and boost observability through intelligent automation.
Upon completion of this training, participants will be capable of:
- Grasping the core principles and architecture of AIOps platforms.
- Correlating data across logs, metrics, and traces to pinpoint root causes.
- Alleviating alert fatigue via intelligent filtering and noise suppression techniques.
- Utilizing open-source or commercial tools to automatically monitor and respond to incidents.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical sessions.
- Hands-on implementation within a live-lab environment.
Course Customization Options
- For tailored training requests, please contact us to arrange details.
Building an AIOps Pipeline with Open Source Tools
14 Hours
Developing an AIOps pipeline exclusively with open-source tools enables teams to create cost-efficient and adaptable solutions for observability, anomaly detection, and intelligent alerting within production environments.
This instructor-led live training, available both online and onsite, targets advanced-level engineers looking to design and implement a complete AIOps pipeline utilizing tools such as Prometheus, ELK, Grafana, and custom machine learning models.
Upon completing this training, participants will be equipped to:
- Architect an AIOps infrastructure relying solely on open-source components.
- Gather and standardize data from logs, metrics, and traces.
- Implement ML models to identify anomalies and forecast incidents.
- Automate alerting and remediation processes using open-source tooling.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical application.
- Hands-on implementation within a live lab setting.
Course Customization Options
- To request tailored training for this course, please get in touch to make arrangements.
Antigravity for Developers: Building Agent-First Applications
21 Hours
Antigravity serves as a development platform specifically designed to create AI-driven, agent-first applications.
This instructor-led live training, available both online and onsite, targets intermediate developers aiming to build real-world applications leveraging autonomous AI agents within the Antigravity ecosystem.
Upon completing this training, participants will be able to:
- Develop applications that depend on coordinated and autonomous AI agents.
- Utilize the Antigravity IDE, including its editor, terminal, and browser features, for end-to-end development workflows.
- Manage multi-agent workflows effectively using the Agent Manager.
- Integrate agent capabilities into robust, production-grade software systems.
Course Format
- A blend of presentations and in-depth practical demonstrations.
- Extensive hands-on practice accompanied by guided exercises.
- Real-world implementation tasks conducted within the live Antigravity environment.
Course Customization Options
- For content tailored to align with your specific development stack, please contact us to arrange a customized training session.
Getting Started with Antigravity: An Introduction to Agent-First IDEs
14 Hours
Google Antigravity is an agent-first development environment designed to streamline engineering workflows through intelligent automation.
This instructor-led, live training (online or onsite) is aimed at beginner-level practitioners who wish to explore the fundamentals of Antigravity and understand how agent-driven coding environments enhance productivity.
Upon completion of this training, participants will be able to:
- Install and configure Google Antigravity.
- Navigate and understand both the Editor View and Manager View.
- Work effectively with agents to automate simple development tasks.
- Use Antigravity to generate, refine, and manage project files.
Format of the Course
- Instructor explanations supported by real-time demonstrations.
- Guided exercises focused on hands-on use of agents.
- Practical exploration of core Antigravity features in a controlled lab environment.
Course Customization Options
- If you require a tailored version of this training, please contact us to arrange a customized program.
Antigravity for Web Automation & Browser-Based Tasks
21 Hours
Google Antigravity serves as a platform for developing agents that interact with web applications, browser environments, and multi-surface workflows.
This instructor-led live training, available online or onsite, is designed for intermediate-level professionals looking to build, automate, and test browser-based workflows using Google Antigravity.
After completing the training, participants will be able to:
- Create agents that interact with web applications via a browser interface.
- Automate end-to-end workflows across various browser contexts.
- Validate and troubleshoot agent behavior in user interface-driven environments.
- Implement cross-surface automation strategies using Antigravity.
Format of the Course
- Guided instruction supported by demonstrations.
- Practical, hands-on activities and scenario-based exercises.
- Implementation of agent workflows in an interactive lab environment.
Course Customization Options
- For customized training requirements, please contact us to tailor the course to your objectives.
Enterprise AIOps with Splunk, Moogsoft, and Dynatrace
14 Hours
Enterprise AIOps platforms such as Splunk, Moogsoft, and Dynatrace offer robust capabilities for identifying anomalies, correlating alerts, and automating responses across large-scale IT environments.
This instructor-led live training (available online or onsite) is designed for intermediate-level enterprise IT teams looking to integrate AIOps tools into their current observability stack and operational workflows.
Upon completing this training, participants will be able to:
- Configure and integrate Splunk, Moogsoft, and Dynatrace into a unified AIOps architecture.
- Correlate metrics, logs, and events across distributed systems using AI-driven analysis.
- Automate incident detection, prioritization, and response through built-in and custom workflows.
- Optimize performance, reduce MTTR, and enhance operational efficiency at an enterprise scale.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to make arrangements.
Implementing AIOps with Prometheus, Grafana, and ML
14 Hours
Prometheus and Grafana are widely adopted tools for observability in modern infrastructure, while machine learning enhances these tools with predictive and intelligent insights to automate operations decisions.
This instructor-led, live training (online or onsite) is aimed at intermediate-level observability professionals who wish to modernize their monitoring infrastructure by integrating AIOps practices using Prometheus, Grafana, and ML techniques.
By the end of this training, participants will be able to:
- Configure Prometheus and Grafana for observability across systems and services.
- Collect, store, and visualize high-quality time series data.
- Apply machine learning models for anomaly detection and forecasting.
- Build intelligent alerting rules based on predictive insights.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us.
AI Agent Development with Mastra
14 Hours
This guided live training (available online or in-person) targets intermediate software developers and engineering teams aiming to construct scalable, observable AI systems using Mastra.
Upon completion of this training, participants will be able to:
- Grasp Mastra’s architecture and its integration with Large Language Models (LLMs) and external APIs.
- Design and build AI agents and workflows using TypeScript.
- Leverage Mastra’s observability and memory tools to track and enhance agent performance.
- Deploy production-grade AI applications utilizing Mastra’s framework capabilities.
Mastra Debugging, Evaluation & Quality Assurance for AI Agents
21 Hours
Mastra is a framework that offers structured tools to evaluate, debug, and ensure the reliability of AI agents operating within complex workflows.
This instructor-led, live training (available online or onsite) targets intermediate-level practitioners who want to rigorously test agent behavior, enhance reliability, and implement measurable evaluation processes.
By the end of this training, participants will be able to confidently:
- Apply debugging techniques to identify and resolve agent behavior issues.
- Evaluate agents using structured metrics, benchmarks, and quality scores.
- Implement tooling and workflows that track reliability, drift, and hallucinations.
- Design QA strategies that ensure consistent and predictable agent performance.
Course Format
- Interactive lectures and discussions.
- Hands-on debugging and evaluation exercises.
- Live-lab analysis of agent behaviors using observability tools.
Course Customization Options
- Customized reliability testing scenarios and industry-specific QA methods can be arranged upon request.
Managing Agent Workflows in Google Antigravity: Orchestration, Planning and Artifacts
14 Hours
Google Antigravity serves as a platform centered on agents, designed to orchestrate, oversee, and coordinate workflows for AI-driven coding and automation.
This instructor-led training session, available either online or at your facility, targets intermediate-level professionals seeking to design, manage, and optimize multi-agent workflows within the Google Antigravity ecosystem.
Upon completing this training, participants will be able to:
- Set up agent responsibilities and orchestration pipelines through the Manager interface.
- Create and interpret Antigravity artifacts such as task lists, plans, logs, and browser recordings.
- Establish verification strategies to maintain transparency and auditability in agent actions.
- Enhance multi-agent collaboration for intricate development and operational assignments.
Course Format
- Guided presentations combined with practical demonstrations.
- Scenario-based exercises targeting real-world workflow challenges.
- Hands-on exploration within a live Antigravity workspace.
Customization Options
- To arrange a customized version of this course, please contact us.
Testing & Verifying Agent-Driven Code: Quality Assurance in Antigravity
14 Hours
Antigravity is a framework designed to represent sophisticated, agent-driven development workflows.
This instructor-led live training (available online or onsite) targets intermediate to advanced professionals seeking to verify, validate, and secure the outputs produced by AI agents operating within Antigravity environments.
After completing this course, participants will be capable of:
- Evaluating the accuracy and safety of code artifacts generated by agents.
- Employing structured methods to verify tasks executed by agents.
- Effectively analyzing browser recordings and tracing agent activities.
- Applying QA and security principles to guarantee the reliability of agent workflows.
Course Format
- Technical briefings and discussions guided by an instructor.
- Practical exercises focused on verifying real-world agent workflows.
- Hands-on testing and validation within a controlled lab environment.
Customization Options
- Scenarios, workflows, and testing examples can be adapted upon request.