Course Outline

Advanced Transformation Building Blocks

  • Working with complex data types
  • Managing fields, metadata, and dynamic structures
  • Reusable transformation patterns

Parameters, Variables, and Job-Oriented Design

  • Runtime variables and scoping
  • Parameterizing transformations
  • Parent-child job structures

Database Integration and Lookup Strategies

  • Advanced lookup steps
  • Caching strategies
  • Efficient join designs

Working with Files, APIs, and External Systems

  • Processing JSON and XML
  • Calling REST and SOAP services
  • Streaming and batch loads

Error Handling and Data Quality Techniques

  • Capturing and routing errors
  • Data validation patterns
  • Auditing and logging

Performance Tuning Essentials

  • Optimizing step design
  • Memory and threading considerations
  • Detecting bottlenecks

Introduction to Repository-Based Development

  • Using the Pentaho repository
  • Version management
  • Team collaboration practices

Deployment and Migration Practices

  • Promoting jobs between environments
  • Configuration management
  • Operational best practices

Summary and Next Steps

Requirements

  • An understanding of ETL fundamentals
  • Experience with Pentaho Data Integration
  • Basic knowledge of data warehousing concepts

Audience

  • ETL developers
  • Data engineers
  • Technical professionals expanding PDI skills
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories