So, following on from the first post in this series, here’s all the technical gubbins.
Firstly, how do you build PDI as an engine? Well, simple: you create a pom.xml and build it with Maven.
The key parts of that file are:
- Adding the Pentaho repository
- Defining pentaho.kettle.version
- Adding the core AWS Lambda Java libraries
- Figuring out that the vfs library version needs to be this weird thing: 20050307052300
- And then the key point: using the Maven Shade plugin, which basically gathers up your code plus all its dependencies and dumps the whole lot into a single jar suitable for uploading directly to AWS.
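To make that list a bit more concrete, here is a rough sketch of how those pieces might sit together in a pom.xml. Treat it as a starting point rather than the exact file: the project coordinates (com.example / pdi-lambda), the repository URL, the Kettle artifact names, the commons-vfs coordinates and the plugin and Lambda library versions are all my assumptions, and the Kettle version is left as a placeholder for you to fill in.

```xml
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
  <modelVersion>4.0.0</modelVersion>

  <!-- Hypothetical project coordinates, pick your own -->
  <groupId>com.example</groupId>
  <artifactId>pdi-lambda</artifactId>
  <version>0.1.0</version>
  <packaging>jar</packaging>

  <properties>
    <!-- Placeholder: set this to the PDI/Kettle release you are targeting -->
    <pentaho.kettle.version>REPLACE_WITH_YOUR_PDI_VERSION</pentaho.kettle.version>
  </properties>

  <repositories>
    <!-- The Pentaho repository. Assumed URL: it has moved over the years,
         so check the current Pentaho documentation -->
    <repository>
      <id>pentaho</id>
      <url>http://nexus.pentaho.org/content/groups/omni</url>
    </repository>
  </repositories>

  <dependencies>
    <!-- PDI (Kettle) core and engine, assumed artifact names -->
    <dependency>
      <groupId>pentaho-kettle</groupId>
      <artifactId>kettle-core</artifactId>
      <version>${pentaho.kettle.version}</version>
    </dependency>
    <dependency>
      <groupId>pentaho-kettle</groupId>
      <artifactId>kettle-engine</artifactId>
      <version>${pentaho.kettle.version}</version>
    </dependency>

    <!-- Core AWS Lambda Java library (version is an assumption) -->
    <dependency>
      <groupId>com.amazonaws</groupId>
      <artifactId>aws-lambda-java-core</artifactId>
      <version>1.1.0</version>
    </dependency>

    <!-- VFS pinned to the odd timestamp version (coordinates assumed) -->
    <dependency>
      <groupId>commons-vfs</groupId>
      <artifactId>commons-vfs</artifactId>
      <version>20050307052300</version>
    </dependency>
  </dependencies>

  <build>
    <plugins>
      <!-- Shade plugin: bundles everything into one uploadable jar -->
      <plugin>
        <groupId>org.apache.maven.plugins</groupId>
        <artifactId>maven-shade-plugin</artifactId>
        <version>2.3</version>
        <executions>
          <execution>
            <phase>package</phase>
            <goals>
              <goal>shade</goal>
            </goals>
          </execution>
        </executions>
      </plugin>
    </plugins>
  </build>
</project>
```

With something like this in place, running mvn package should leave the shaded jar in the target directory, and that jar is the thing you upload.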
What next? Well, topics for the next few weeks include:
- The Java code wrapper to launch PDI
- Persistence (S3 / Redshift)