This recipe guides you through the process of turning an existing Kettle transformation into a data service for the Thin Kettle JDBC Driver. A data service is a configuration that allows the user to query a transformation as if it were a table in a database.
To get ready for this recipe, you first need to start Spoon and the MongoDB server with the same database from the previous chapter.
We are assuming that you have MongoDB with the data generated in the previous chapters and Spoon open from the Pentaho EE version. Perform the following steps to create a data service:
chapter1-using-mongodb-aggregation-framework.ktr
file and save it as chapter2-using-mongodb-aggregation-framework-kettle-thin.ktr
. Change the transformation name to MongoDB Aggregation Kettle Thin
.As we explained in the previous chapter, this transformation will query data from a MongoDB instance using the MongoDB Aggregation Framework.
However, in this recipe, we configured the output of this transformation to serve as a Kettle Data Service. The configuration for this Kettle Data Service will be saved in the <user home folder>/.pentaho/metastore/pentaho/Kettle Data Service/
folder with the name as AggregationTable.xml
in Unix/Linux operating systems and C:Users<user home folder>.pentahometastorepentahoKettle Data ServiceAggregationTable.xml
. This XML contains the metadata that describes the data service.
In the next recipes, we will guide you through running Carte and Pentaho Data Integration Server (DI Server) in a single instance/server. You can run these platforms in a cluster, but that isn't the goal of this book. With Carte or DI Server up and running, you will be able to list all details of all Data Services from these documents.