Check out this blog post (even though it's Scala-specific) for an overview of Pipelines: https://databricks.com/blog/2015/01/07/ml-pipelines-a-new-high-level-api-for-mllib.html