It depends more on your chosen platform. If you are using Hadoop with HDFS, MapReduce, and so on, Pig might be a good option. If you are using Spark, either standalone or with Hadoop, then Spark is clearly the right option. So rather than being a comparison of Talend functionality, it's more about your chosen environment.