In a blog by Yves i read the following comment "It(Talend) offers an "IN PLACE" data profiling which means data does not need to go through the time-consuming process of being extracted from Hadoop before being profiled. "
questions regarding the above comment:
1)So how is this different from other Data profiling Tools??
2)As per my understanding when we run an analysis on Talend , the query runs a MapReduce program on hive in the backend to get the output from Hadoop , so anyways the data is not "MOVED".So how is Talend providing an "IN PLACE" solution???
@Nayan, Which other data profiling tool do you think is able to profile data without extracting a subset from hadoop?
@vinothrajan, you seem to ask two unrelated questions. You should start a new discussion thread for each of your question. Otherwise, you will not get clear answers.
Thank you for your support,