AWS EMR Node configurations

1xMaster r6g.xlarge 200GB

1xCorenode  r6.4xlarge 1000GB

9xTaskNodes  r6.4xlarge 1000GB


1B CSV Load time 255Seconds

1B Parquet Load time  60Seconds

DataCompare Time is 51 MInutes