Centizen moved the client’s POS data into Hadoop on the AWS cloud and then used Spark for predictive analytics. To improve security and reduce risk, the bulk import was performed over AWS Direct Connect, which provided private, secure connectivity with higher throughput and a more reliable connection. Tableau was implemented for data visualization, generating reports on a range of business activities to help the retailer improve its bottom line.
The technologies used include:
AWS, Amazon Hadoop ecosystem, EMR, S3 file system, Cloudera Hadoop distribution, HDFS, Hive, Pig, Python, S3 CLI, JSON parsing, Spark (Core and SQL modules), Airflow, Genie, AWS Lambda, SQS, SNS, and MSMQ.