Overview
Awesome Products Inc. is an e-commerce company selling a wide range of consumer products. Customers who purchase products from Awesome Products Inc. can add reviews and ratings for the products they have bought, to help other customers find good-quality products. As the company has grown rapidly in recent times, each product has accumulated a large number of reviews, which are uploaded to an Amazon S3 bucket.
The company management wants to understand how the products are doing in the market and whether they are meeting customer needs. However, it is not feasible for them to analyze the reviews manually.
You have recently been hired as a Data Analytics Engineer to derive customer sentiment from the product reviews available in the S3 bucket. The analyzed data needs to be stored in an Amazon Redshift table to help company management perform historical analysis and reporting.
Task Architecture
Challenge Progress
Task     Status       Possible Points   Clue Deduction Points   Total Earned
Task 1   Completed!   32                0                       32
Task 2   Completed!   16                0                       16
Task 3   Completed!   32                0                       32
Total Points Earned: 80
Task 1: Analyze Sentiment for each review
Possible Points: 32 Clue Penalty: 0 Points Earned: 32
Completed!
Background
The analytics team needs to consolidate the customer reviews and perform sentiment analysis. Because you have been tasked with deriving sentiment insights from the product reviews, you need to run an Amazon Comprehend analysis job, which creates an output file with the analysis data.
Your task
A text file containing customer reviews for a product is available in the S3 bucket whose name is prefixed with s3productreview. Use this as the input file for an Amazon Comprehend analysis job that creates an output file with sentiment details for each product review. The output file should be created in the same S3 bucket. Once the output file is created, the task is complete.
Inventory
Your AWS account is provisioned with the following:
S3 bucket prefixed with: s3productreview
Product review file within the S3 bucket
Minimum permissions required for you to complete this task
Getting Started
Navigate to the S3 console and identify the bucket and file required for this task. Copy the S3 URI of the file, then move on to the Amazon Comprehend console and create a sentiment analysis job with the identified S3 file as input.
For Output data, specify the same S3 bucket value without any object name.
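The same job can also be described programmatically. The following is a minimal sketch, assuming boto3 is available and the account permits it; the bucket name, file name, job name, and role ARN are hypothetical placeholders, and the English language code and one-review-per-line input format are assumptions about the review file rather than facts from the task.

```python
from urllib.parse import urlparse


def bucket_root(s3_uri):
    """Return the bucket-level URI (no object name) for an S3 object URI."""
    parsed = urlparse(s3_uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {s3_uri!r}")
    return f"s3://{parsed.netloc}/"


def build_sentiment_job_request(input_uri, role_arn, job_name):
    """Build keyword arguments for Comprehend's StartSentimentDetectionJob."""
    return {
        "JobName": job_name,
        "LanguageCode": "en",  # assumption: the reviews are in English
        "DataAccessRoleArn": role_arn,  # role Comprehend assumes to read/write S3
        "InputDataConfig": {
            "S3Uri": input_uri,
            "InputFormat": "ONE_DOC_PER_LINE",  # assumption: one review per line
        },
        # Output goes to the same bucket, without any object name.
        "OutputDataConfig": {"S3Uri": bucket_root(input_uri)},
    }


def start_job(request):
    """Submit the job; boto3 is imported lazily so the builders work without it."""
    import boto3

    client = boto3.client("comprehend")
    return client.start_sentiment_detection_job(**request)["JobId"]


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    request = build_sentiment_job_request(
        "s3://s3productreview-example/reviews.txt",
        "arn:aws:iam::123456789012:role/example-comprehend-role",
        "product-review-sentiment",
    )
    print(request)
```

Calling start_job(request) would submit the job; the job name passed in is the value the task validation asks for.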
Services involved
Amazon S3
Amazon Comprehend
Task Validation
Once the Amazon Comprehend analysis job has completed successfully, enter the analysis job name in the Enter answer here field at the top of this page and click Submit to get credit.
Task 2: Get Ready to load the data to database with an IAM Role
Possible Points: 16 Clue Penalty: 0 Points Earned: 16
Completed!
Background
Congratulations on successfully creating the sentiment analysis job. The team is happy, as we are one step closer to getting the results. The Amazon Comprehend job is now complete and the output is available in the same S3 bucket. The analyzed data needs to be loaded into an Amazon Redshift table to help company management perform historical analysis and reporting. Before the output data is loaded into the Redshift cluster, the permissions that allow the cluster to access the data in the S3 bucket need to be in place.
Your task
Create an IAM role that allows the Redshift cluster to access S3 objects. Don’t forget to associate the role with the Redshift cluster, which paves the way for the next task. Please ignore errors in the Amazon Redshift console.
Inventory
Your AWS account is provisioned with the following:
S3 bucket prefixed with: s3productreview
Redshift Cluster
Minimum permissions required for you to complete this task
Check the output properties tab on the left for the Redshift cluster identifier details.
Getting Started
Navigate to the IAM Console and create an IAM role for the Redshift Service with necessary privileges to access your S3 bucket.
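For reference, the role needs a trust policy that lets the Redshift service assume it, plus a policy granting S3 read access. The sketch below shows one way this might look with boto3; the role name and cluster identifier are hypothetical, and attaching the AWS managed AmazonS3ReadOnlyAccess policy is an assumption about what counts as "necessary privileges" here, since the task does not name a specific policy.

```python
import json

# Trust policy allowing the Redshift service to assume the role.
REDSHIFT_TRUST_POLICY = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {"Service": "redshift.amazonaws.com"},
            "Action": "sts:AssumeRole",
        }
    ],
}

# Assumption: read-only S3 access is sufficient for COPY from the bucket.
S3_READ_POLICY_ARN = "arn:aws:iam::aws:policy/AmazonS3ReadOnlyAccess"


def create_redshift_s3_role(role_name, cluster_id):
    """Create the role, grant S3 read access, and associate it with the cluster."""
    import boto3  # imported lazily so the policy document can be inspected without boto3

    iam = boto3.client("iam")
    role = iam.create_role(
        RoleName=role_name,
        AssumeRolePolicyDocument=json.dumps(REDSHIFT_TRUST_POLICY),
    )
    iam.attach_role_policy(RoleName=role_name, PolicyArn=S3_READ_POLICY_ARN)

    role_arn = role["Role"]["Arn"]
    boto3.client("redshift").modify_cluster_iam_roles(
        ClusterIdentifier=cluster_id,
        AddIamRoles=[role_arn],
    )
    return role_arn


if __name__ == "__main__":
    # Print the trust policy as it would be pasted into the IAM console.
    print(json.dumps(REDSHIFT_TRUST_POLICY, indent=2))
```

The lab account is provisioned with minimum permissions, so some of these calls may be denied there; in that case the same trust policy and managed policy can be selected in the console.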
Services involved
Amazon Redshift
AWS IAM
Task Validation
Once the IAM role is created with the necessary privileges and associated with the Redshift cluster, the task will be automatically validated for completion. You can alternatively click Check my Progress to check the status.
Task 3: Copy sentiment analysis output to Redshift
Possible Points: 32 Clue Penalty: 0 Points Earned: 32
Completed!
Background
Great work on creating the necessary permissions as well! One more step closer to the solution! We now have the Amazon Comprehend sentiment analysis output file as well as the privileges needed to load it into the Redshift table. All that is left is to copy the data into the Redshift table.
Your task
The output from the sentiment analysis job has been unzipped and is available as a JSON file in the output folder, under the folder prefixed with -SENTIMENT-*** in the same S3 bucket. Use the COPY command to load the unzipped Amazon Comprehend JSON output file into the Redshift table feedback.product_feedback.
Inventory
Your AWS account is provisioned with the following:
Redshift cluster with table feedback.product_feedback
Unzipped output JSON file within the S3 bucket with prefix s3productreview (navigate through the S3 bucket folders until you reach the output/ folder)
Minimum permissions required for you to complete this task
Check the output properties tab on the left side to get the database name and username of the Redshift cluster.
Getting Started
Navigate to the Redshift console and use Query editor v1 (v1 is shown as just “Query editor”) to connect to the database using the Temporary credentials method.
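The COPY statement to paste into the query editor might look like the output of the sketch below. The S3 path and role ARN are hypothetical placeholders to be replaced with the real values, and the JSON 'auto' option is an assumption that the columns of feedback.product_feedback match the key names in the Comprehend output; if they do not, a jsonpaths file would be needed instead.

```python
def sql_literal(value):
    """Quote a string as a SQL literal, doubling any embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_copy_statement(table, s3_uri, role_arn, json_option="auto"):
    """Build a Redshift COPY statement that loads a JSON file from S3."""
    return (
        f"COPY {table}\n"
        f"FROM {sql_literal(s3_uri)}\n"
        f"IAM_ROLE {sql_literal(role_arn)}\n"
        f"FORMAT AS JSON {sql_literal(json_option)};"
    )


if __name__ == "__main__":
    # Hypothetical values; substitute the real output file URI and role ARN.
    print(
        build_copy_statement(
            "feedback.product_feedback",
            "s3://s3productreview-example/<path-to-unzipped-json-output>",
            "arn:aws:iam::123456789012:role/example-redshift-s3-role",
        )
    )
```

The role ARN is the one created and associated with the cluster in Task 2.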
Services involved
Amazon Redshift
Task Validation
Once the file is loaded successfully with the COPY command, the task will be automatically validated for completion. You can alternatively click Check my Progress to check the status.
Task 1 needs no explanation: just follow the steps. For Task 2, use the Redshift console to create the IAM role. Task 3 has a bug: just add a blank line to complete it.