• CAN YOU UNDERSTAND YOUR CUSTOMER SENTIMENT?


    Overview
    Awesome Products Inc. is an e-commerce company selling a wide range of consumer products. Customers who purchase products from Awesome Products Inc. are provided with an option to add reviews and ratings for the products they have bought to help other customers find the right quality products. As the company has grown exponentially in the recent times, each product has a huge number of reviews which are uploaded to an Amazon S3 bucket.

    The company management wants to understand how the products are doing in the market and if they are meeting the customer needs. However, it is not possible for them to analyze the reviews manually.

    You have been recently hired as a Data Analytics Engineer to derive the customer’s sentiment from the product reviews which are available in the S3 bucket. The analyzed data needs to be stored into an Amazon Redshift table to help the company management to perform historical analysis and reporting.

    Task Architecture
    Task Architecture

    Challenge Progress
    Task Status Possible Points Clue Deduction Points Total Earned
    Completed! 32 0 32
    Completed! 16 0 16
    Completed! 32 0 32
    Total Points Earned: 80

    Task 1: Analyze Sentiment for each review
    Possible Points: 32 Clue Penalty: 0 Points Earned: 32
    Enter answer here
    Completed!
    Background
    Analytics team needs to consolidate the customer reviews and perform sentiment analysis. As you have been tasked to derive the sentiment insights from the product reviews you need to run an Amazon Comprehend analysis job which creates an output file with the analysis data.

    Your task
    A text file containing customer reviews for a product is available in S3 bucket name prefixed with s3productreview . Use this as an input file to create the output file having sentiment details for each product review using the Amazon Comprehend analysis job. The output file should be created in the same S3 bucket. Once the output file is created, the task is complete.

    Inventory
    Your AWS account is provisioned with the following:
    S3 bucket prefixed with: s3productreview
    Product review file within the S3 bucket
    Minimum permissions required for you to complete this task
    Getting Started
    Navigate to the S3 console and identify the bucket and file required for this task. Copy the S3 URI of the file and move on to Amazon Comprehend Console to create a sentiment Analysis Job with the identified S3 file as input.

    For Output data, specify the same S3 bucket value without any object name.

    Services involved
    Amazon S3
    Amazon Comprehend
    Task Validation
    Once the Amazon Comprehend analysis job is completed successfully, enter the analysis job name in the field located at the the top of this page which says Enter answer here and click submit to get the credit.

    Task 2: Get Ready to load the data to database with an IAM Role
    Possible Points: 16 Clue Penalty: 0 Points Earned: 16
    Completed!
    Background
    Congratulations on successfully creating the sentiment analysis job. The team is happy as we are one step closer to getting the results. The Amazon Comprehend job is now complete and the output is available in the same S3 bucket. The analyzed data needs to be loaded into an Amazon Reshift table to help the company management to perform historical analysis and reporting. Before loading the output data into the Redshift Cluster, the necessary permissions for the cluster to access the data in S3 bucket needs to be in place.

    Your task
    Create an IAM role which will allow the Redshift cluster to access S3 objects. Don’t forget to associate the role to the Redshift cluster paving way for the next task. Please ignore errors in the Amazon Redshift console.

    Inventory
    Your AWS account is provisioned with the following:
    S3 bucket prefixed with: s3productreview
    Redshift Cluster
    Minimum permissions required for you to complete this task
    Check the output properties tab on the left for the Redshift cluster identifier details.

    Getting Started
    Navigate to the IAM Console and create an IAM role for the Redshift Service with necessary privileges to access your S3 bucket.

    Services involved
    Amazon Redshift
    AWS IAM
    Task Validation
    Once the IAM role is created with the necessary privileges and associated to the Redshift cluster, the task will be automatically validated for completion. You can alternatively click Check my Progress to check the status.

    Task 3: Copy sentiment analysis output to redshift
    Possible Points: 32 Clue Penalty: 0 Points Earned: 32
    Completed!
    Background
    Great work on creating the necessary permissions as well! One more step closer to the solution! Now we have the Amazon Comprehend sentiment analysis output file as well as the necessary privileges to load the same in the Redshift table. All that is left is to copy the data into the Redshift table.

    Your task
    The output from the sentiment analysis job is unzipped and available as a json file in the output folder under the folder prefixed with -SENTIMENT-*** in the same S3 bucket. Use the COPY command to load the unzipped Amazon Comprehend json output file into the Redshift table feedback.product_feedback.

    Inventory
    Your AWS account is provisioned with the following:
    Redshift cluster with table feedback.product_feedback
    Unzipped output json file within the S3 bucket with prefix s3productreview (Navigate inside the s3 bucket folders until you reach the output/ folder)
    Minimum permissions required for you to complete this task
    Check the output properties tab on the left side to get the database name and username of the Redshift cluster.

    Getting Started
    Navigate to the Redshift console and use Query editor v1 (v1 is denoted as just “Query editor” )to connect to the database using Temporary credentials method.

    Services involved
    Amazon Redshift
    Task Validation
    Once the file is loaded successfully with the COPY command, the task will be automatically validated for completion. You can alternatively click Check my Progress to check the status.

    There is no need to explain the first question. Just follow the steps. The second question needs to use redshift to create Iam. The third question has a bug. Just add a blank line to complete this question

  • 相关阅读:
    【Rust】快速教程——从hola,mundo到所有权
    dockerfile来构建自己的docker镜像
    [题]宝物筛选 #单调队列优化
    定向模糊测试aflgo中的能量调度
    Servlet学习之Filter
    [HFCTF2020]EasyLogin-1|JWT身份伪造
    【CPP】数据结构
    性能测试:测试常见的指标(超详细~)
    Spring框架系列(1) - Spring和Spring框架组成
    HashMap、HashTable和ConcurrentHashMap之间的区别
  • 原文地址:https://blog.csdn.net/Tzwf01/article/details/126373382