Scrape product reviews from Amazon

Learn how to use auto-detect to scrape review data

Updated this week

Product reviews are a good resource for improving your product performance. In this tutorial, we will show you how to scrape product reviews from Amazon.com.

To scrape Amazon reviews, you can use the ready-to-use Template in our template gallery or follow this tutorial to build a custom task from scratch.

With a Custom Task, you can get at most 100 reviews due to the website's limitation. If you want to get more reviews, please use our Amazon Reviews Scraper.


The main steps are listed in the menu on the right, and you can access the sample task here.


1. Create a Go to Web Page - to open the target web page

  • Paste the URL and click Start


2. Set up Pre-Login - to bypass the website limitation

Amazon only shows all of a product's reviews to logged-in users, so you need to log in to your Amazon account in advance (pre-login) before scraping the reviews.

There are two ways that you can set up the pre-login.

The first method is Use cookie, which saves you from going through the login steps every time you run the task.

  • Click on Auto Login

  • Click on Use cookie

  • Enter your account and password in the window and log in to your account

  • Click on Set up Auto Login on the bottom right to save your account cookies into the workflow

The second method is to enter your account and password in the workflow, so the login is repeated every time you run the task. You can find more information in this tutorial: Scrape data behind a login

Note: Saving cookies is important to avoid being blocked by Amazon. Without the cookies, the task will fail to load the next page of reviews.
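
To illustrate the idea behind Use cookie, here is a minimal Python sketch of saving session cookies once and reusing them on later runs. It is not how the tool stores cookies internally; the function names, the JSON file, and the cookie names and values are all assumptions made for illustration.

```python
import json
import tempfile
from pathlib import Path


def save_cookies(cookies: dict[str, str], path: Path) -> None:
    """Write session cookies to a JSON file so later runs can reuse them."""
    path.write_text(json.dumps(cookies), encoding="utf-8")


def load_cookie_header(path: Path) -> str:
    """Read saved cookies and format them as an HTTP Cookie header value."""
    cookies = json.loads(path.read_text(encoding="utf-8"))
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


if __name__ == "__main__":
    # Hypothetical cookie names and values, for illustration only.
    demo = {"session-id": "abc123", "session-token": "xyz789"}
    with tempfile.TemporaryDirectory() as tmp:
        cookie_file = Path(tmp) / "cookies.json"
        save_cookies(demo, cookie_file)   # done once, after logging in
        print(load_cookie_header(cookie_file))  # reused on every later run
```

Because the cookies are read back from storage, the login form does not have to be filled in again on each run, which is the benefit the first method describes.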


3. Create a Click Item - to see all reviews

  • Scroll down the page to find the See all reviews button

  • Click on it and choose Click URL


4. Auto-detect webpage - to create the workflow

  • Select Auto-detect web page data

  • Wait for the detection to complete -> uncheck Add a page scroll -> click Create workflow


5. Adjust AJAX timeout for Pagination

  • Click on Click to Paginate and set the AJAX Timeout to 10s

  • Click Apply
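
The timeout gives each new page of reviews time to load before the task moves on. As a rough sketch of that idea in Python (a generic illustration, not the tool's actual implementation), the code below polls for a page to finish loading for up to 10 seconds and stops paginating once 100 reviews are collected. The helpers `read_reviews`, `click_next`, and `page_loaded` are hypothetical callables standing in for browser actions.

```python
import time
from typing import Callable, List


def wait_for(check: Callable[[], bool], timeout_s: float = 10.0,
             interval_s: float = 0.2) -> bool:
    """Poll check() until it returns True or timeout_s has elapsed."""
    deadline = time.monotonic() + timeout_s
    while True:
        if check():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


def paginate(read_reviews: Callable[[], List[str]],
             click_next: Callable[[], bool],
             page_loaded: Callable[[], bool],
             timeout_s: float = 10.0,
             max_reviews: int = 100) -> List[str]:
    """Collect reviews page by page, up to max_reviews."""
    reviews = list(read_reviews())
    # click_next() returns False when there is no next page.
    while len(reviews) < max_reviews and click_next():
        if not wait_for(page_loaded, timeout_s):
            break  # the next page did not load within the timeout
        reviews.extend(read_reviews())
    return reviews[:max_reviews]


if __name__ == "__main__":
    # Fake in-memory "site" with 3 pages of 2 reviews each, for illustration.
    pages = [["r1", "r2"], ["r3", "r4"], ["r5", "r6"]]
    state = {"page": 0}

    def read_reviews() -> List[str]:
        return pages[state["page"]]

    def click_next() -> bool:
        if state["page"] + 1 >= len(pages):
            return False
        state["page"] += 1
        return True

    print(paginate(read_reviews, click_next, page_loaded=lambda: True))
```

A timeout that is too short makes the loop give up before the next page appears, which is why a longer value such as 10s is a safer choice for pages loaded via AJAX.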


6. Check the data and workflow

  • Go to Data Preview (Extract Data) to check the current data output. Double-click on a header to rename it, or click "..." to edit a data field

Here's what the final workflow looks like. Once everything is in place, you can go ahead and run the task.


7. Run task to extract data

  • Click Run on the top right corner

  • Click Run on your device to run the task on your local device, or select Run in the Cloud to run the task in the Cloud (for premium users only)

Here is the sample output:
