Product reviews are a good resource for improving your product performance. In this tutorial, we will show you how to scrape product reviews from Amazon.com.
For Amazon reviews scraping, you can use our ready-to-use Template available in our template gallery or follow this tutorial to build a custom task from scratch.
With Custom Task, you can only get at most 100 reviews due to the website limiation. If you want to get more reviews, please use our Amazon Reviews Scraper.
The main steps are listed in the menu on the right, and you can access the sample task here.
1. Create a Go to Web Page - to open the target web page
Paste the URL and click Start
2. Set up Pre-Login - to bypass the website limitation
Due to Amazon's restrictions policy, users will need to pre-login to their account to view all the reviews of the products. Therefore, you will need to pre-login to your Amazon account before scraping the reviews from the products.
There are two ways that you can set up the pre-login.
The first method will be Use cookie, which will help you reduce the steps to log-in to your account every time you are running the task.
Click on Auto Login
Click on Use cookie
Enter your account & password into the windows and log-in to your account
Click on Set up Auto Login on the bottom right to save your account cookies into the workflow
The second method will be entering your account number and password into the workflow every single time you run the task. You can find out more useful information from this tutorial here: Scrape data behind a login
Note: Saving cookies is important to avoid being blocked by Amazon. Without the cookies, the website will fail to get to the next page of reviews.
3. Create a Click Item - to see all reviews
Scroll down the page to find the See all reviews button
Click on it and choose Click URL
4. Auto-detect webpage - to create the workflow
Select Auto-detect web page data
Wait for the detection complete ->uncheck Add a page scroll -> Create workflow
5. Adjust AJAX timeout for Pagination
Click on Click to Paginate - adjust Timeout to 10s
Click Apply
6. Check the data and workflow
Go to Data Preview (Extract Date) to check the current data output. Double-click on the header to rename it, or click "..." to edit a data field
Here's what the final workflow looks like. Once everything is in place, you can continue to run the task
7. Run task to extract data
Click Run on the top right corner
Click Run on your device to run the task on your local device, or select Run in the Cloud to run the task in the Cloud (for premium users only)
Here is the sample output: