This tutorial is written for the latest version of Octoparse. If you are running an older version, we strongly recommend upgrading: the latest version is faster, easier to use, and more robust.
Google Play hosts a large catalog of applications, each with its own detail page. In this tutorial, we will scrape the basic information of applications listed on Google Play.
Alternatively, you can use the ready-made Task Template on the Octoparse home screen: enter a few parameters and the task is ready to run. For further details, see Task Templates.
To follow along, you may want to use this URL in the tutorial:
We will use Octoparse to scrape the detail page URL, application name, author name, and rating of each application.
The main steps are shown in the menu on the right, and you can download the sample task file here.
1. Go To Web Page - to open the target web page
Enter the page URL on the home screen and click Start
2. Auto-detect the web page data - create the workflow
Click Auto-detect the web page data
Wait for the detection to complete
Untick Add a page scroll and click Create workflow in the Tips panel
Check the data fields in the Data Preview section. If needed, delete unwanted fields, or rename a field by double-clicking its header
3. Start Extraction - to run your task and get data
The final workflow should look like this:
Click Save
Click Run on the upper right side
Select Run on your device to run the task on your computer, or select Run in the Cloud to run the task in the Cloud (for premium users only)
Here is the sample output.
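Once the data is exported, a short script can tidy it before further use. The following is a minimal Python sketch, assuming the export is a CSV file whose columns hold the four fields scraped in this tutorial. The column names (Detail_URL, App_Name, Author, Rating) and the sample rows are hypothetical placeholders, not output produced by Octoparse, so adjust them to the headers in your own file.

```python
import csv
import io

# Hypothetical column names; match them to the headers in your own export.
URL_COL, NAME_COL, AUTHOR_COL, RATING_COL = "Detail_URL", "App_Name", "Author", "Rating"


def parse_rating(text):
    """Return the rating as a float, or None when the cell is empty or not numeric."""
    try:
        return float(text.strip())
    except (AttributeError, ValueError):
        return None


def load_apps(csv_file):
    """Read exported rows, skip duplicate detail page URLs, and parse ratings."""
    seen = set()
    apps = []
    for row in csv.DictReader(csv_file):
        url = (row.get(URL_COL) or "").strip()
        if not url or url in seen:
            continue  # skip rows without a URL and rows already seen
        seen.add(url)
        apps.append({
            "url": url,
            "name": (row.get(NAME_COL) or "").strip(),
            "author": (row.get(AUTHOR_COL) or "").strip(),
            "rating": parse_rating(row.get(RATING_COL)),
        })
    return apps


# Made-up placeholder rows standing in for an exported file.
SAMPLE = """Detail_URL,App_Name,Author,Rating
https://example.com/app-a,Example App A,Example Studio,4.5
https://example.com/app-b,Example App B,Another Studio,
https://example.com/app-a,Example App A,Example Studio,4.5
"""

apps = load_apps(io.StringIO(SAMPLE))
for app in apps:
    print(app["name"], app["rating"])
```

To run this against a real export, open the file with `open(path, newline="", encoding="utf-8")` and pass the file object to `load_apps` in place of the in-memory sample.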