This tutorial covers the latest version of Octoparse. If you are running an older version, we strongly recommend upgrading, because the latest version is faster, easier to use and more robust. Download and upgrade here if you haven't already done so.
WhoScored is a popular football website that provides live scores, match results and player ratings from the top football leagues and competitions.
In this tutorial, we are going to show you how to scrape player info from WhoScored with the help of Octoparse.
To follow along, you may want to use this URL in the tutorial:
The main steps are shown in the menu on the right, and you can download the sample task file here.
1. Create a Go to Web Page step - to open the target website
Enter the target URL into the search box at the center of the home screen
Click Start to create a new task in Advanced Mode
2. Auto-detect web page data - to generate a workflow
Octoparse's auto-detect function generates a workflow automatically. You can then modify the generated workflow as needed.
Click on Auto-detect web page data and wait for the detection to complete
Untick Add a page scroll
Click Create workflow
The workflow is then generated as shown below:
Click More and delete field to remove any unwanted data fields
Double-click a field header to rename it
3. Run the task - to get the desired data
Click the Save button first to save all the settings you have made
Then click Run (you can choose either Run on your device or Run in the cloud)
Select Run on your device and click Standard mode to run the task on your local device
Wait for the task to complete
Here is the sample output data, which can be exported in Excel, CSV, HTML and JSON formats.
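Once the data is exported as CSV, it can be processed outside Octoparse. The sketch below is one possible way to do that in Python, not part of Octoparse itself: it reads a CSV export and picks the highest-rated rows. The column names (Player, Team, Rating) and the sample rows are placeholders assumed for illustration; your export will use whatever field headers you set in the workflow.

```python
import csv
import io


def load_rows(csv_file):
    """Read an exported CSV (any text file object) into a list of dicts."""
    return list(csv.DictReader(csv_file))


def top_by_rating(rows, rating_field="Rating", n=3):
    """Return the n rows with the highest numeric rating.

    Rows whose rating is missing or not a number are skipped.
    The default field name "Rating" is an assumption; pass your own header.
    """
    rated = []
    for row in rows:
        try:
            rated.append((float(row[rating_field]), row))
        except (KeyError, ValueError, TypeError):
            continue  # missing or non-numeric rating: skip this row
    rated.sort(key=lambda pair: pair[0], reverse=True)
    return [row for _, row in rated[:n]]


# Placeholder data standing in for an exported file (not real WhoScored data).
SAMPLE_EXPORT = """Player,Team,Rating
Player A,Team X,7.52
Player B,Team Y,
Player C,Team Z,8.10
"""

rows = load_rows(io.StringIO(SAMPLE_EXPORT))
best = top_by_rating(rows, n=2)
for row in best:
    print(row["Player"], row["Rating"])
```

To use this with a real export, open the file and pass it to load_rows, for example with open("whoscored_players.csv", newline="", encoding="utf-8-sig"), where the file name is hypothetical and the utf-8-sig encoding tolerates a byte-order mark if the file has one.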
Note: Local runs are well suited to quick runs and small amounts of data. If you are dealing with more complicated tasks or large amounts of data, Run in the Cloud is recommended for higher speed. You are welcome to try this premium feature by signing up for the 14-day free trial here. Tasks can be scheduled to run hourly, daily, or weekly, with the data delivered on a regular basis.