Scrape detailed football statistics from WhoScored
Updated over a week ago

You are browsing a tutorial guide for Octoparse's latest version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier and more robust! Download and upgrade here if you haven't already done so!

WhoScored is a popular football website that provides live scores, match results and player ratings from the top football leagues and competitions.

In this tutorial, we are going to show you how to scrape player info from WhoScored with the help of Octoparse.


To follow along, you may want to use this URL from the tutorial:

The main steps are shown in the menu on the right, and you can download the sample task file here.


1. Create a Go to Web Page - to open the target website

  • Enter the target URL into the search box at the center of the home screen

  • Click Start to create a new task in Advanced Mode


2. Auto-detect web page data - to generate a workflow

Octoparse's auto-detect function generates a workflow automatically. You can then modify the generated workflow as needed.

  • Click on Auto-detect web page data and wait for the detection to complete

  • Untick Add a page scroll

  • Click Create workflow

Octoparse then generates the workflow.

  • Click More, then delete field, to remove unwanted data

  • Double-click a field header to edit it


3. Run the task - to get the desired data

  • Click the Save button first to save all the settings you have made

  • Then click Run (you can choose either Run on your device or Run in the cloud)

  • Select Run on your device and click Standard mode to run the task on your local device

  • Wait for the task to complete

Once the run finishes, the output data can be exported in Excel, CSV, HTML, and JSON formats.
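
If you export the data as CSV, you may want to tidy it up outside Octoparse. The following is a minimal Python sketch of one way to do that, using only the standard library. The column names "Player" and "Rating" and the sample rows are hypothetical placeholders, not actual WhoScored output; replace them with the headers you set in your own task.

```python
import csv
import io
import json

# Hypothetical column names; replace them with the headers you set in Octoparse.
KEEP_COLUMNS = ["Player", "Rating"]


def load_rows(csv_text, keep=KEEP_COLUMNS):
    """Parse exported CSV text and keep only the selected columns."""
    # Strip a leading byte order mark in case the export starts with one.
    reader = csv.DictReader(io.StringIO(csv_text.lstrip("\ufeff")))
    rows = []
    for record in reader:
        # Missing columns become empty strings instead of raising an error.
        rows.append({name: (record.get(name) or "").strip() for name in keep})
    return rows


def to_json(rows):
    """Serialize the rows as pretty-printed JSON."""
    return json.dumps(rows, ensure_ascii=False, indent=2)


if __name__ == "__main__":
    # Placeholder data for illustration only, not real scraped output.
    sample = "Player,Rating,Extra\nPlayer A,7.5,x\nPlayer B,6.9,y\n"
    print(to_json(load_rows(sample)))
```

To use it with a real export, read the file's text (for example with `open(path, encoding="utf-8").read()`) and pass it to `load_rows`. All values are kept as strings, so convert ratings to numbers yourself if you need to sort or average them.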

Note: Local runs are well suited to quick runs and small amounts of data. For more complicated tasks or large amounts of data, Run in the cloud is recommended for higher speed. You are welcome to try this premium feature by signing up for the 14-day free trial here. Tasks can be scheduled to run hourly, daily, or weekly, with the data delivered on a regular basis.
