Scrape leads from Yellowpages

Lead generation is one of the most important parts of any sales process. Yellowpages is a useful data source for companies in any industry to collect leads. In this tutorial, we will show you how to scrape leads from Yellowpages.

For Yellowpages, you can use our easy-to-use Task Templates in the template section of Octoparse. Just enter "yellowpage" in the search box, and several templates will appear for you to choose from. All you need to do is enter a few parameters, and the task is ready to go. For further details, you may check it out here: Task Templates

Please follow the steps below if you want to know how to build a task from scratch with Octoparse. We will use the URL below to scrape data such as title, address, telephone, etc.

The main steps are shown in the menu on the right, and you can download the sample task file here.


1. Create a Go to Web Page action - open the target web page

  • Enter the URL on the home page and click Start


2. Auto-detect the webpage data - create a workflow

  • Click Auto-detect webpage data and wait for the detection to complete

  • Uncheck Add a page scroll

  • Click Create workflow


If all the data you need can be scraped from the listing page, you can skip ahead to step 5, Set up wait time to slow down the scraping speed. If you want to click on each detail link to get more information, continue with step 3.

Go to Data Preview to check whether the current data output meets your needs

  • Delete unnecessary data fields by clicking "..." and Delete field

  • Modify the data field names by double-clicking the header
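Octoparse handles this cleanup through the Data Preview panel. For readers who find code easier to follow, the same idea (drop some fields, rename others) can be sketched in Python. The field names and the `clean_rows` helper below are hypothetical and only illustrate the concept; they are not part of Octoparse.

```python
def clean_rows(rows, drop=(), rename=None):
    """Return new rows with unwanted fields removed and headers renamed."""
    rename = rename or {}
    cleaned = []
    for row in rows:
        cleaned.append({
            rename.get(key, key): value   # use the new header if one is given
            for key, value in row.items()
            if key not in drop            # skip fields marked for deletion
        })
    return cleaned


# Hypothetical auto-detected output; these field names are made up.
rows = [
    {"Title": "Acme Plumbing", "Field2": "555-0100", "Image": "logo.png"},
]
cleaned = clean_rows(rows, drop={"Image"}, rename={"Field2": "Telephone"})
print(cleaned)
```

Here the `Image` field is deleted and `Field2` is renamed to `Telephone`, which mirrors the two bullet points above.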


3. Select subpage URL - to click on each detail page link

  • Choose Select subpage URL on the Tips panel

  • Choose Click on an extracted data field, then pick the field you want to click on from the drop-down menu (you can confirm it is the correct link in Data Preview)

  • Click on Confirm
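Conceptually, this step takes the link stored in one extracted field and follows it for every row. A minimal sketch of that idea in Python is shown below, assuming the listing rows are already available as dictionaries. The field name `Title_URL`, the `detail_urls` helper, and the `example.com` addresses are placeholders for illustration, not the real Yellowpages structure.

```python
from urllib.parse import urljoin


def detail_urls(rows, link_field, base_url):
    """Build absolute detail-page URLs from the chosen link field."""
    urls = []
    for row in rows:
        href = row.get(link_field)
        if href:  # skip rows where no link was extracted
            urls.append(urljoin(base_url, href))
    return urls


# Hypothetical listing rows; the link field may hold relative paths.
rows = [
    {"Title": "Acme Plumbing", "Title_URL": "/listing/acme-plumbing"},
    {"Title": "No Link Co", "Title_URL": ""},
]
urls = detail_urls(rows, "Title_URL", "https://example.com/search?q=plumbers")
print(urls)
```

Rows without a link are skipped, which is why it is worth confirming the chosen field in Data Preview first.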


4. Extract Data - extract data on the detail pages

  • Select information from the web page

  • Choose Text on the Tips panel

  • Repeat the above steps to extract all the data you need

  • Double-click on the field name to rename it if needed

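To illustrate what extracting text fields from a detail page amounts to, here is a small sketch using Python's standard `html.parser` module. The HTML snippet, its class names, and the `FieldExtractor` class are invented for this example and do not reflect the actual Yellowpages markup. The sketch also assumes flat elements with no nested tags inside the selected ones.

```python
from html.parser import HTMLParser


class FieldExtractor(HTMLParser):
    """Collect the text of elements whose class maps to a wanted field."""

    def __init__(self, class_to_field):
        super().__init__()
        self.class_to_field = class_to_field
        self.current = None  # field currently being captured, if any
        self.record = {}

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for cls in classes:
            if cls in self.class_to_field:
                self.current = self.class_to_field[cls]
                self.record[self.current] = ""

    def handle_endtag(self, tag):
        # Any closing tag ends the capture (fine for flat elements only).
        self.current = None

    def handle_data(self, data):
        if self.current:
            self.record[self.current] += data.strip()


# Hypothetical detail page; class names are assumptions for illustration.
html = """
<h1 class="biz-name">Acme Plumbing</h1>
<p class="addr">12 Main St, Springfield</p>
<a class="tel" href="tel:5550100">555-0100</a>
"""
parser = FieldExtractor({"biz-name": "Title", "addr": "Address", "tel": "Telephone"})
parser.feed(html)
print(parser.record)
```

The mapping from class name to field name plays the same role as selecting an element and then renaming its field in the steps above.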

5. Set up wait time to slow down the scraping speed

Since Yellowpages might block your IP if you send requests too quickly, we need to slow down the scraping speed.

  • Click on the Extract Data1 action

  • Tick Wait before action under Options

  • Set the wait time to 5s-10s

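The effect of a 5s-10s wait before each action can be sketched in Python as a random pause inside a loop. This is only an illustration of the idea, not how Octoparse implements it. The helper names `wait_before_action` and `run_actions` are hypothetical, and the example passes a stand-in for `time.sleep` so it finishes immediately.

```python
import random
import time


def wait_before_action(min_s=5.0, max_s=10.0, sleep=time.sleep):
    """Pause for a random time between min_s and max_s seconds."""
    delay = random.uniform(min_s, max_s)
    sleep(delay)
    return delay


def run_actions(actions, min_s=5.0, max_s=10.0, sleep=time.sleep):
    """Run each action in turn, waiting before every one."""
    results = []
    for action in actions:
        wait_before_action(min_s, max_s, sleep)
        results.append(action())
    return results


# Dry run: recording the delays instead of sleeping keeps the demo instant.
recorded = []
results = run_actions(
    [lambda: "page 1", lambda: "page 2"],  # placeholders for real page visits
    sleep=recorded.append,
)
print(results, recorded)
```

Leaving out the `sleep` argument makes the function really wait. Randomizing the delay within the range, rather than using a fixed value, makes the request pattern less regular.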

6. Run extraction - run your task and get data

  • Click Save

  • Click Run on the upper left side

  • Select Standard Mode under the Run on your device section to run the task on your computer, or choose to run the task in the Cloud (premium users only)


Here is the sample output:

[Image: sample output]