Beyond Basics
XPath, Pagination, data cleaning, anti-blocking, API, and more
60 articles
What is XPath and how to use it in Octoparse?
Use relative XPath to locate data outside a loop item
Use XPath to locate email addresses from "mailto" links on any website
Fix field issues (missing, blank or misplaced fields)
Customize element XPath
Locate elements based on nearby text ("following-sibling" function)
Set up alternative XPath
XPath Cheatsheet for Web Scraping with Octoparse
Regular Expression (Regex) Cheatsheet for Data Extraction