Before You Begin
This tutorial applies to the latest Octoparse version. For optimal performance, upgrade now if you're using an older release.
The Problem: Misdirected Data
When running tasks (locally or in the cloud), you might encounter:
Data extracted to the wrong columns
Missing data fields
Root Cause:
Faulty XPath expressions that fail to consistently locate target elements across pages.
Example:
This is the data we expected:
This is the actual output. Note that not all the highlighted data is being extracted correctly.
The Solution: XPath Correction
Step 1: Write a Robust XPath
Learn XPath fundamentals with our guide:
🔗 What is XPath and How to Use It in Octoparse?
Step 2: Update the XPath in Your Task
Click More (···) next to the problematic data field
Select Customize XPath
Replace the existing XPath with your new expression
Click Save
Step 3: Validate with Test Run
Always test updated tasks with Preview before full execution.
Tips
✔ Use relative XPath (not absolute) for dynamic pages
✔ Bookmark our XPath cheatsheet for common scenarios