Train with Additional URLs
Train with Additional URLs improves an extractor's accuracy by training it against multiple webpages.
Training with just one webpage is often sufficient. However, the underlying structure of webpages on a website sometimes varies, even when the webpages look identical. Adding training URLs enables you to check your training against similar webpages. Import.io recommends training with multiple URLs to identify structural variations between webpages.
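To make "structural variation" concrete, here is a minimal Python sketch that compares the tag paths of two HTML snippets that would render the same way. It only illustrates the idea and is not how Import.io analyzes pages. The names tag_paths and structural_difference and the sample snippets are hypothetical.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagPathCollector(HTMLParser):
    """Records the path of tag names leading to every element in a page."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.paths = set()

    def handle_starttag(self, tag, attrs):
        self.paths.add("/".join(self.stack + [tag]))
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed tags.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass


def tag_paths(html):
    """Return the set of tag paths found in an HTML string."""
    collector = TagPathCollector()
    collector.feed(html)
    collector.close()
    return collector.paths


def structural_difference(html_a, html_b):
    """Return the tag paths that appear in only one of the two pages."""
    return tag_paths(html_a) ^ tag_paths(html_b)


# Hypothetical snippets: both show the same price, but the markup differs.
page_a = "<div><span class='price'>$10</span></div>"
page_b = "<div><p><span class='price'>$10</span></p></div>"

for path in sorted(structural_difference(page_a, page_b)):
    print(path)
```

An empty result suggests the two pages share the same structure. Any path printed marks a spot where training done on one page may not carry over to the other.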
How to Add Training URLs
- Click Train with additional URLs. The Manage Extractor URLs dialog box appears.
- Paste the URL in the Enter a URL textbox and click Go.
- Import.io analyzes the new URL against the existing training and a completion message appears.
- Click Save and Close.
- A new option appears on the editor commands bar, letting you switch between the training pages.
Retraining Your Extractor
When a webpage's underlying structure changes, refresh the cached page in the extractor training by clicking the "Refresh" button. Then use point and click to update the training against the new structure.