This 6-part tutorial is designed to guide you through building two different types of extractors in Import.io and to introduce you to the key concepts of Import.io quickly. By the end of the tutorial, you should be able to create chained extractors and get data from a list of URLs that were extracted with Import.io.
On the Import.io Dashboard, start by clicking New Extractor, then paste the URL of a business page in the Create Extractor screen, such as https://www.yelp.com/biz/great-bear-coffee-los-gatos, and finally click Go to load the page.
Once the page is loaded, Import.io will first attempt to identify any lists or microdata on the page. In this case, a table of data specific to Great Bear Coffee is presented, with columns such as Name, Rating, and Price Range.
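If microdata is unfamiliar, the following minimal Python sketch shows the general idea of reading itemprop attributes out of a page. It is not how Import.io detects data. The ItempropCollector class and the sample markup are hypothetical and heavily simplified, and the markup is made up for illustration rather than copied from the real Yelp page.

```python
from html.parser import HTMLParser


class ItempropCollector(HTMLParser):
    """Collect values of elements that carry an itemprop attribute.

    Simplified on purpose: nested items and repeated properties are
    not handled, and only the first text chunk of an element is kept.
    """

    def __init__(self):
        super().__init__()
        self.data = {}
        self._current = None  # itemprop name waiting for its text

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        prop = attrs.get("itemprop")
        if prop is None:
            return
        if "content" in attrs:
            # e.g. <meta itemprop="priceRange" content="$">
            self.data[prop] = attrs["content"]
        else:
            self._current = prop

    def handle_data(self, data):
        if self._current and data.strip():
            self.data[self._current] = data.strip()
            self._current = None

    def handle_endtag(self, tag):
        self._current = None


# Made-up markup, assumed for illustration only.
SAMPLE_HTML = """
<div itemscope itemtype="https://schema.org/LocalBusiness">
  <h1 itemprop="name">Great Bear Coffee</h1>
  <meta itemprop="priceRange" content="$">
</div>
"""

collector = ItempropCollector()
collector.feed(SAMPLE_HTML)
print(collector.data)
```

Each itemprop name becomes a candidate column, which is roughly why a page with microdata can yield a ready-made table of data points.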
To add a data point, click Add column, and then click on the page to select that data point. To name the column, click into its name, either in the hovering data column or in the list of data columns at the top of the page.
Since this is a details extractor, you might want to restrict the selected data so that it is returned as one row per page extracted, rather than as a list of data. To do this, reveal the Advanced option, then select Rows, and check that it is set to Single Row.
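To make the difference between a list of rows and a single row concrete, here is a minimal Python sketch. It is not Import.io's internal logic: the to_single_row helper, the sample values, and the choice to join differing values with a separator are all assumptions made for illustration.

```python
# Hypothetical illustration only; not Import.io's implementation.

def to_single_row(rows, separator="; "):
    """Collapse the rows extracted from one page into a single row.

    Assumption: empty values are ignored, and when a column holds
    several distinct values they are joined with `separator`.
    """
    merged = {}
    for row in rows:
        for column, value in row.items():
            if value in (None, ""):
                continue
            values = merged.setdefault(column, [])
            if value not in values:
                values.append(value)
    return {column: separator.join(values) for column, values in merged.items()}


# Made-up rows, shaped like a list-style extraction of one business page.
list_rows = [
    {"Name": "Great Bear Coffee", "Rating": "4.0", "Price Range": "$"},
    {"Name": "Great Bear Coffee", "Rating": "", "Price Range": "$"},
]

single_row = to_single_row(list_rows)
print(single_row)
```

With one row per page, the results of a details extractor line up one-to-one with the URLs you feed it, which is usually what you want when each page describes a single business.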
With the data points selected, click the Save button to save your extractor. For now, you can skip the Change Report Modal, name your extractor, and then click Save and run.
Once the extractor is saved, you will be redirected to the Dashboard, where you can preview or download the results of the run once it has completed.
With your first extractor created, you’ll want to move on to Part 2: Editing a Details Extractor.