Here at Import.io, we use some unique (and not so unique) terminology to describe our product. Before you dive into our world, here's some useful definitions to keep in mind.
In Import.io, you build an extractor to select what data you want to extract from a type of web page. For example, you can build one extractor to pull the Product ID, Name, and Price from a product page on Site A and another extractor to pull Reviewer, Product, Name, and Rating from a review page on Site B.
This is the act of creating and editing extractor within Import.io. To "train" an extractor, you load in a web page and then use Point-and-Click to select the data you want. When you save an extractor, Import.io uses what you trained to determine what data to extract from the page.
Import.io tracks you usage through the number of queries you use. The easiest way to think of a query is one page or URL. For example if you run through 50 product pages, that would be considered 50 queries. For interactive extractors, a set of inputs is considered one query, two searches that used two different dates would still be two queries.
Each time your extractor runs, we call this a crawl run. Each crawl run has a set of URLs (and inputs for interactive extractors) that it runs through and then returns structured data as its output, in Excel, CSV, and JSON format.
A Report in Import.io is the byproduct of one or more crawl runs. Currently you can generate Data, Change, and/or Comparison Reports. Data Reports allow you filter your column outputs. Change Reports allow you monitor changes between two crawl runs. Comparison Reports allow you to compare data between two or more extractors. All reports can be published to a shareable Report Portal.
Within Import.io you can have one extractor get the list of URLs it runs through from another extractor's output. When you set this up we say that the extractor receiving URLs is chained to the other extractor.
Listings vs Detail Pages
A listings page is a page that has a list of data, such as product links, and thus returns multiple rows per page. A details page is a page that returns data for an individual object, such as an product or business page, and thus returns a single row per page. Generally within Import.io, extractors for detail pages are chained to extractors for listings pages.