What is Import.io?
Import.io enables you to extract data directly from the web, commonly known as web scraping, but Import.io is so much more. Our point-and-click interface transforms websites into data with a few simple clicks, enabling you to get the data you need, even if it requires interaction or is behind a login.
How does data extraction with Import.io work?
You create an extractor and give it an example of a URL that contains the data you want to extract. Once Import.io loads the webpage, it'll present you with the data it finds or give you the option to point-and-click to identify the data on the webpage you want to collect and organize into a tabular data column structure that suits you. As you select the data you want, Import.io identifies the underlying structure of the webpage and where certain elements of data reside on the page.
What makes Import.io unique?
Import.io contains a built-in crawl service specifically designed for multiple URL querying. Import.io uses dynamic rate limiting and contains a retry system to handle errors and restrictions. The result is high quality extraction performance and success. When querying multiple webpages, the crawl service queries URLs asynchronously, each from a rotating IP address pool, making the process more efficient. If a URL fails, the URL is requeued and tried again from a different IP address. The crawl service monitors website response time, which results in a higher quality extraction by ensuring the extraction does not put too much of a load on a website.
Where to begin with Import.io?
After reading through our terminology, you'll want to sign-up for an account to go through our tutorial that help you Build Your First Extractors. If you prefer watching rather than reading along, you can find on YouTube our Getting Started with Import.io Tutorials.