I have what might be a fairly typical issue.
Many of the pages I am scraping have tables which have a header row telling the sport that a coach represents. This is an example URL: http://www.colby-sawyerathletics.com/staff.aspx. I have included the screen shot.
I need to read the sport the coach represents in the header row, and then include it in the individual table rows that I am scraping. Is there any way within import.I/O that you can set a variable, included it on the table rows, and then reset the variable when you encounter another header row?
Is there a better way to do this?
Please sign in to leave a comment.