Image Surfer Pro Toolbar

Processing Webpages
Directed Search

The intended use of a Directed Search is to quickly process the Image Galleries linked to by a Thumbnail Post.

With this purpose in mind, Image Surfer Pro only crawls webpage links in a specific way and cannot be used as a general web spider to crawl the entire World Wide Web. At most, a search will crawl three pages deep. While this may seem to be a limitation, it keeps searches constrained to a reasonable processing time and efficiently finds the image content for the intended use.

A Directed Search is done in two stages. First, the pages are searched for image content. Once all of the pages have been searched, the image content found is added to the fusker collection.

Searching For Images

The search for image information builds three lists. A page search list is maintained during the process to ensure pages are only searched once and to keep track of how far the page being searched is from the original page. The primary goal of the search is to find Direct Image Links. A list of unique Direct Image Links is maintained throughout the search for addition to the fusker collection. In some cases the desired image content is never found in a Direct Image Link; instead it is embedded into the pages. The third list contains unique image information found embedded on some of the searched pages.
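The three lists described above can be pictured with a small Python sketch. This is only an illustration of the bookkeeping, not Image Surfer Pro's actual implementation; the class name, field names, and the numeric level scheme (1 for the first page, up to 3) are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SearchState:
    """Hypothetical container for the three lists built during a search."""

    # Page search list: page URL -> level (1 = first page, 3 = deepest).
    pages: Dict[str, int] = field(default_factory=dict)
    # Unique Direct Image Links found so far.
    direct_links: List[str] = field(default_factory=list)
    # Unique images found embedded in searched pages.
    embedded_images: List[str] = field(default_factory=list)

    def add_page(self, url: str, level: int) -> bool:
        """List a page once; refuse repeats and anything deeper than level 3."""
        if url in self.pages or level > 3:
            return False
        self.pages[url] = level
        return True

    def add_direct_link(self, url: str) -> None:
        """Record a Direct Image Link, keeping the list free of duplicates."""
        if url not in self.direct_links:
            self.direct_links.append(url)

    def add_embedded_image(self, url: str) -> None:
        """Record an embedded image, keeping the list free of duplicates."""
        if url not in self.embedded_images:
            self.embedded_images.append(url)
```

Storing the level next to each page is what lets the search decide, page by page, whether new links may still be added.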

The way the lists are built and maintained depends on the "level" of the page being searched. The following sections describe how the search progresses through the page levels.

The First Page

Though you do not have to start from a Thumbnail Post, having a graphic to use in the example will help explain the process. Say each small image on the thumbnail post is a link to an Image Gallery. The Image Galleries are hosted by different domains.

Graphical representation of a thumbnail post

Using both the ISP Forms and Process Page buttons on the Image Surfer Pro toolbar:
A Directed Search starts by creating an ISP Form. Image Surfer Pro requires this step only as a safety measure. Even clean and well-trusted pages may have links you would not want to follow, for example links to bookmark the page or to use the page as your home page. Creating an ISP Form first allows you to choose which links are searched by using the "Search page for more images" check box for each Image Gallery you wish to search. If the links you wish to search outnumber those you wish to ignore, using the "Search All" boxes at the top of the form may make it easier.

If you choose to add image content or direct image links to your fusker collection by selecting the "Add Image" check boxes, those image references will be added to the collection before subsequent pages are searched for image content.

Second Level of Pages

Internet Explorer will navigate to each of the search links you selected in the ISP Form created from the first page, in the order they were listed in the form from top to bottom. Once a search page is loaded, Image Surfer Pro will process the page looking for image content. Image Surfer Pro processes second-level pages in one of three ways, based on the type of content and links it finds on the pages.

Graphical representation of an Image Gallery

Direct Image Links:
The most common type of Image Gallery provides Direct Image Links from each thumbnail to the actual image files. Image Surfer Pro will pull the Direct Image Links from the gallery page and add them to the list of images. No new links are added to the search list, because the desired image content has been found.

Image Page Links:
Another common type of Image Gallery links the thumbnails to additional webpages, each with the desired image embedded. This provides another page where advertisements and such can be placed. If no Direct Image Links are found, Image Surfer Pro will assume the image content will be found on the 3rd level pages referenced by images on the page. Each link referenced by an image will then be inserted into the search list.

Non Image Gallery Pages:
If Image Surfer Pro does not find any Direct Image Links on the page and does not find any new links to search referenced from images on the page, it will assume the desired image content is actually on this page and will add any image found embedded on the page to a list of possible images of interest, essentially treating the page as an Image Page.
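The three cases above amount to a simple decision order, sketched here in Python. This is a hedged illustration only: the function names, the input shape, and the idea of recognizing a Direct Image Link by file extension are assumptions, since the text does not say how Image Surfer Pro recognizes image files. The same-domain restriction on new pages is left out of this sketch.

```python
import posixpath
from urllib.parse import urlparse

# Assumed extension list; the real recognition rule is not documented here.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".gif", ".png", ".bmp"}


def is_direct_image_link(href):
    """Guess whether a link points straight at an image file."""
    extension = posixpath.splitext(urlparse(href).path)[1].lower()
    return extension in IMAGE_EXTENSIONS


def process_second_level(links, embedded_images):
    """Classify a second-level page and report what it contributes.

    links: list of (href, wraps_image) pairs; wraps_image is True when the
           link is referenced from an image such as a thumbnail.
    embedded_images: URLs of images embedded in the page itself.
    """
    result = {"direct_links": [], "new_pages": [], "possible_images": []}

    # Case 1: Direct Image Links found, so nothing else is searched.
    direct = [href for href, _ in links if is_direct_image_link(href)]
    if direct:
        result["direct_links"] = direct
        return result

    # Case 2: no direct links, but images link to further pages.
    image_pages = [href for href, wraps_image in links if wraps_image]
    if image_pages:
        result["new_pages"] = image_pages
        return result

    # Case 3: treat the page itself as an Image Page.
    result["possible_images"] = list(embedded_images)
    return result
```

The order of the checks matters: a page that offers even one Direct Image Link never adds third-level pages to the search list.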

Third Level of Pages

While searching the second set of pages, Image Surfer Pro may have inserted additional pages into the search list. This is done if it could find no Direct Image Links on the page but did find images which referenced other pages. Image Surfer Pro attempts to limit the number of these additional pages by not crossing domains. Only links which are found to be referenced by images and are in the same URL path as the referencing page will be added to the search list. In some cases this may cause Image Surfer Pro to miss some content you were expecting it to find, but it prevents a significant amount of wasted search time.
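One way to picture the "same URL path" restriction is the check below. It is a sketch under an assumption: here "same URL path" is read as "same host, and located in the referencing page's directory or below it". The exact rule Image Surfer Pro applies is not spelled out in this document, and the function name is hypothetical. Both URLs are expected to be absolute.

```python
import posixpath
from urllib.parse import urlparse


def in_same_url_path(page_url, link_url):
    """Return True if link_url is on the same host and under the page's directory."""
    page, link = urlparse(page_url), urlparse(link_url)

    # Never cross domains.
    if page.netloc.lower() != link.netloc.lower():
        return False

    # Directory of the referencing page, always ending in "/".
    page_dir = posixpath.dirname(page.path)
    if not page_dir.endswith("/"):
        page_dir += "/"

    return link.path.startswith(page_dir)
```

For example, a gallery at `http://gallery.example/set1/index.html` would accept `http://gallery.example/set1/img01.html` but reject both `http://ads.example/set1/x.html` and `http://gallery.example/other/x.html`.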

Graphical representation of an Image Page

When processing this third level of webpages, Image Surfer Pro is expecting to find an Image Page: a webpage where the primary content is a single image. However, it will still add any Direct Image Links to the list of images found.

No links will be added to the search list when processing these pages.

If no Direct Image Links are found on these pages, Image Surfer Pro will add any images embedded in the pages to the list of possible images.

What The Search Looks Like

As each page is searched it is displayed in the browser window. Thus the search also works as a slide show of the pages. You may configure a pause in the viewing of these pages in your user preferences on the Views Tab.

As the pages flip by, an interactive Image Surfer Pro task progress window is shown at the top right corner of your primary screen. This progress window can be moved, and you can continue to work in other applications or even other Internet Explorer windows while the search runs.

Image Surfer Pro interactive progress window showing a Directed Search in progress

This progress window provides some very useful information, including what operation is being performed and which page it is being performed on. The total number of pages which have been searched along with the current count of total pages to search is provided along with the number of Direct Image Links found and the number of possible images found embedded on the pages.

While the Elapsed Time counter is quite accurate, the Estimated Time Remaining may be less so. The estimate is based upon the most recently processed pages, and both load time and processing time may vary greatly between pages. Since the number of pages to be searched may also grow, the estimate of time remaining may suddenly become longer, and the percentage of the progress bar completed may seem to move backwards at times during the search.
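The behavior described above falls out naturally from any estimate built on recent page times. The sketch below is an assumption about how such an estimate could be computed (a rolling average over the last few pages); the class name, the window size, and the formula are illustrative and not taken from Image Surfer Pro.

```python
from collections import deque


class ProgressEstimator:
    """Hypothetical estimator based on the most recently processed pages."""

    def __init__(self, window=5):
        # Only the last `window` page times influence the estimate.
        self.recent = deque(maxlen=window)
        self.done = 0

    def record_page(self, seconds):
        """Note how long one page took to load and process."""
        self.recent.append(seconds)
        self.done += 1

    def remaining_seconds(self, total_pages):
        """Average recent page time multiplied by the pages still to search."""
        if not self.recent:
            return None
        average = sum(self.recent) / len(self.recent)
        return average * max(total_pages - self.done, 0)

    def percent_complete(self, total_pages):
        """Share of the currently known search list that has been processed."""
        return 100.0 * self.done / total_pages if total_pages else 0.0
```

If two pages have been processed out of 10, the bar reads 20 percent; if the search list then grows to 20 pages, the same two pages are only 10 percent, which is why the bar can appear to move backwards.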

Cut out of the stopping button for the interactive progress window

You may stop the search at any time by clicking the stop button on the progress window. When you click the stop button, it will change to a disabled "Stopping" button. The search will stop after processing the next search page. When the search is stopped, Image Surfer Pro will begin adding the image content found prior to the stop command.
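The delayed stop can be understood as a flag that is only checked between pages. The following Python sketch illustrates that pattern; the function and parameter names are hypothetical and this is not the product's actual code.

```python
import threading


def run_search(search_list, process_page, stop_requested):
    """Process pages in order until the list ends or a stop is requested.

    stop_requested is a threading.Event set by the user interface. It is
    checked only after a page has been fully processed, so a page that is
    already in progress always finishes.
    """
    processed = []
    for url in search_list:
        process_page(url)
        processed.append(url)
        if stop_requested.is_set():
            break
    return processed
```

Whatever was collected from the pages in `processed` remains available for the adding stage, which matches the description that content found prior to the stop command is still added.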

Adding Images

After the search for image content has completed or been stopped, the image content stored in the Direct Image Link list and the Embedded Image list will be added to the fusker collection.

Screen capture of the Image Surfer Pro interactive progress window adding direct image links

Direct Image Links found in the search are automatically added to the fusker collection as that was the primary purpose of running the search. This image information is added without checking the size of the image. You may use the Stop button to skip some or all of these images and move on to adding the Embedded Images.


Screen capture of the Image Surfer Pro interactive progress window adding embedded image links

Embedded images may or may not contain information you originally intended to find in your search. It is not uncommon for a large search to find banner ads, headers, buttons, and assorted other graphics. In some cases data displayed as an image on the pages may not even link to a file which could be directly accessed. To limit this clutter in your fusker collection, Image Surfer Pro will compare the file size of each embedded image to your user preference. The {Min image file size in Kbytes for auto collection add} configuration on the Processing Tab allows you to tune how large embedded images need to be before they are added to your fusker collections.
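The size check for embedded images can be sketched as a simple filter. Several details here are assumptions made for the example: that 1 Kbyte is counted as 1024 bytes, that an image exactly at the threshold is kept, and that images whose size cannot be determined are skipped. The function name is hypothetical.

```python
def filter_embedded_images(images, min_kbytes):
    """Keep embedded images whose file size meets the configured minimum.

    images: iterable of (url, size_in_bytes) pairs; size_in_bytes is None
            when the image does not resolve to a directly accessible file.
    min_kbytes: the user preference, in Kbytes.
    """
    threshold = min_kbytes * 1024  # assumed conversion to bytes
    return [
        url
        for url, size in images
        if size is not None and size >= threshold
    ]
```

With a preference of 20 Kbytes, a 3 KB banner and a button of unknown size would be dropped, while a 150 KB photo would be added to the collection.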


When All Is Done

Pop up window when your Directed Search processing completes

Once all of the pages have been searched and all of the images have been added to your fusker collection, this popup window will let you know the processing has completed and the browser will return to the original ISP Form where you started the Directed Search.

You will notice that most or all of the pages searched are not in the "back history" of the browser. They do, however, exist in the browser history, and you will need to clear your browser history to remove them from there.

Related User Preferences:

Image of User Preferences Dialog with the General tab selected - nothing highlighted
Image of User Preferences Dialog with the Processing tab selected - Directed Search Configuration and Auto Optimize Configuration highlighted
Image of User Preferences Dialog with the Processing tab selected - Auto select search options highlighted
Image of User Preferences Dialog with the Views tab selected - Directed Search pause highlighted


Processing Tab:
Directed Search: The configurations available for Directed Search help you fine-tune how the searches are performed and which images will be added to the fusker collection by the Directed Search.

{Min image file size in Kbytes for auto collection add} is used to determine which Embedded Image Links found on the search pages are added to the fusker collection.

When your browser is requested to navigate to a new page, the time it takes to load the page can be different depending upon how busy the hosting server is, your internet connection speed, and capabilities of your computer. {Page time out in Seconds} allows you to set the maximum time you wish to wait for a page to load. If the page does not finish loading in the configured time you will be asked if you wish to continue waiting or not.

Forms Tab:
Choosing Links To Search: These two configurations correspond to the two "Search All" check boxes on the ISP Form. You can choose to automatically check one or both by default when ISP Forms are generated. They correspond to the two check boxes in the Directed Search configurations of the Processing tab, but allow you to set the default set of links for when an ISP Form is used to initiate the Directed Search.

Views Tab:
Directed Search: The {Pause in seconds between search pages} configuration allows you to customize the slide show effect of watching a Directed Search in progress.

Differences in Free and Full Versions

Screen capture of free version limitation dialog

Directed Search:
The Free Version of Image Surfer Pro does not support processing Image Surfer Pro Forms. A Directed Search may only be started from an ISP Form and so is also not supported. This warning will be provided anytime the webpage to be processed is not a direct image link if you are using the free version of Image Surfer Pro.

Screen Capture Examples

Sample screen capture of Directed Search in progress

The Processing Image Surfer Pro Forms examples include performing a limited Directed Search.