ViNTs: Visual information aNd Tag structure based wrapper generator

 

 Introduction

 Test bed

 Demo 1

 Demo 2

 Contacts

Demo:

ViNTs a tool for automatically producing the wrappers for any given search engines. This demo consists of two steps, the first step is for wrapper building, and the second step is for result extraction. There are some constraints for using this demo.

Step 1. Wrapper Building:

To build the wrapper for a search engine, please upload sample result pages you collected from the search engine. The interface here supports up to six sample result pages, including one special page called No Result Page. In general, the more sample result pages you provide, the better the quality of the wrapper will be, and, providing a No Result Page will also improve the quality. Since the sample page collection is a tedious process, we have a demo 2, which needs only a search engine URL and a query string.

Sample page 1
Sample page 2
Sample page 3
Sample page 4
Sample page 5
Provide a No Result Page below will improve the performance:
No Result Page


You can click the button "Run Sample" below to view several sample result pages from Google and the extracted search result records.

 


Constraints:

  • Do not mix sample result pages from different search engines in one upload operation.
  • Please ensure the sample result pages contain enough search result records. In general, the more search result records on the sample page, the better the quality of the wrapper.
  • Do not test search engines that organize their search results in a frame. In such case, the sample result page is the frame set page, which doesn’t contain the search result records. We will consider how to deal with frame pages in the near future.
  • If a search engine organizes the search result records in different sections, the demo system will most likely extract the results from the biggest section only. Our test bed consists of search engines that organize their search result records in one section only.
  • Occasionally, the sample result pages returned by some search engines can’t be correctly rendered by the third party browser component we used in the demo, our system will try to report such a problem.

Sample result page:

        A sample result page is the web page returned to you by a search engine in responding to a query you sent to the search engine. To collect a sample result page, enter a query to the search engine and let the search engine search for you. Then save the returned page as a sample result page. Try to save the HTML source code of the result page only because some pages can’t be saved if you want to save the web page completely.

No Result Page:

        It is a special sample result page that contains no search result record at all. It can be obtained by submitting a non-existent word (such as: “ThisIsANoResultQueryTerm”, or “kasljfdiowe14241askdjfi3”) as a query to the search engine.

 

Back to the top.