Guide to sample web scrapers


Web-extract-number

This sample is configured to scrape stock quotes, but can be modified (easily) to scrape labeled numbers from other web pages by changing the Label and the input URL list.

Installs with DTBuild.


Web-extract-title-header

Extracts two items from each web page in the input URL list :

The "rules" in the HeaderTag list determine which headers are extracted.   These rules work well for the news headlines sites specified in the sample input URL list, but will probably require modification for other sites.

Installs with DTBuild.


Census-01

U.S. Census Bureau table scraper:  state populations + areas

Extracts selected fields from a single table on the U.S. Census Bureau website and puts the extracted data in a database.

To run this sample :

See the HTML table parser example for more information.

Installs separately:  dtutilities-census-01-i.exe.   Requires MS Access 2000 or later.


Census-02

U.S. Census Bureau table scraper:  zip code data

Extracts selected fields from multiple zip code tables on the U.S. Census Bureau website.   Just specify the zip codes you're interested in and press the Grab button.

To run this sample :

This sample demonstrates the use of SQL to convert the user-entered zip codes into a source URL table which is scanned to produce the desired output.

Installs separately:  dtutilities-census-02-i.exe.   Requires MS Access 2000 or later.



© 2002-2010 DTUtilities