7. Trapping Topic-Based Information

Monitoring General Search Engines

General search engines are large, unstructured, and generally chaotic. This makes trapping them very difficult. However, since they are the bulk of the foundations of Internet search, many people make information available through search engines that would otherwise be unavailable through RSS or special search categories. Getting the perfect query and actually trapping from search engines can be difficult though.

Google

Yahoo

Yahoo Directory

Ask

Microsoft Live Search

Numerous other search engines can be monitored, including The Open Directory Project (through the directory method of narrowing and through web page monitors only).

Monitoring News Search Engines

News Search Engines are less necessary to monitor as General Search Engines because they are less large/complex. The following News Search Engines can be monitored:

Hoovers offers saved searches and email alerts, but is expensive. The best overall option would appear to be Google News, which is free and also makes use of both the Email Alerts and RSS trapping options.

Searching Blogs

We have covered the Blog Search options:

All offer RSS feeds.

Keyword-Searchable RSS Feeds

These are feeds based on searches on the query words that you specify. These are much more specific than RSS feeds.

Kebberfegg

Sets up keyword based RSS feeds across many resources – over 3 dozens. You can generate feeds in HTML or OPML for a RSS feed reader.
  1. Enter in the query box the words you want to search
  2. Generate a keyword feed list for one category, or select multiple categories
  3. Choose HTML or OPML
  4. For each result you can add it to My Yahoo, forward it to your email via RSSFwd, or just look at the plain RSS feed

More Sources

Commercial

You can also trap information from commercial websites

Government

You can also trap information from government websites

Questions:

  1. Which general web search engine, Google or Yahoo, offers email alerts?
  2. Explain the general idea behind Ask’s “Should Search” and Microsoft Live’s “Prefer”
  3. Which engines are generally easier to monitor, General Search Engines, or News Search Engines?
  4. Which News Search Engine could generally be considered to offer the widest variety of trapping options?
  5. Name three different blog search options.
  6. Which website would one use to trap information from Amazon search queries?
  7. How would one go about trapping information from an Ebay search query?
  8. Explain the syntax used to access a state’s webpage from your web browser.