Notes
Outline
Slide 1
Outline
Speaking Geek: Internet Terminology
A Brief History of the Internet
Net Structure
Email
The Major Search Engines and Directories
Sample Searches
Assignment
Organizing Your Search
Booleans
Searching in Other Places
The Hidden Web
Online Communities
Conclusions and Summary
Speaking Geek: Internet Terminology
ISP
URL
Browser
Home Page
HTTP
FTP
HTML
Internet/internet
Arpanet
Intranet
WWW
Email headers
Encryption
Carnivore
Search Engine/Index
Directory
Spiders
Boolean operators
Strings
Wild cards
Metacrawlers
Portal
Domain name/TLD
Downloads/Uploads
Cache
A Brief History of the Internet
Department of Defense
ARPANET
A “Decentralized Network of Networks”
For use in times of war/crisis
National Science Foundation
The “Backbone”
1993 - Marc Andreessen and Mosaic
1994
Netscape Communication Corp founded…
…and the Web started to become accessible
 The rest is, as they say...
Internet Structure
Networks and Subnetworks
LANs and WANs
Routers
Servers
Hubs and Backbones
A Web Page in a Browser…
What’s Underneath It?
Email
Ubiquitous
“Killer app” of the Web
Variety of email clients
MS Outlook
MS Outlook Express
Eudora
Netscape
Web-based email includes
Yahoo!
Hotmail
How did this mail get here?
An e-mail message travels across the Internet through various computer systems before reaching its final destination.
The path followed is unpredictable, dictated by network traffic at the time and the routers encountered along the way.
Emails contain “headers”
They tell you the route the mail took to get to you.
Email: Like sending a postcard…
Privacy
Encryption
An encryption package consists of data-scrambling technology that allows users to e-mail information across the Internet without misgivings. Through encryption, a message is encoded so that only its intended recipient can decrypt it. This way, if a message is scanned or intercepted, it cannot be read.
The two main forms of e-mail encryption are PGP (short for Pretty Good Privacy) and S/MIME (short for Secure Multipurpose Internet Mail Extensions).
(http://www.gfi.com/me/wpencryption.htm)
Beating the Bad Guys…
Carnivore
According to the FBI: “Carnivore is a computer-based system that is designed to allow the FBI, in cooperation with an Internet Service Provider (ISP), to comply with court orders requiring the collection of certain information about emails or other electronic communications to or from a specific user targeted in an investigation.”
(http://www.robertgraham.com/pubs/carnivore-faq.html)
“Search Engines…”
What we call “Search Engines” actually fall into three categories:
Indexes
Directories
Metacrawlers
These are not pure distinctions
Most “Search Engines” are hybrids…
…and the results contain “commercial content”
Search Engines
“Catalog” the Web
“Spiders” and “Crawlers”
Automated
Examples:
Google
Alta Vista
HotBot
Northern Light
Unindexed…Unseen
Vast tracts of the Web are unindexed
Dynamically-generated pages (DHTML, ASP etc)
Academic and institutional databases (eg Colby-Sawyer library)
Many pages under the same domain name
“Online communities” like Geocities, Tripod
An Unindexed Resource…
Directories
Compiled by humans
Are not comprehensive
“Editorial judgment” implied
Quality vs Quantity
Examples:
Yahoo!
LookSmart
Metacrawlers
Searches sent to several SEs at once
Results appear on one page
Examples:
MetaCrawler
Dogpile
Boolean Operators
“Force” an Engine to conduct a search in a specific way
Simple
AND + Must Include
NOT - Must Not Include
OR ¦ Or
Principles of Searching
Alternate Broad and Deep
Too Broad = ’000s of Results
Too Deep = 0 Results
Use Booleans and Strings
So…
Searching is becoming both easier and more difficult
The amount of information available is increasing exponentially
So is the number of Search Engines and Directories…
Conclusion...
Search Engines: Major Players
Google
http://www.google.com
AltaVista
http://www.altavista.com
HotBot
http://www.hotbot.com
Northern Light
http://www.nlsearch.com
FAST Search
http://www.alltheweb.com
Yahoo!
http://www.yahoo.com
LookSmart
http://www.looksmart.com
Dogpile
http://www.dogpile.com
MetaCrawler
http://www.go2net.com
Search.com
http://www.search.com
Conclusions...
Discussion...
Searching & Sourcing:
Using the Web to Find
the Information You Need
END