Setext Information
This information is stored in setext format and converted to HTML on the fly
(unless stated otherwise). The page serves as a demo of the
Plexus setext -> HTML converter.
Sample Setext Documents
Here are a few articles written by
Tony Sanders in
setext for Usenet postings. The hypertext links in these articles
are valid on the WWW (i.e., they contain valid URLs).
About the setext->HTML filter, and why to use it (see the raw text)
Setext Documents
These are mostly by Ian Feldman <ianf@random.se>, lightly edited
by Tony Sanders to update the hyperlink
format after the spec changed. Note that the hyperlinks in these documents
don't work because they aren't valid URLs; they were only examples of
how linking might work. The sample documents above have valid hyperlinks if
you want to experiment with them.