Web->KB dataset
http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/Web pages partitioned into classes, with hyperlink data. The dataset has been used for text categorization and learning to extract symbolic knowledge from the World Wide Web.
Submitted in section: Artificial Intelligence: Machine Learning: Datasets: Web->KB dataset