INFT 979/CSI 979 Advanced Topics: Data Mining
Syllabus prepared by Dr. Edward J. Wegman



Fall Semester, 1996
Tues: 7:20 - 10:00 p.m.



Basic Description

"Data Mining" has become a buzz-word within the computer industry for extraction of knowledge or information from large databases. Often the databases are financial, but the technology extends to other sorts of information databases. Similar ideas have existed in the statistics community for about 15 years under the name of exploratory data analysis. The convergence of these ideas coupled with recent advances in storage technology and database structures offer an interesting, exciting new technology. The idea of data mining is to look for information which may reside in opportunistically collected databases. A classic example involves a large discount chain sales records database in which it was noted that there was a high correlation between sales of disposable diapers and beer. The conclusion was that many men on their way home would be asked by their wives to pick up disposable diapers. The man decided to buy a six pack of beer while he was stopped anyway. The discount chain moved the beer and snacks such as peanuts and pretzels next to the disposable diapers and increased sales on peanuts and pretzels by more that 27%.

This course will focus on bringing together aspects of the computing, graphical/visualization and statistical technologies into an integrated treatment. This course will deal with data mining, exploratory analysis and knowledge discovery, particularly as they relate to graphical tools. Topics will include foundations, classification and clustering, trend and deviation analysis, dependency derivation, graphical tools, integrated discovery systems, and next generation databases.

Lecture Notes

Lecture notes will be made available as the course progresses. Watch this page for further materials.

Approved Assignments for Fall, 1996

URL's For Data Mining

Dr. Wegman's Powerpoint Presentation on Data Mining

Further Information

The course will be project oriented. There will be collateral reading assignments from the course text

Fayyad, U. M. et al. (1996) Advances in Knowledge Discovery and Data Mining, AAAI Press/MIT Press.

For further information, contact Dr. Edward J. Wegman at 703-993-1691 or by email at ewegman@gmu.edu.