Fall 2018: CS6220 Data Mining Techniques,
Section 01, CRN 12896
Lecture time: Tuesdays & Fridays
|
Place: Behrakis Health
Sciences Center, Room 320
|
Instructor: Tina Eliassi-Rad
|
Office hours: Tuesdays 5:30 – 6:30 PM in West Village H (WVH), Room 362
|
TA: Zikun Lin & Hui “Sophie” Wang
|
TAs office hours:
· Zikun Lin:
Wednesdays 3:00 – 4:30 PM in Nightingale Hall, Room 132i
· Sophie Wang: Thursdays
4:00 – 5:30 PM in Ryder Hall, Room 273
|
This 4-credit graduate-level course covers data mining and unsupervised learning. Its prerequisites are:
This course does not have a designated textbook. The readings are assigned in the syllabus (see below). Here are some textbooks (all optional) related to the course.
Lec # |
Date |
Topic |
Readings &
Notes |
1 |
F 9/7 |
Introduction and Overview |
o Chapter
1 of http://eliassi.org/mmds-book-v2L.pdf
|
2 |
T 9/11 |
Review of Linear
Algebra (Guest lecturer: |
o Linear Algebra Tutorial (C.T. Abdallah, Penn) o Linear Algebra Review and Reference (Zico Kolter and Chuong Do, Stanford) o Probability Review (David Blei, Princeton) o Probability Theory Review (Arian Maleki and Tom Do, Stanford) |
3 |
F 9/14 |
Social Bots (Guest lecturer: |
o https://arxiv.org/abs/1407.5225 |
4 |
T 9/18 |
Frequent Itemsets & Association Rules |
o Chapter
6 of http://eliassi.org/mmds-book-v2L.pdf o Optional:
Sections 6.1-6.6 of http://www-users.cs.umn.edu/~kumar/dmbook/ch6.pdf
|
5 |
F 9/21 |
Frequent Itemsets & Association Rules |
o Chapter
6 of http://eliassi.org/mmds-book-v2L.pdf o Optional:
Sections 6.1-6.6 of http://www-users.cs.umn.edu/~kumar/dmbook/ch6.pdf
|
6 |
T 9/25 |
Density Estimation |
o http://ned.ipac.caltech.edu/level5/March02/Silverman/Silver_contents.html
o http://eliassi.org/Sheather_StatSci_2004.pdf
o Optional:
Sections 6.6-6.9 of http://statweb.stanford.edu/~tibs/ElemStatLearn/printings/ESLII_print10.pdf
|
Homework #1 o out
on Tuesday September 25 o
due on Friday October 5 at 11:59
PM Eastern o grade
by Friday October 19 |
|||
7 |
F 9/28 |
Finding Similar Items |
o Chapter
3 of http://eliassi.org/mmds-book-v2L.pdf |
8 9 |
T 10/2 F 10/5 |
Mining Data Streams |
o Chapter
4 of http://eliassi.org/mmds-book-v2L.pdf |
10 11 |
T 10/9 F 10/12 |
Dimensionality
Reduction (PCA, SVD, CUR, |
o Chapter
11 of http://eliassi.org/mmds-book-v2L.pdf o
Section 14.5 of http://statweb.stanford.edu/~tibs/ElemStatLearn/printings/ESLII_print10.pdf
|
Homework #2 o out
on Friday October 12 o due
on Tuesday October 23 at 11:59 PM Eastern o grade
by Tuesday November 6 |
|||
12 13 |
T 10/16 F 10/19 |
Clustering: |
o Chapter
9 of http://robotics.stanford.edu/~nilsson/MLBOOK.pdf
o Sections
7.1-7.3 of http://eliassi.org/mmds-book-v2L.pdf o Chapter
8 of https://www.cs.cornell.edu/jeh/book2016June9.pdf o Section
14.3 of http://statweb.stanford.edu/~tibs/ElemStatLearn o http://cs229.stanford.edu/notes/cs229-notes7b.pdf o http://cs229.stanford.edu/notes/cs229-notes8.pdf o Optional:
https://www.cs.rutgers.edu/~mlittman/courses/lightai03/jain99data.pdf
o Optional:
http://web.itu.edu.tr/sgunduz/courses/verimaden/paper/validity_survey.pdf o Optional:
http://www.dbs.ifi.lmu.de/Publikationen/Papers/KDD-96.final.frame.pdf
|
14 |
F 10/23 |
EM, K-mediods, Hierarchical Clustering, Evaluation Metrics and Practical Issues |
o Same
readings as for 10/16 & 10/19 |
15 |
T 10/26 |
Spectral Clustering |
o http://ai.stanford.edu/~ang/papers/nips01-spectral.pdf o http://www.cs.columbia.edu/~jebara/4772/papers/Luxburg07_tutorial.pdf |
16 |
T 10/30 |
Midterm Exam |
Grade by Tuesday November 13 |
17 |
F 11/2 |
Project Proposal Pitches |
|
18 |
T 11/6 |
Link Analysis |
o Chapter
5 of http://eliassi.org/mmds-book-v2L.pdf o Optional:
http://bit.ly/2iYxo82 |
19 |
F 11/9 |
Recommendation Systems |
o Chapter
9 of http://eliassi.org/mmds-book-v2L.pdf |
20 |
T 11/13 |
Recommendation Systems |
o Chapter
9 of http://eliassi.org/mmds-book-v2L.pdf |
21 |
F 11/16 |
Matrix Factorization |
o Chapter
14.6 of http://statweb.stanford.edu/~tibs/ElemStatLearn/ o
http://papers.nips.cc/paper/1861-algorithms-for-non-negative-matrix-factorization.pdf
o Optional:
http://www.sandia.gov/~tgkolda/pubs/pubfiles/TensorReview.pdf
|
Homework #3 o out
on Friday November 16 o
due on Tuesday November 27 at
11:59 PM Eastern o grade
by Tuesday December 11 |
|||
22 |
T 11/20 |
Topic Models |
o http://www.cs.columbia.edu/~blei/papers/Blei2012.pdf
o http://www.cs.columbia.edu/~blei/papers/Blei2011.pdf
o http://www.cs.columbia.edu/~blei/papers/BleiLafferty2009.pdf
|
23 |
F 11/23 |
Thanksgiving Break --- no class --- |
|
24 |
T 11/27 |
Hidden Markov Models & Review for Final |
|
25 |
F 11/30 |
Final Exam |
o Grade
by Friday December 14 |
26 |
T 12/4 |
Project Presentations: Group 1 |
|
27 |
F 12/7 |
Project Presentations: Group 2 |
Last class day |
Project reports o due
on Tuesday December 11 at 11:59 PM o grade
by Sunday December 16 |
|||
Final grades are due to the Registrar Office on Monday December 17 at 2:00 PM Eastern. |
A |
93-100 |
A- |
90-92 |
B+ |
87-89 |
B |
83-86 |
B- |
80-82 |
C+ |
77-79 |
C |
73-76 |
C- |
70-72 |
F |
< 70 |