Carnegie Mellon University
15-826 Multimedia Databases and Data Mining
Spring 2016 - C. Faloutsos
Midterm Study Guide
Preliminaries - IMPORTANT
- No aids allowed, except
- a standard, 8'' x 11.5'' page with your notes - use both sides, and
- Pocket calculators: strongly recommended (logarithms)
- Photo id: please bring one
- Material to be examined: all lecture foils, and all required papers / book-chapters listed below.
Additional info:
Notice:
Several of the links are internal to CMU.Required text
Recommended text
- Undergraduate DB textbook, for
those who took a db class too long ago:
- Raghu Ramakrishnan, Johannes Gehrke, "Database Management
Systems," McGraw-Hill 2002 (3rd ed).
MATERIAL TO BE EXAMINED
All the material covered, up to and including
the lecture on inversion and signature files. Specifically:
1. Foils:
- From the course schedule, all the
foils, up to and including the lecture of '220_text2.pdf'.
Notice that
the file names for the foils are numbered in increasing order.
- Attention: also included: SQL, B-trees, hashing
- Excluded: all foils marked with the string 'optional' in a yellow diamond.
2. Multimedia Indexing
- Primary key access methods
- Secondary key and spatial access methods
- Jon Louis Bentley,
Multidimensional binary search trees used for associative
searching, Comm. of the ACM (CACM), Volume 18 ,
Issue 9, pp. 509-517, (September 1975)
- A. Guttman
R-Trees: a Dynamic Index Structure for Spatial
Searching, Proc. ACM SIGMOD, June 1984, pp. 47-57, Boston,
Mass.
- J. Orenstein,
Spatial Query Processing in an Object-Oriented Database
System, Proc. ACM SIGMOD, May, 1986, pp. 326-336,
Washington D.C.
- Roberto F. Santos Filho, Agma Traina, Caetano Traina Jr., and
Christos Faloutsos:
Similarity search without tears: the OMNI family of all-purpose
access methods ICDE, Heidelberg, Germany, April 2-6
2001.
- MM-Textbook, chapters 4 and 5.
- Fractals
- Christos Faloutsos and Ibrahim Kamel,
Beyond Uniformity and Independence: Analysis of R-trees Using the
Concept of Fractal Dimension, Proc. ACM
SIGACT-SIGMOD-SIGART PODS, May 1994, pp. 4-13, Minneapolis, MN.
(and
gzipped postscript)
- Alberto Belussi and Christos Faloutsos, Estimating
the Selectivity of Spatial Queries Using the `Correlation' Fractal
Dimension Proc. of VLDB, p. 299-310, 1995 (and
gzipped postscript )
- Power laws, lognormals etc: M. E. J. Newman, Power laws, Pareto distributions and Zipf's law Contemporary Physics 46, 323-351 (2005) (local pdf copy)
- Text and LSI
Last modified: Feb. 17, 2016, by Christos Faloutsos