Carnegie Mellon University 
15-415A - Database Applications 
C. Faloutsos - Fall 2002

Homework 7 - Due: 12/05

Reminders - Notes

1) Spatial Joins in an R-Tree - Introduction

You are given a dataset of galaxies from the Sloan Digital Sky Survey, where each galaxy is represented by an object-id and its (x, y, z) coordinates. Insert the galaxies into an R-Tree. Then implement the spatial join, and report the count of qualifying pairs of galaxies for each of the ranges specified below.
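As a sanity check for your join, the following minimal Python sketch may help. It is not the required R-Tree implementation. It assumes that a pair "qualifies" when the Euclidean distance between the two galaxies is at most the range, that pairs are unordered, and that a galaxy is never paired with itself; check these assumptions against the details of the assignment. The function names (mbr_min_dist, count_pairs_brute_force) and the tuple layout are hypothetical, chosen only for illustration. The first function shows the kind of pruning test an R-Tree join typically relies on; the second is a brute-force count, usable only on small samples, against which you can compare your results.

```python
from itertools import combinations
from math import dist, sqrt


def mbr_min_dist(lo1, hi1, lo2, hi2):
    """Smallest possible distance between two axis-aligned boxes.

    Each box is given by its low and high corner, e.g. (x, y, z) tuples.
    If this value exceeds the join range, no point in the first box can
    pair with any point in the second, so the two subtrees can be skipped.
    """
    total = 0.0
    for a_lo, a_hi, b_lo, b_hi in zip(lo1, hi1, lo2, hi2):
        # Gap along this axis; zero when the two intervals overlap.
        gap = max(0.0, b_lo - a_hi, a_lo - b_hi)
        total += gap * gap
    return sqrt(total)


def count_pairs_brute_force(galaxies, eps):
    """Count unordered pairs of distinct galaxies within distance eps.

    galaxies: list of (object_id, x, y, z) tuples (assumed layout).
    Runs in O(n^2) time, so use it only on a small sample.
    """
    count = 0
    for a, b in combinations(galaxies, 2):
        # a[1:] and b[1:] are the (x, y, z) coordinates.
        if dist(a[1:], b[1:]) <= eps:
            count += 1
    return count


if __name__ == "__main__":
    # Made-up sample points, NOT taken from the homework dataset.
    sample = [
        (1, 0.0, 0.0, 0.0),
        (2, 0.0, 0.0, 0.0),
        (3, 5e-6, 0.0, 0.0),
        (4, 1.0, 1.0, 1.0),
    ]
    for eps in (0.0, 1e-9, 1e-5):
        print(eps, count_pairs_brute_force(sample, eps))
```

Note that, under these assumptions, a range of 0.00 counts only pairs of galaxies with identical coordinates.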

2) Details

3) What to Turn In

  1. Hard copy: a printout of ONLY the part of your source code dealing with spatial joins.
  2. Hard copy: the results (= pair-counts) your program reports when running the spatial join at each of the following ranges:
    1. 0.00
    2. 1E-09
    3. 1E-05
    4. 2E-05
    5. 3E-05

For your information (no points for this part)

If you are interested in astronomy, check the web site for the Sloan Digital Sky Survey, which expects to collect 0.5 billion galaxies, and which actually has a relational DBMS underneath (MS SQL Server) to help astronomers collect the data. Also, the pair-count of galaxies is an important measure, which gives astronomers an idea about the intrinsic ('fractal') dimensionality of the galaxies.


Last updated: 11/18/2002, by Christos and Deepayan