Carnegie Mellon University
15721 Database System Design and Implementation
Spring 2003 - C. Faloutsos
Homework 2 - Due: 4/10

0) Reminders:

0.1) Time estimates

1) R-trees and 'count' queries [100pts]

Use the R-tree package provided, and augment it with counters. The goal is to accelerate 'count' queries, like 'how many galaxies are inside the rectangle (xlow, xhigh, ylow, yhigh, zlow, zhigh)'. Thus, for every R-tree node, make sure you have a counter with the points (or rectangles) in that sub-tree. Of course, you'll need to modify the insertion and deletion routines appropriately, to maintain the correct counts.

1.1) Details

3) What to Turn In

  1.  [70pts] Hard copy: a printout of your source code (you are welcome to give only the parts dealing with 'count queries')
  2.  [30pts] Hard copy: results (= galaxy counts) from your program, applied on several ranges - we'll announce the query ranges later.

For your information (no points for this part)

If you want really large datasets, and/or are interested in astronomy, check the Sloan Digital Sky Survey (SDSS). The project expects to have half a billion galaxies, with their coordinates, spectra, images and much more.


Last modified by Christos Faloutsos, 3/16/2003