More about HKUST
CROSS-MATCHING BIG ASTRONOMIC CATALOGS ON HETEROGENEOUS CLUSTERS
PhD Thesis Proposal Defence
Title: "CROSS-MATCHING BIG ASTRONOMIC CATALOGS ON HETEROGENEOUS CLUSTERS"
by
Miss Xiaoying JIA
Abstract:
In astronomy, cross-match is a central operation to integrate multi-wavelength
information by identifying celestial objects across multiple catalogs. With the
rapid increase in data volume from space and ground-based surveys, it becomes
crucial to process large astronomic catalogs efficiently. In this thesis
proposal, we study how to accelerate the cross-match of billion-record catalogs
on a cluster of computers with both CPUs and GPUs. Two critical factors are
discussed in this proposal: (1) the choice of a suitable indexing method that
supports efficient operations on GPU; (2) cross-match algorithms with design
choices and optimizations targeting to the multi-node cluster environment. We
present two cross-match algorithms, namely IB-CM and MASJ-CM, both of which
work as follows: First, the positional cross-matching objects from astronomic
catalogs is essentially a spatial distance join on two sets of points. Second,
the query circle for each reference point overlaps a small set of cells under a
partitioning scheme. Specifically, IB-CM follows a filter-and-refine approach
to directly filter out most unlikely sample points, which fall out of the
overlapping cells. MASJ-CM performs the cross-match by further replicating
reference candidate objects for each sample object for matching. Our
evaluations show that: (1) HEALPix was the best indexing method for cross-match
tasks; (2) IB-CM outperformed MASJ-CM for cross-matching small scale catalogs
on a single node, whereas MASJ-CM won on billion-record catalogs on a
multi-node cluster; (3) self-match of a billion-record catalogs was completed
under 4 minutes with MASJ-CM on a six-node cluster.
Date: Thursday, 27 April 2017
Time: 10:00am - 12:00noon
Venue: Room 1505
(lifts 25/26)
Committee Members: Dr. Qiong Luo (Supervisor)
Dr. Wei Wang (Chairperson)
Prof. Lei Chen
Dr. Ke Yi
**** ALL are Welcome ****