COMP 630030 Data Intensive Computing [Fall 2015]

Meeting Time: Friday: 8:55am -- 11:00am
Meeting Room: Zhangjiang Branch, Room Z2209A

Course Description

Data Intensive Computing course will explore theoretical and practical aspects of working with massive volumes of data. We study the current scalable architectures, methods, tools, programming languages and applications and learn design principles for building large clusters to support data intensive computing.

Textbook:
           Mining of Massive Datasets, Anand Rajaraman and Jeff Ullman.
           Data-Intensive Text Processing with MapReduce, Jimmy Lin and Chris Dyer
           The Art of Concurrency - A Thread Monkey's Guide to Writing Parallel Applications, O'Reilly Media, 2009.
           CUDA by Example: An Introduction to General-Purpose GPU Programming, Jason Sanders, Edward Kandrot.

Useful Links:
           nVidia CUDA Programming Guide
           CUDA BBS
           Hadoop
           PASI training Boston University
           Course on CUDA Programming on NVIDIA GPUs, by Mike Giles
           Hot Topics in Distributed Systems: Data-Intensive Computing, by Ioan Raicu
           Cloud Computing Courses, by Jimmy Lin
           High Performance Computing with CUDA, by Ying Liu

Schedule (Subject to revision)

No. Date Agenda
1 09/15/2015  Course Organization; [slides]
 Lecture 1 - What is Cloud Computing [slides]
2 09/22/2015  Lecture 2 - The Art of Concurrency [slides]
3 09/29/2015  Lecture 3 - MapReduce Introduction [slides]
4 10/06/2015  Lecture 4 - MapReduce Text Retrieval & LM [slides]
5 10/13/2015  Lecture 5 - MapReduce Graph Algorithms & databases [slides]
6 10/20/2015  Lecture 6 - CUDA Introduction [slides]
7 10/27/2015  Lecture 7 - CUDA Memory & Threads [slides]
8 11/03/2015  Lecture 8 - CUDA Efficient Shared Memory Use &Optimization [slides]